Newman et al., 2021 - Google Patents

Refining targeted syntactic evaluation of language models

Newman et al., 2021

Document ID: 9701242770026513623
Author: Newman B; Ang K; Gong J; Hewitt J
Publication year: 2021
Publication venue: arXiv preprint arXiv:2104.09635

External Links

Cited by

Snippet

Targeted syntactic evaluation of subject-verb number agreement in English (TSE) evaluates language models' syntactic knowledge using hand-crafted minimal pairs of sentences that differ only in the main verb's conjugation. The method evaluates whether language models …

Continue reading at arxiv.org (PDF) (other versions)

238000011156 evaluation 0 title abstract description 16

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models

Similar Documents

Publication	Publication Date	Title
Newman et al.	2021	Refining targeted syntactic evaluation of language models
Hendrycks et al.	2021	What would jiminy cricket do? towards agents that behave morally
Guo et al.	2018	Soft layer-specific multi-task summarization with entailment and question generation
Van Stegeren et al.	2021	Fine-tuning GPT-2 on annotated RPG quests for NPC dialogue generation
RU2451999C2 (en)	2012-05-27	Optimisation of fact extraction using multi-stage approach
Ullman	1999	Acceptability ratings of regular and irregular past-tense forms: Evidence for a dual-system model of language from word frequency and phonological neighbourhood effects
Jurgens et al.	2014	It’s all fun and games until someone annotates: Video games with a purpose for linguistic annotation
Ulmer et al.	2024	Bootstrapping llm-based task-oriented dialogue agents via self-talk
Murdock	2006	Aspects of sentence retrieval
Baughman et al.	2013	Deepqa jeopardy! gamification: a machine-learning perspective
Swirski	2013	From literature to biterature: Lem, Turing, Darwin, and explorations in computer literature, philosophy of mind, and cultural evolution
Liu et al.	2023	Visual storytelling with question-answer plans
Dingare	2001	The effect of feature hierarchies on frequencies of passivization in English
DE112021000598T5 (en)	2022-12-01	EXPANDABLE DICTIONARY FOR GAME EVENTS
Dziedzic	2016	Use of the Free to Play model in games with a purpose: the RoboCorp game case study
CN113393063A (en)	2021-09-14	Match result prediction method, system, program product and storage medium
Wang et al.	2021	From eSports Data to Game Commentary: Datasets, Models, and Evaluation Metrics
Agnew et al.	2023	The Mechanical Bard: An Interpretable Machine Learning Approach to Shakespearean Sonnet Generation
Sundström	2020	How not to write a thesis or dissertation: a guide to success through failure
Greene	2023	Writing with Style: The Economist Guide
Forgács	2004	Grammaticalisation and preverbs
Grace	2014	A linguistic analysis of mobile games: Verbs and nouns for content estimation
Guo	2018	Human-Earth System Dynamics: Implications to Civilizations
Hung et al.	2022	Construction and research of e-sports speech emotion recognition model
Adams	1885	The Greek Prepositions: Studied from Their Original Meanings as Designations of Space