Newman et al., 2021 - Google Patents
Refining targeted syntactic evaluation of language modelsNewman et al., 2021
View PDF- Document ID
- 9701242770026513623
- Author
- Newman B
- Ang K
- Gong J
- Hewitt J
- Publication year
- Publication venue
- arXiv preprint arXiv:2104.09635
External Links
Snippet
Targeted syntactic evaluation of subject-verb number agreement in English (TSE) evaluates language models' syntactic knowledge using hand-crafted minimal pairs of sentences that differ only in the main verb's conjugation. The method evaluates whether language models …
- 238000011156 evaluation 0 title abstract description 16
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Newman et al. | Refining targeted syntactic evaluation of language models | |
| Hendrycks et al. | What would jiminy cricket do? towards agents that behave morally | |
| Guo et al. | Soft layer-specific multi-task summarization with entailment and question generation | |
| Van Stegeren et al. | Fine-tuning GPT-2 on annotated RPG quests for NPC dialogue generation | |
| RU2451999C2 (en) | Optimisation of fact extraction using multi-stage approach | |
| Ullman | Acceptability ratings of regular and irregular past-tense forms: Evidence for a dual-system model of language from word frequency and phonological neighbourhood effects | |
| Jurgens et al. | It’s all fun and games until someone annotates: Video games with a purpose for linguistic annotation | |
| Ulmer et al. | Bootstrapping llm-based task-oriented dialogue agents via self-talk | |
| Murdock | Aspects of sentence retrieval | |
| Baughman et al. | Deepqa jeopardy! gamification: a machine-learning perspective | |
| Swirski | From literature to biterature: Lem, Turing, Darwin, and explorations in computer literature, philosophy of mind, and cultural evolution | |
| Liu et al. | Visual storytelling with question-answer plans | |
| Dingare | The effect of feature hierarchies on frequencies of passivization in English | |
| DE112021000598T5 (en) | EXPANDABLE DICTIONARY FOR GAME EVENTS | |
| Dziedzic | Use of the Free to Play model in games with a purpose: the RoboCorp game case study | |
| CN113393063A (en) | Match result prediction method, system, program product and storage medium | |
| Wang et al. | From eSports Data to Game Commentary: Datasets, Models, and Evaluation Metrics | |
| Agnew et al. | The Mechanical Bard: An Interpretable Machine Learning Approach to Shakespearean Sonnet Generation | |
| Sundström | How not to write a thesis or dissertation: a guide to success through failure | |
| Greene | Writing with Style: The Economist Guide | |
| Forgács | Grammaticalisation and preverbs | |
| Grace | A linguistic analysis of mobile games: Verbs and nouns for content estimation | |
| Guo | Human-Earth System Dynamics: Implications to Civilizations | |
| Hung et al. | Construction and research of e-sports speech emotion recognition model | |
| Adams | The Greek Prepositions: Studied from Their Original Meanings as Designations of Space |