[go: up one dir, main page]

Newman et al., 2021 - Google Patents

Refining targeted syntactic evaluation of language models

Newman et al., 2021

View PDF
Document ID
9701242770026513623
Author
Newman B
Ang K
Gong J
Hewitt J
Publication year
Publication venue
arXiv preprint arXiv:2104.09635

External Links

Snippet

Targeted syntactic evaluation of subject-verb number agreement in English (TSE) evaluates language models' syntactic knowledge using hand-crafted minimal pairs of sentences that differ only in the main verb's conjugation. The method evaluates whether language models …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models

Similar Documents

Publication Publication Date Title
Newman et al. Refining targeted syntactic evaluation of language models
Hendrycks et al. What would jiminy cricket do? towards agents that behave morally
Guo et al. Soft layer-specific multi-task summarization with entailment and question generation
Van Stegeren et al. Fine-tuning GPT-2 on annotated RPG quests for NPC dialogue generation
RU2451999C2 (en) Optimisation of fact extraction using multi-stage approach
Ullman Acceptability ratings of regular and irregular past-tense forms: Evidence for a dual-system model of language from word frequency and phonological neighbourhood effects
Jurgens et al. It’s all fun and games until someone annotates: Video games with a purpose for linguistic annotation
Ulmer et al. Bootstrapping llm-based task-oriented dialogue agents via self-talk
Murdock Aspects of sentence retrieval
Baughman et al. Deepqa jeopardy! gamification: a machine-learning perspective
Swirski From literature to biterature: Lem, Turing, Darwin, and explorations in computer literature, philosophy of mind, and cultural evolution
Liu et al. Visual storytelling with question-answer plans
Dingare The effect of feature hierarchies on frequencies of passivization in English
DE112021000598T5 (en) EXPANDABLE DICTIONARY FOR GAME EVENTS
Dziedzic Use of the Free to Play model in games with a purpose: the RoboCorp game case study
CN113393063A (en) Match result prediction method, system, program product and storage medium
Wang et al. From eSports Data to Game Commentary: Datasets, Models, and Evaluation Metrics
Agnew et al. The Mechanical Bard: An Interpretable Machine Learning Approach to Shakespearean Sonnet Generation
Sundström How not to write a thesis or dissertation: a guide to success through failure
Greene Writing with Style: The Economist Guide
Forgács Grammaticalisation and preverbs
Grace A linguistic analysis of mobile games: Verbs and nouns for content estimation
Guo Human-Earth System Dynamics: Implications to Civilizations
Hung et al. Construction and research of e-sports speech emotion recognition model
Adams The Greek Prepositions: Studied from Their Original Meanings as Designations of Space