[go: up one dir, main page]

Follow
Benoît Sagot
Benoît Sagot
Directeur de recherches at Inria, head of the ALMAnaCH team
Verified email at inria.fr - Homepage
Title
Cited by
Cited by
Year
Bloom: A 176b-parameter open-access multilingual language model
BS Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ...
arXiv preprint arXiv:2211.05100, 2022
23172022
What does BERT learn about the structure of language?
G Jawahar, B Sagot, D Seddah
57th Annual Meeting of the Association for Computational Linguistics (ACL …, 2019
19562019
CamemBERT: a Tasty French Language Model
L Martin, B Muller, PJ Ortiz Suárez, Y Dupont, L Romary, ...
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
15412020
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
PJ Ortiz Suárez, B Sagot, L Romary
Challenges in the Management of Large Corpora (CMLC-7) 2019, 9, 2019
606*2019
The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French
B Sagot
LREC 2010, 2010
320*2010
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
PJ Ortiz Suárez, L Romary, B Sagot
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
3142020
Building a free French wordnet from multilingual resources
B Sagot, D Fišer
Ontolex 2008, 2008
2662008
Quality at a glance: An audit of web-crawled multilingual datasets
J Kreutzer, I Caswell, L Wang, A Wahab, D Van Esch, N Ulzii-Orshikh, ...
Transactions of the Association for Computational Linguistics 10, 50-72, 2022
2642022
Towards a cleaner document-oriented multilingual crawled corpus
J Abadji, PO Suarez, L Romary, B Sagot
arXiv preprint arXiv:2201.06642, 2022
2342022
Controllable sentence simplification
L Martin, ÉV De La Clergerie, B Sagot, A Bordes
Proceedings of the twelfth language resources and evaluation conference …, 2020
2162020
Between words and characters: A brief history of open-vocabulary modeling and tokenization in nlp
SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ...
arXiv preprint arXiv:2112.10508, 2021
2152021
ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations
F Alva-Manchego, L Martin, A Bordes, C Scarton, B Sagot, L Specia
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
2102020
MUSS: Multilingual unsupervised sentence simplification by mining paraphrases
L Martin, A Fan, ÉV De La Clergerie, A Bordes, B Sagot
Proceedings of the thirteenth language resources and evaluation conference …, 2022
202*2022
When being unseen from mBERT is just the beginning: Handling new languages with multilingual language models
B Muller, A Anastasopoulos, B Sagot, D Seddah
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
1872021
Coupling an annotated corpus and a morphosyntactic lexicon for state-of-the-art POS tagging with less human effort
P Denis, B Sagot
PACLIC 2009, 2009
1772009
Generative Spoken Dialogue Language Modeling
TA Nguyen, E Kharitonov, J Copet, Y Adi, WN Hsu, A Elkahky, ...
arXiv preprint arXiv:2203.16502, 2022
1652022
Universal dependencies 2.5
D Zeman, J Nivre, et al.
LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied …, 2020
1602020
The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
B Sagot, L Clément, E de La Clergerie, P Boullier
LREC 2006, 2006
1202006
SpiRit-LM: Interleaved Spoken and Written Language Model
TA Nguyen, B Muller, B Yu, MR Costa-Jussa, M Elbayad, S Popuri, ...
Transactions of the Association for Computational Linguistics 13, 30-52, 2025
1172025
Influence of pre-annotation on POS-tagged corpus development
K Fort, B Sagot
The fourth ACL linguistic annotation workshop, 56-63, 2010
1132010
The system can't perform the operation now. Try again later.
Articles 1–20