[go: up one dir, main page]

Follow
Stephen Roller
Stephen Roller
Thinking Machines
Verified email at thinkingmachines.ai - Homepage
Title
Cited by
Cited by
Year
Opt: Open pre-trained transformer language models
S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ...
arXiv preprint arXiv:2205.01068, 2022
5251*2022
Recipes for building an open-domain chatbot
S Roller, E Dinan, N Goyal, D Ju, M Williamson, Y Liu, J Xu, M Ott, ...
Proceedings of the 16th conference of the european chapter of the …, 2021
13452021
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261, 2025
13372025
Wizard of wikipedia: Knowledge-powered conversational agents
E Dinan, S Roller, K Shuster, A Fan, M Auli, J Weston
arXiv preprint arXiv:1811.01241, 2018
11582018
Neural text generation with unlikelihood training
S Welleck, I Kulikov, S Roller, E Dinan, K Cho, J Weston
arXiv preprint arXiv:1908.04319, 2019
7222019
Human-level play in the game of Diplomacy by combining language models with strategic reasoning
Meta Fundamental AI Research Diplomacy Team (FAIR)†, A Bakhtin, ...
Science 378 (6624), 1067-1074, 2022
4212022
Blenderbot 3: a deployed conversational agent that continually learns to responsibly engage
K Shuster, J Xu, M Komeili, D Ju, EM Smith, S Roller, M Ung, M Chen, ...
arXiv preprint arXiv:2208.03188, 2022
3762022
Supervised Text-based Geolocation Using Language Models on an Adaptive Grid
S Roller, M Speriosu, S Rallapalli, B Wing, J Baldridge
3012012
What makes a good conversation? how controllable attributes affect human judgments
A See, S Roller, D Kiela, J Weston
arXiv preprint arXiv:1902.08654, 2019
2972019
Hash layers for large sparse models
S Roller, S Sukhbaatar, J Weston
advances in neural information processing systems 34, 17555-17566, 2021
2642021
Inclusive yet selective: Supervised distributional hypernymy detection
S Roller, K Erk, G Boleda
Proceedings of COLING 2014, the 25th international conference on …, 2014
2382014
Don’t say that! making inconsistent dialogue unlikely with unlikelihood training
M Li, S Roller, I Kulikov, S Welleck, YL Boureau, K Cho, J Weston
Proceedings of the 58th annual meeting of the association for computational …, 2020
2302020
Acute-eval: Improved dialogue evaluation with optimized questions and multi-turn comparisons
M Li, J Weston, S Roller
arXiv preprint arXiv:1909.03087, 2019
1912019
Hearst patterns revisited: Automatic hypernym detection from large text corpora
S Roller, D Kiela, M Nickel
arXiv preprint arXiv:1806.03191, 2018
1842018
Scaling laws for generative mixed-modal language models
A Aghajanyan, L Yu, A Conneau, WN Hsu, K Hambardzumyan, S Zhang, ...
International Conference on Machine Learning, 265-279, 2023
1582023
MGNC-CNN: A simple approach to exploiting multiple word embeddings for sentence classification
Y Zhang, S Roller, BC Wallace
Proceedings of the 2016 Conference of the North American Chapter of the …, 2016
1522016
Language models that seek for knowledge: Modular search & generation for dialogue and prompt completion
K Shuster, M Komeili, L Adolphs, S Roller, A Szlam, J Weston
arXiv preprint arXiv:2203.13224, 2022
1432022
A multimodal LDA model integrating textual, cognitive and visual modalities
S Roller, SS Im Walde
Proceedings of the 2013 Conference on Empirical Methods in Natural Language …, 2013
1252013
Inferring concept hierarchies from text corpora via hyperbolic embeddings
M Le, S Roller, L Papaxanthos, D Kiela, M Nickel
Proceedings of the 57th annual meeting of the association for computational …, 2019
1072019
Adding chit-chat to enhance task-oriented dialogues
K Sun, S Moon, PA Crook, S Roller, B Silvert, B Liu, Z Wang, H Liu, E Cho, ...
Proceedings of the 2021 conference of the North American chapter of the …, 2021
972021
The system can't perform the operation now. Try again later.
Articles 1–20