| Embedding entities and relations for learning and inference in knowledge bases B Yang, W Yih, X He, J Gao, L Deng arXiv preprint arXiv:1412.6575, 2014 | 4921 | 2014 |
| Deberta: Decoding-enhanced bert with disentangled attention P He, X Liu, J Gao, W Chen arXiv preprint arXiv:2006.03654, 2020 | 4380 | 2020 |
| Ms marco: A human-generated machine reading comprehension dataset T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, R Majumder, L Deng | 3209* | 2016 |
| Domain-specific language model pretraining for biomedical natural language processing Y Gu, R Tinn, H Cheng, M Lucas, N Usuyama, X Liu, T Naumann, J Gao, ... ACM Transactions on Computing for Healthcare (HEALTH) 3 (1), 1-23, 2021 | 3105 | 2021 |
| A diversity-promoting objective function for neural conversation models J Li, M Galley, C Brockett, J Gao, WB Dolan Proceedings of the 2016 conference of the North American chapter of the …, 2016 | 3092 | 2016 |
| On the variance of the adaptive learning rate and beyond L Liu, H Jiang, P He, W Chen, X Liu, J Gao, J Han arXiv preprint arXiv:1908.03265, 2019 | 2907 | 2019 |
| Ms-celeb-1m: A dataset and benchmark for large-scale face recognition Y Guo, L Zhang, Y Hu, X He, J Gao European conference on computer vision, 87-102, 2016 | 2805 | 2016 |
| Learning deep structured semantic models for web search using clickthrough data PS Huang, X He, J Gao, L Deng, A Acero, L Heck Proceedings of the 22nd ACM international conference on Information …, 2013 | 2718 | 2013 |
| Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks X Li, X Yin, C Li, P Zhang, X Hu, L Zhang, L Wang, H Hu, L Dong, F Wei, ... European conference on computer vision, 121-137, 2020 | 2652 | 2020 |
| Stacked attention networks for image question answering Z Yang, X He, J Gao, L Deng, A Smola Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 2544 | 2016 |
| Deep learning--based text classification: a comprehensive review S Minaee, N Kalchbrenner, E Cambria, N Nikzad, M Chenaghlu, J Gao ACM computing surveys (CSUR) 54 (3), 1-40, 2021 | 2493 | 2021 |
| Piqa: Reasoning about physical commonsense in natural language Y Bisk, R Zellers, J Gao, Y Choi Proceedings of the AAAI conference on artificial intelligence 34 (05), 7432-7439, 2020 | 2470 | 2020 |
| Unified language model pre-training for natural language understanding and generation L Dong, N Yang, W Wang, F Wei, X Liu, Y Wang, J Gao, M Zhou, HW Hon Advances in neural information processing systems 32, 2019 | 2118 | 2019 |
| Dialogpt: Large-scale generative pre-training for conversational response generation Y Zhang, S Sun, M Galley, YC Chen, C Brockett, X Gao, J Gao, J Liu, ... Proceedings of the 58th annual meeting of the association for computational …, 2020 | 1989 | 2020 |
| Grounded language-image pre-training LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 1826 | 2022 |
| Deep reinforcement learning for dialogue generation J Li, W Monroe, A Ritter, D Jurafsky, M Galley, J Gao Proceedings of the 2016 conference on empirical methods in natural language …, 2016 | 1801 | 2016 |
| Large language models: A survey S Minaee, T Mikolov, N Nikzad, M Chenaghlu, R Socher, X Amatriain, ... arXiv preprint arXiv:2402.06196, 2024 | 1778 | 2024 |
| From captions to visual concepts and back H Fang, S Gupta, F Iandola, RK Srivastava, L Deng, P Dollár, J Gao, X He, ... Proceedings of the IEEE conference on computer vision and pattern …, 2015 | 1768 | 2015 |
| Debertav3: Improving deberta using electra-style pre-training with gradient-disentangled embedding sharing P He, J Gao, W Chen arXiv preprint arXiv:2111.09543, 2021 | 1745 | 2021 |
| A latent semantic model with convolutional-pooling structure for information retrieval Y Shen, X He, J Gao, L Deng, G Mesnil Proceedings of the 23rd ACM international conference on conference on …, 2014 | 1710* | 2014 |