| ConvLab-2: An open-source toolkit for building, evaluating, and diagnosing dialogue systems Q Zhu, Z Zhang, Y Fang, X Li, R Takanobu, J Li, B Peng, J Gao, X Zhu, ... arXiv preprint arXiv:2002.04793, 2020 | 151 | 2020 |
| ConvLab: Multi-domain end-to-end dialog system platform S Lee, Q Zhu, R Takanobu, Z Zhang, Y Zhang, X Li, J Li, B Peng, X Li, ... Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019 | 125 | 2019 |
| Not all layers of LLMs are necessary during inference S Fan, X Jiang, X Li, X Meng, P Han, S Shang, A Sun, Y Wang, Z Wang arXiv preprint arXiv:2403.02181, 2024 | 81 | 2024 |
| FLM-101B: An open LLM and how to train it with $100K budget X Li, Y Yao, X Jiang, X Fang, X Meng, S Fan, P Han, J Li, L Du, B Qin, ... arXiv preprint arXiv:2309.03852, 2023 | 38 | 2023 |
| Quantifying and attributing the hallucination of large language models via association analysis L Du, Y Wang, X Xing, Y Ya, X Li, X Jiang, X Fang arXiv preprint arXiv:2309.05217, 2023 | 31 | 2023 |
| Packet representation learning for traffic classification X Meng, Y Wang, R Ma, H Luo, X Li, Y Zhang Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022 | 28 | 2022 |
| Tele-FLM technical report X Li, Y Yao, X Jiang, X Fang, C Wang, X Liu, Z Wang, Y Zhao, X Wang, ... arXiv preprint arXiv:2404.16645, 2024 | 25 | 2024 |
| Named entity recognition using a semi-supervised model based on BERT and bootstrapping Y Liu, X Li, J Shi, L Zhang, J Li China Conference on Knowledge Graph and Semantic Computing, 54-63, 2020 | 9 | 2020 |
| 52B to 1T: Lessons learned via Tele-FLM series X Li, Y Yao, X Jiang, X Fang, C Wang, X Liu, Z Wang, Y Zhao, X Wang, ... arXiv preprint arXiv:2407.02783, 2024 | 7 | 2024 |
| Sketch: A toolkit for streamlining LLM operations X Jiang, X Li, W Ma, X Fang, Y Yao, N Yu, X Meng, P Han, J Li, A Sun, ... arXiv preprint arXiv:2409.03346, 2024 | 3 | 2024 |
| FreeLM: Fine-tuning-free language model X Li, X Jiang, X Meng, A Sun, Y Wang arXiv preprint arXiv:2305.01616, 2023 | 3 | 2023 |
| RoboEgo System Card: An Omnimodal Model with Native Full Duplexity Y Yao, X Li, X Jiang, X Fang, N Yu, A Sun, Y Wang arXiv preprint arXiv:2506.01934, 2025 | 2 | 2025 |
| CofeNet: Context and former-label enhanced net for complicated quotation extraction Y Wang, X Li, A Sun, X Meng, H Liao, J Guo arXiv preprint arXiv:2209.09432, 2022 | 2 | 2022 |
| EgoMem: Lifelong memory agent for full-duplex omnimodal models Y Yao, N Yu, X Li, X Jiang, X Fang, W Ma, X Meng, J Li, A Sun, Y Wang arXiv preprint arXiv:2509.11914, 2025 | 1 | 2025 |
| FLM-Audio: Natural monologues improves native full-duplex chatbots via dual training Y Yao, X Li, X Jiang, X Fang, N Yu, W Ma, A Sun, Y Wang arXiv preprint arXiv:2509.02521, 2025 | 1 | 2025 |
| Open-domain implicit format control for large language model generation Y Yao, W Ma, X Fang, X Jiang, X Li, X Meng, P Han, J Li, A Sun, Y Wang arXiv preprint arXiv:2408.04392, 2024 | 1 | 2024 |
| nanoLM: an Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales Y Yao, X Huang, X Fang, X Li, Z Ni, X Jiang, X Meng, P Han, S Shang, ... arXiv preprint arXiv:2304.06875, 2023 | 1 | 2023 |
| NanoLM: An Affordable LLM Study Benchmark via Accurate Loss Prediction Across Scales S Fan, X Huang, X Fang, Y Yao, X Li, Z Ni, X Jiang, X Meng, P Han, ... | | |