| Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025 | 1629* | 2025 |
| Automl-gpt: Automatic machine learning with gpt S Zhang, C Gong, L Wu, X Liu, M Zhou arXiv preprint arXiv:2305.02499, 2023 | 139 | 2023 |
| A prototype-oriented framework for unsupervised domain adaptation K Tanwisuth, X Fan, H Zheng, S Zhang, H Zhang, B Chen, M Zhou Advances in Neural Information Processing Systems 34, 17194-17208, 2021 | 139 | 2021 |
| Fusedream: Training-free text-to-image generation with improved clip+ gan space optimization X Liu, C Gong, L Wu, S Zhang, H Su, Q Liu arXiv preprint arXiv:2112.01573, 2021 | 111 | 2021 |
| Introducing gemini 2.0: our new ai model for the agentic era S Pichai, D Hassabis, K Kavukcuoglu Google blog, 2024 | 107* | 2024 |
| Bayesian attention modules X Fan, S Zhang, B Chen, M Zhou Advances in Neural Information Processing Systems 33, 16362-16376, 2020 | 89 | 2020 |
| Pouf: Prompt-oriented unsupervised fine-tuning for large pre-trained models K Tanwisuth, S Zhang, H Zheng, P He, M Zhou International conference on machine learning, 33816-33832, 2023 | 59 | 2023 |
| Knowing more about questions can help: Improving calibration in question answering S Zhang, C Gong, E Choi arXiv preprint arXiv:2106.01494, 2021 | 54 | 2021 |
| WPO: Enhancing RLHF with Weighted Preference Optimization W Zhou, R Agrawal, S Zhang, SR Indurthi, S Zhao, K Song, S Xu, C Zhu Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | 46 | 2024 |
| Bayesian attention belief networks S Zhang, X Fan, B Chen, M Zhou International Conference on Machine Learning, 12413-12426, 2021 | 44 | 2021 |
| Allsh: Active learning guided by local sensitivity and hardness S Zhang, C Gong, X Liu, P He, W Chen, M Zhou arXiv preprint arXiv:2205.04980, 2022 | 42 | 2022 |
| Learning from uneven training data: Unlabeled, single label, and multiple labels S Zhang, C Gong, E Choi arXiv e-prints, arXiv: 2109.04408, 2021 | 42* | 2021 |
| Contextual dropout: An efficient sample-dependent dropout module X Fan, S Zhang, K Tanwisuth, X Qian, M Zhou arXiv preprint arXiv:2103.04181, 2021 | 39 | 2021 |
| Instructional segment embedding: Improving llm safety with instruction hierarchy T Wu, S Zhang, K Song, S Xu, S Zhao, R Agrawal, SR Indurthi, C Xiang, ... arXiv preprint arXiv:2410.09102, 2024 | 34 | 2024 |
| Preference-grounded token-level guidance for language model fine-tuning S Yang, S Zhang, C Xia, Y Feng, C Xiong, M Zhou Advances in Neural Information Processing Systems 36, 24466-24496, 2023 | 30 | 2023 |
| Fantastic rewards and how to tame them: A case study on reward learning for task-oriented dialogue systems Y Feng, S Yang, S Zhang, J Zhang, C Xiong, M Zhou, H Wang arXiv preprint arXiv:2302.10342, 2023 | 30 | 2023 |
| Flowgrad: Controlling the output of generative odes with gradients X Liu, L Wu, S Zhang, C Gong, W Ping, Q Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 22 | 2023 |
| Sliced Wasserstein with random-path projecting directions K Nguyen, S Zhang, T Le, N Ho Proceedings of the ICML, 2024, 2024 | 20 | 2024 |
| A unified framework for alternating offline model training and policy learning S Yang, S Zhang, Y Feng, M Zhou Advances in Neural Information Processing Systems 35, 17216-17232, 2022 | 19 | 2022 |
| Alignment attention by matching key and query distributions S Zhang, X Fan, H Zheng, K Tanwisuth, M Zhou Advances in Neural Information Processing Systems 34, 13444-13457, 2021 | 19 | 2021 |