| Rotate-and-render: Unsupervised photorealistic face rotation from single-view images H Zhou, J Liu, Z Liu, Y Liu, X Wang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 158 | 2020 |
| Seed1.5-VL technical report D Guo, F Wu, F Zhu, F Leng, G Shi, H Chen, H Fan, J Wang, J Jiang, ... arXiv preprint arXiv:2505.07062, 2025 | 153 | 2025 |
| Meta Knowledge Distillation J Liu, B Liu, H Li, Y Liu arXiv preprint, https://arxiv.org/abs/2202.07940, 2022 | 119* | 2022 |
| Seed1.5-Thinking: Advancing superb reasoning models with reinforcement learning BD Seed, J Chen, T Fan, X Liu, L Liu, Z Lin, M Wang, C Wang, X Wei, ... arXiv preprint arXiv:2504.13914, 2025 | 117 | 2025 |
| Development of deep learning algorithms for predicting blastocyst formation and quality by time-lapse monitoring Q Liao, Q Zhang, X Feng, H Huang, H Xu, B Tian, J Liu, Q Yu, N Guo, ... Communications biology 4 (1), 415, 2021 | 105 | 2021 |
| MixMAE: Mixed and masked autoencoder for efficient pretraining of hierarchical vision transformers J Liu, X Huang, J Zheng, Y Liu, H Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 97 | 2023 |
| TokenMix: Rethinking image mixing for data augmentation in vision transformers J Liu, B Liu, H Zhou, H Li, Y Liu European conference on computer vision, 455-471, 2022 | 94 | 2022 |
| Learning where to focus for efficient video object detection Z Jiang, Y Liu, C Yang, J Liu, P Gao, Q Zhang, S Xiang, C Pan European conference on computer vision, 18-34, 2020 | 89 | 2020 |
| MixMIM: Mixed and masked image modeling for efficient visual representation learning J Liu, X Huang, Y Liu, H Li CVPR, 2022 | 62 | 2022 |
| UniNet: Unified architecture search with convolution, transformer, and MLP J Liu, X Huang, G Song, H Li, Y Liu European conference on computer vision, 33-49, 2022 | 49 | 2022 |
| INTERN: A new learning paradigm towards general vision J Shao, S Chen, Y Li, K Wang, Z Yin, Y He, J Teng, Q Sun, M Gao, J Liu, ... arXiv preprint arXiv:2111.08687, 2021 | 42 | 2021 |
| GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding J Liu, T Wang, B Liu, Q Zhang, Y Liu, H Li ICCV 2023, 2023 | 21 | 2023 |
| Towards FLOPs-constrained face recognition Y Liu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 21 | 2019 |
| EasyDrag: Efficient point-based manipulation on diffusion models X Hou, B Liu, Y Zhang, J Liu, Y Liu, H You Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 20 | 2024 |
| DecisionNCE: Embodied multimodal representations via implicit preference learning J Li, J Zheng, Y Zheng, L Mao, X Hu, S Cheng, H Niu, J Liu, Y Liu, J Liu, ... arXiv preprint arXiv:2402.18137, 2024 | 15 | 2024 |
| Instruction-guided visual masking J Zheng, J Li, S Cheng, Y Zheng, J Li, J Liu, Y Liu, J Liu, X Zhan Advances in neural information processing systems 37, 126004-126031, 2024 | 14 | 2024 |
| StreamChat: Chatting with streaming video J Liu, Z Yu, S Lan, S Wang, R Fang, J Kautz, H Li, JM Alvarez arXiv preprint arXiv:2412.08646, 2024 | 11 | 2024 |
| GLID: Pre-training a generalist encoder-decoder vision model J Liu, J Zheng, Y Liu, H Li Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2024 | 9 | 2024 |
| FNAS: Uncertainty-aware fast neural architecture search J Liu, M Zhang, Y Sun, B Liu, G Song, Y Liu, H Li arXiv preprint arXiv:2105.11694, 2021 | 7 | 2021 |
| MM-Instruct: Generated visual instructions for large multimodal model alignment J Liu, X Huang, J Zheng, B Liu, J Wang, O Yoshie, Y Liu, H Li arXiv preprint arXiv:2406.19736, 2024 | 6 | 2024 |