Zhuofan Zong
Title
Cited by
Year
Detrs with collaborative hybrid assignments training
Z Zong, G Song, Y Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 722; Year: 2023
Visual cot: Advancing multi-modal language models with a comprehensive dataset and benchmark for chain-of-thought reasoning
H Shao, S Qian, H Xiao, G Song, Z Zong, L Wang, Y Liu, H Li
Advances in Neural Information Processing Systems 37, 8612-8642, 2024
Cited by: 223; Year: 2024
Raphael: Text-to-image generation via large mixture of diffusion paths
Z Xue, G Song, Q Guo, B Liu, Z Zong, Y Liu, P Luo
Advances in Neural Information Processing Systems 36, 41693-41706, 2023
Cited by: 215; Year: 2023
Mova: Adapting mixture of vision experts to multimodal context
Z Zong*, B Ma*, D Shen, G Song, H Shao, D Jiang, H Li, Y Liu
Advances in Neural Information Processing Systems 37, 103305-103333, 2024
Cited by: 96; Year: 2024
Visual cot: Unleashing chain-of-thought reasoning in multi-modal language models
H Shao, S Qian, H Xiao, G Song, Z Zong, L Wang, Y Liu, H Li
CoRR, 2024
Cited by: 86; Year: 2024
T2i-r1: Reinforcing image generation with collaborative semantic-level and token-level cot
D Jiang, Z Guo, R Zhang, Z Zong, H Li, L Zhuo, S Yan, PA Heng, H Li
arXiv preprint arXiv:2505.00703, 2025
Cited by: 82; Year: 2025
Graph attention based proposal 3d convnets for action detection
J Li, X Liu, Z Zong, W Zhao, M Zhang, J Song
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4626-4633, 2020
Cited by: 67; Year: 2020
Comat: Aligning text-to-image diffusion model with image-to-text concept matching
D Jiang, G Song, X Wu, R Zhang, D Shen, Z Zong, Y Liu, H Li
Advances in Neural Information Processing Systems 37, 76177-76209, 2024
Cited by: 55; Year: 2024
Self-slimmed vision transformer
Z Zong*, K Li*, G Song, Y Wang, Y Qiao, B Leng, Y Liu
European Conference on Computer Vision, 432-448, 2022
Cited by: 53; Year: 2022
Temporal enhanced training of multi-view 3d object detector via historical object prediction
Z Zong*, D Jiang*, G Song, Z Xue, J Su, H Li, Y Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 49; Year: 2023
Exploring the role of large language models in prompt encoding for diffusion models
B Ma*, Z Zong*, G Song, H Li, Y Liu
Advances in Neural Information Processing Systems 37, 118428-118455, 2024
Cited by: 40; Year: 2024
RCNet: Reverse feature pyramid and cross-scale shift network for object detection
Z Zong, Q Cao, B Leng
Proceedings of the 29th ACM International Conference on Multimedia, 5637-5645, 2021
Cited by: 26; Year: 2021
Temporal enhanced training of multi-view 3d object detector via historical object prediction
Z Zong, D Jiang, G Song, Z Xue, J Su, H Li, Y Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 25; Year: 2023
Easyref: Omni-generalized group image reference for diffusion models via multimodal llm
Z Zong, D Jiang, B Ma, G Song, H Shao, D Shen, Y Liu, H Li
Forty-second International Conference on Machine Learning, 2024
Cited by: 14; Year: 2024
DETRs with collaborative hybrid assignments training (2023)
Z Zong, G Song, Y Liu
arXiv preprint arXiv:2211.12860
Cited by: 11
Large-batch optimization for dense visual predictions: Training faster R-CNN in 4.2 minutes
Z Xue, J Liang, G Song, Z Zong, L Chen, Y Liu, P Luo
Advances in Neural Information Processing Systems 35, 18694-18706, 2022
Cited by: 7; Year: 2022
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping
H Shao, S Wang, Y Zhou, G Song, D He, S Qin, Z Zong, B Ma, Y Liu, H Li
arXiv preprint arXiv:2412.11279, 2024
Cited by: 5; Year: 2024
Large-batch optimization for dense visual predictions
Z Xue, J Liang, G Song, Z Zong, L Chen, Y Liu, P Luo
Advances in Neural Information Processing Systems 1, 2022
Cited by: 5; Year: 2022
ADT: Tuning Diffusion Models with Adversarial Supervision
D Shen, G Song, Y Zhang, B Ma, L Li, D Jiang, Z Zong, Y Liu
arXiv preprint arXiv:2504.11423, 2025
Cited by: 3; Year: 2025
Webgen-agent: Enhancing interactive website generation with multi-level feedback and step-level reinforcement learning
Z Lu, H Ren, Y Yang, K Wang, Z Zong, J Pan, M Zhan, H Li
arXiv preprint arXiv:2509.22644, 2025
Cited by: 2; Year: 2025