| Atlantis: Aesthetic-oriented multiple granularities fusion network for joint multimodal aspect-based sentiment analysis L Xiao, X Wu, J Xu, W Li, C Jin, L He Information Fusion 106, 102304, 2024 | 82 | 2024 |
| Unified reward model for multimodal understanding and generation Y Wang, Y Zang, H Li, C Jin, J Wang arXiv preprint arXiv:2503.05236, 2025 | 72 | 2025 |
| A simplified multi-class support vector machine with reduced dual optimization X He, Z Wang, C Jin, Y Zheng, X Xue Pattern Recognition Letters 33 (1), 71-82, 2012 | 72 | 2012 |
| Learning attention map from images Y Lu, W Zhang, C Jin, X Xue 2012 IEEE Conference on Computer Vision and Pattern Recognition, 1067-1074, 2012 | 43 | 2012 |
| ETR: An Efficient Transformer for Re-Ranking in Visual Place Recognition H Zhang, X Chen, H Jing, Y Zheng, Y Wu, C Jin Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 40 | 2023 |
| Sketch-based image retrieval with deep visual semantic descriptor F Huang, C Jin, Y Zhang, K Weng, T Zhang, W Fan Pattern Recognition 76, 537-548, 2018 | 39 | 2018 |
| Unified multimodal chain-of-thought reward model through reinforcement fine-tuning Y Wang, Z Li, Y Zang, C Wang, Q Lu, C Jin, J Wang arXiv preprint arXiv:2505.03318, 2025 | 35 | 2025 |
| Cross-Modal Image Clustering via Canonical Correlation Analysis C Jin, W Mao, R Zhang, Y Zhang, X Xue Twenty-Ninth AAAI Conference on Artificial Intelligence, 151-159, 2015 | 32 | 2015 |
| LiFT: Leveraging human feedback for text-to-video model alignment Y Wang, Z Tan, J Wang, X Yang, C Jin, H Li arXiv preprint arXiv:2412.04814, 2024 | 29 | 2024 |
| High-fidelity Person-centric Subject-to-Image Synthesis Y Wang, W Zhang, J Zheng, C Jin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 29 | 2024 |
| A Hierarchical Multimodal Attention-based Neural Network for Image Captioning Y Cheng, F Huang, L Zhou, C Jin, Y Zhang, T Zhang Proceedings of the 40th International ACM SIGIR Conference on Research and …, 2017 | 29 | 2017 |
| Attention-based transformation from latent features to point clouds K Zhang, X Yang, Y Wu, C Jin Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 3291-3299, 2022 | 27 | 2022 |
| Pref-GRPO: Pairwise preference reward-based GRPO for stable text-to-image reinforcement learning Y Wang, Z Li, Y Zang, Y Zhou, J Bu, C Wang, Q Lu, C Jin, J Wang arXiv preprint arXiv:2508.20751, 2025 | 26 | 2025 |
| FusionFormer: A Concise Unified Feature Fusion Transformer for 3D Pose Estimation Y Cai, W Zhang, Y Wu, C Jin Proceedings of the AAAI Conference on Artificial Intelligence 38 (2), 900-908, 2024 | 24 | 2024 |
| GLTA-GCN: Global-Local Temporal Attention Graph Convolutional Network for Unsupervised Skeleton-Based Action Recognition H Qiu, Y Wu, MM Duan, C Jin 2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022 | 23 | 2022 |
| CPCGAN: A Controllable 3D Point Cloud Generative Adversarial Network with Semantic Label Generating X Yang, Y Wu, K Zhang, C Jin Proceedings of the AAAI Conference on Artificial Intelligence 35 (4), 3154-3162, 2021 | 23 | 2021 |
| Tracking user-preference varying speed in collaborative filtering R Li, B Li, C Jin, X Xue, X Zhu Proceedings of the AAAI Conference on Artificial Intelligence 25 (1), 133-138, 2011 | 23 | 2011 |
| PrimeComposer: Faster progressively combined diffusion for image composition with attention steering Y Wang, W Zhang, J Zheng, C Jin Proceedings of the 32nd ACM International Conference on Multimedia, 10824-10832, 2024 | 20 | 2024 |
| DDT: Dual-branch deformable transformer for image denoising K Liu, X Du, S Liu, Y Zheng, X Wu, C Jin 2023 IEEE International Conference on Multimedia and Expo (ICME), 2765-2770, 2023 | 19 | 2023 |
| Fully convolutional video captioning with coarse-to-fine and inherited attention K Fang, L Zhou, C Jin, Y Zhang, K Weng, T Zhang, W Fan Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8271-8278, 2019 | 19 | 2019 |