Di Hu

Cited by

	All	Since 2021
Citations	3639	3397
h-index	31	30
i10-index	52	51

1500

750

375

1125

201720182019202020212022202320242025202620 39 55 121 216 363 540 841 1401 34

Public access

View all

37 articles

2 articles

available

not available

Based on funding mandates

Co-authors

Yake WeiRenmin University of ChinaVerified email at ruc.edu.cn
Xuelong Li（李学龙）Fellow of AAAI 202...TeleAI of China Telecom（中国电信人工智能研究院）Verified email at ieee.org
Feiping NieOPTIMALVerified email at mails.tsinghua.edu.cn
Guangyao LiTsinghua UniversityVerified email at tsinghua.edu.cn
Ruoxuan FengRenmin University of ChinaVerified email at ruc.edu.cn
Wenke XiaRenmin University of ChinaVerified email at ruc.edu.cn
Yapeng TianAssistant Professor, University of Texas at DallasVerified email at utdallas.edu
Ji-Rong WenGaoling School of Artificial Intelligence, Renmin University of ChinaVerified email at ruc.edu.cn
Lichao MouGerman Aerospace Center (DLR), Technical University of Munich (TUM)Verified email at dlr.de
Dongzhan ZhouResearcher at Shanghai AI LabVerified email at pjlab.org.cn
Haojin YangHasso Plattner Institute | GreenBit.AIVerified email at hpi.de
Wanli Ouyang (欧阳万里)SLAI & CUHK & Shanghai AI LabVerified email at sydney.edu.au

Di Hu

Tenure-track Associate Professor, Renmin University of China

Verified email at ruc.edu.cn - Homepage

Multimodal Perception Multimodal Learning Multimodal Interaction


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Balanced multimodal learning via on-the-fly gradient modulation X Peng, Y Wei, A Deng, D Wang, D Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	441	2022
Deep multimodal clustering for unsupervised audiovisual learning D Hu, F Nie, X Li Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019	288	2019
Learning to answer questions in dynamic audio-visual scenarios G Li, Y Wei, Y Tian, C Xu, JR Wen, D Hu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	235	2022
Multiple Sound Sources Localization from Coarse to Fine R Qian, D Hu, H Dinkel, M Wu, N Xu, W Lin arXiv preprint arXiv:2007.06355, 2020	223	2020
Deep binary reconstruction for cross-modal hashing X Li, D Hu, F Nie Proceedings of the 25th ACM international conference on Multimedia, 1398-1406, 2017	206	2017
Discriminative sounding objects localization via self-supervised audiovisual matching D Hu, R Qian, M Jiang, X Tan, S Wen, E Ding, W Lin, D Dou Advances in Neural Information Processing Systems 33, 10077-10087, 2020	178	2020
Temporal multimodal learning in audiovisual speech recognition D Hu, X Li Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016	140	2016
Unsupervised multi-source domain adaptation for person re-identification Z Bai, Z Wang, J Wang, D Hu, E Ding Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	130	2021
Cyclic co-learning of sounding object visual grounding and sound separation Y Tian, D Hu, C Xu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	114	2021
Multimodal fusion on low-quality data: A comprehensive survey Q Zhang, Y Wei, Z Han, H Fu, X Peng, C Deng, Q Hu, C Xu, J Wen, D Hu, ... arXiv preprint arXiv:2404.18947, 2024	104	2024
Learning in audio-visual context: A review, analysis, and new perspective Y Wei, D Hu, Y Tian, X Li arXiv preprint arXiv:2208.09579, 2022	101	2022
Large graph hashing with spectral rotation X Li, D Hu, F Nie Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	69	2017
Enhancing multimodal cooperation via sample-level modality valuation Y Wei, R Feng, Z Wang, D Hu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	68	2024
Temporal relational modeling with self-supervision for action segmentation D Wang, D Hu, X Li, D Dou Proceedings of the AAAI conference on artificial intelligence 35 (4), 2729-2737, 2021	67	2021
Self-supervised audiovisual representation learning for remote sensing data K Heidler, L Mou, D Hu, P Jin, G Li, C Gan, JR Wen, XX Zhu International Journal of Applied Earth Observation and Geoinformation 116 …, 2023	63	2023
Mmpareto: Boosting multimodal learning with innocent unimodal assistance Y Wei, D Hu arXiv preprint arXiv:2405.17730, 2024	60	2024
Mmcosine: Multi-modal cosine loss towards balanced audio-visual fine-grained learning R Xu, R Feng, SX Zhang, D Hu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	60	2023
Dense multimodal fusion for hierarchically joint representation D Hu, C Wang, F Nie, X Li ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	55	2019
Progressive spatio-temporal perception for audio-visual question answering G Li, W Hou, D Hu Proceedings of the 31st ACM international conference on multimedia, 7808-7816, 2023	54	2023
Class-aware sounding objects localization via audiovisual correspondence D Hu, Y Wei, R Qian, W Lin, R Song, JR Wen IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (12), 9844 …, 2021	51	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors