[go: up one dir, main page]

Follow
Di Hu
Di Hu
Tenure-track Associate Professor, Renmin University of China
Verified email at ruc.edu.cn - Homepage
Title
Cited by
Cited by
Year
Balanced multimodal learning via on-the-fly gradient modulation
X Peng, Y Wei, A Deng, D Wang, D Hu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
4412022
Deep multimodal clustering for unsupervised audiovisual learning
D Hu, F Nie, X Li
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
2882019
Learning to answer questions in dynamic audio-visual scenarios
G Li, Y Wei, Y Tian, C Xu, JR Wen, D Hu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
2352022
Multiple Sound Sources Localization from Coarse to Fine
R Qian, D Hu, H Dinkel, M Wu, N Xu, W Lin
arXiv preprint arXiv:2007.06355, 2020
2232020
Deep binary reconstruction for cross-modal hashing
X Li, D Hu, F Nie
Proceedings of the 25th ACM international conference on Multimedia, 1398-1406, 2017
2062017
Discriminative sounding objects localization via self-supervised audiovisual matching
D Hu, R Qian, M Jiang, X Tan, S Wen, E Ding, W Lin, D Dou
Advances in Neural Information Processing Systems 33, 10077-10087, 2020
1782020
Temporal multimodal learning in audiovisual speech recognition
D Hu, X Li
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016
1402016
Unsupervised multi-source domain adaptation for person re-identification
Z Bai, Z Wang, J Wang, D Hu, E Ding
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
1302021
Cyclic co-learning of sounding object visual grounding and sound separation
Y Tian, D Hu, C Xu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
1142021
Multimodal fusion on low-quality data: A comprehensive survey
Q Zhang, Y Wei, Z Han, H Fu, X Peng, C Deng, Q Hu, C Xu, J Wen, D Hu, ...
arXiv preprint arXiv:2404.18947, 2024
1042024
Learning in audio-visual context: A review, analysis, and new perspective
Y Wei, D Hu, Y Tian, X Li
arXiv preprint arXiv:2208.09579, 2022
1012022
Large graph hashing with spectral rotation
X Li, D Hu, F Nie
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
692017
Enhancing multimodal cooperation via sample-level modality valuation
Y Wei, R Feng, Z Wang, D Hu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
682024
Temporal relational modeling with self-supervision for action segmentation
D Wang, D Hu, X Li, D Dou
Proceedings of the AAAI conference on artificial intelligence 35 (4), 2729-2737, 2021
672021
Self-supervised audiovisual representation learning for remote sensing data
K Heidler, L Mou, D Hu, P Jin, G Li, C Gan, JR Wen, XX Zhu
International Journal of Applied Earth Observation and Geoinformation 116 …, 2023
632023
Mmpareto: Boosting multimodal learning with innocent unimodal assistance
Y Wei, D Hu
arXiv preprint arXiv:2405.17730, 2024
602024
Mmcosine: Multi-modal cosine loss towards balanced audio-visual fine-grained learning
R Xu, R Feng, SX Zhang, D Hu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
602023
Dense multimodal fusion for hierarchically joint representation
D Hu, C Wang, F Nie, X Li
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
552019
Progressive spatio-temporal perception for audio-visual question answering
G Li, W Hou, D Hu
Proceedings of the 31st ACM international conference on multimedia, 7808-7816, 2023
542023
Class-aware sounding objects localization via audiovisual correspondence
D Hu, Y Wei, R Qian, W Lin, R Song, JR Wen
IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (12), 9844 …, 2021
512021
The system can't perform the operation now. Try again later.
Articles 1–20