[go: up one dir, main page]

Follow
Haiwen Diao
Haiwen Diao
DLUT, NTU
Verified email at ntu.edu.sg - Homepage
Title
Cited by
Cited by
Year
Similarity Reasoning and Filtration for Image-Text Matching
H Diao, Y Zhang, L Ma, H Lu
AAAI Conference on Artificial Intelligence (AAAI), 2021
4702021
Autoregressive Video Generation without Vector Quantization
H Deng*, T Pan*, H Diao*, Z Luo*, Y Cui, H Lu, S Shan, Y Qi, X Wang
International Conference on Learning Representations (ICLR), 2024
982024
Unveiling Encoder-Free Vision-Language Models
H Diao*, Y Cui*, X Li, Y Wang, H Lu, X Wang
Advances in Neural Information Processing Systems (NeurIPS, spotlight), 2024
742024
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
X Li*, F Zhang*, H Diao*, Y Wang, X Wang, LY Duan
Advances in Neural Information Processing Systems (NeurIPS), 2024
542024
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
S Gu, J Zhang, S Zhou, K Yu, Z Xing, L Wang, Z Cao, J Jia, Z Zhang, ...
arXiv preprint arXiv:2410.18558, 2024
502024
Plug-and-Play Regulators for Image-Text Matching
H Diao, Y Zhang, W Liu, X Ruan, H Lu
IEEE Transactions on Image Processing (TIP), 2023
382023
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
H Diao, B Wan, Y Zhang, X Jia, H Lu, L Chen
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
352024
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models
H Diao*, X Li*, Y Cui*, Y Wang*, H Deng, T Pan, W Wang, H Lu, X Wang
International Conference on Computer Vision (ICCV, highlight), 2025
172025
Exploring Dynamic Transformer for Efficient Object Tracking
J Zhu, X Chen, H Diao, S Li, JY He, C Li, B Luo, D Wang, H Lu
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
162024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
H Diao, B Wan, X Jia, Y Zhuge, Y Zhang, H Lu, L Chen
European Conference on Computer Vision (ECCV), 2024
142024
LLMs Can Evolve Continually on Modality for X-Modal Reasoning
J Yu, H Xiong, L Zhang, H Diao, Y Zhuge, L Hong, D Wang, H Lu, Y He, ...
Advances in Neural Information Processing Systems (NeurIPS), 2024
13*2024
MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models
X Li, X Jia, Q Wang, H Diao, P Li, Y He, H Lu
ACM International Conference on Multimedia (ACMMM), 2024
102024
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning
H Diao, Y Zhang, S Gao, J Zhu, L Chen, H Lu
IEEE Transactions on Image Processing (TIP), 2024
92024
Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching
H Diao, Y Zhang, S Gao, X Ruan, H Lu
IEEE Transactions on Image Processing (TIP), 2024
72024
Visual Jigsaw Post-Training Improves MLLMs
P Wu, Y Zhang, H Diao, B Li, L Lu, Z Liu
arXiv preprint arXiv:2509.25190, 2025
62025
End-to-End Vision Tokenizer Tuning
W Wang*, F Zhang*, Y Cui*, H Diao*, Z Luo, H Lu, J Liu, X Wang
Advances in Neural Information Processing Systems (NeurIPS), 2025
22025
KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification
Y Zhu*, H Diao*, S Gao*, L Chen, H Lu
lEEE International Conference on Acoustics, Speech, and Signal Processing …, 2025
12025
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
W Fan, H Diao, Q Wang, D Lin, Z Liu
arXiv preprint arXiv:2512.19693, 2025
2025
From Pixels to Words--Towards Native Vision-Language Primitives at Scale
H Diao, M Li, S Wu, L Dai, X Wang, H Deng, L Lu, D Lin, Z Liu
arXiv preprint arXiv:2510.14979, 2025
2025
Regularizing Subspace Redundancy of Low-Rank Adaptation
Y Zhu*, H Diao*, S Gao*, J Yu, J Zhu, Y Zhuge, S Hao, X Jia, L Zhang, ...
ACM International Conference on Multimedia (ACMMM), 2025
2025
The system can't perform the operation now. Try again later.
Articles 1–20