Wei Xue

Cited by

	All	Since 2021
Citations	3024	2910
h-index	26	25
i10-index	57	55

Citations per year

Year	Citations
2016	11
2017	10
2018	18
2019	25
2020	41
2021	58
2022	72
2023	103
2024	612
2025	1938
2026	118

Public access

35 articles available

1 article not available

Based on funding mandates

Co-authors

Yike Guo - Dept of CSE, The Hong Kong University of Science and Technology - Verified email at ust.hk
Ruibin Yuan - HKUST - Verified email at andrew.cmu.edu
Chi-Min Chan - HKUST - Verified email at connect.ust.hk
Shanghang Zhang - Peking University - Verified email at pku.edu.cn
Zhen Ye - The Hong Kong University of Science and Technology - Verified email at connect.ust.hk
Wenhan Luo - Associate Professor, HKUST - Verified email at ust.hk
Jiahao Pan - Hong Kong University of Science and Technology - Verified email at connect.ust.hk
Jie Fu - IQuest Research - Verified email at lisa.iro.umontreal.ca
Zeyue Tian - Hong Kong University of Science and Technology - Verified email at connect.ust.hk
Patrick A. Naylor - Imperial College London - Verified email at imperial.ac.uk
Mike Brookes - Reader in Signal Processing, Imperial College London - Verified email at imperial.ac.uk
Xingqun Qi - The Hong Kong University of Science and Technology (HKUST) - Verified email at connect.ust.hk
Xu Tan - Principal Researcher and Research Manager, Microsoft - Verified email at microsoft.com
Xiaowei Chi - The Hong Kong University of Science and Technology - Verified email at connect.ust.hk
Lei Xie - Northwestern Polytechnical University - Verified email at nwpu.edu.cn
Alastair H Moore - Imperial College London - Verified email at imperial.ac.uk
Qifeng Chen - HKUST - Verified email at ust.hk
Xinfa Zhu - Northwestern Polytechnical University - Verified email at mail.nwpu.edu.cn
Peiwen Sun - Multimedia lab, The Chinese University of Hong Kong - Verified email at link.cuhk.edu.hk
Peng Li - HKUST | Tsinghua University - Verified email at mails.tsinghua.edu.cn

Wei Xue

HKUST

Verified email at ust.hk

Audio Processing, AI Music, Foundation Models, Generative AI, Multimodal


Title	Cited by	Year
Chateval: Towards better llm-based evaluators through multi-agent debate - CM Chan, W Chen, Y Su, J Yu, W Xue, S Zhang, J Fu, Z Liu - ICLR 2024, 2024	843	2024
RQ-RAG: Learning to refine queries for retrieval augmented generation - CM Chan, C Xu, R Yuan, H Luo, W Xue, Y Guo, J Fu - COLM 2024, 2024	211	2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM - R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ... - ACL Findings 2024, 2024	107	2024
Spark-tts: An efficient llm-based text-to-speech model with single-stream decoupled speech tokens - X Wang, M Jiang, Z Ma, Z Zhang, S Liu, L Li, Z Liang, Q Zheng, R Wang, ... - arXiv preprint arXiv:2503.01710, 2025	94	2025
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation - J Liu, S Yang, P Jia, M Lu, Y Guo, W Xue, S Zhang - ICLR 2024, 2024	82	2024
Llasa: Scaling train-time and inference-time compute for llama-based speech synthesis - Z Ye, X Zhu, CM Chan, X Wang, X Tan, J Lei, Y Peng, H Liu, Y Jin, Z Dai, ... - arXiv preprint arXiv:2502.04128, 2025	67	2025
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model - Z Ye, W Xue, X Tan, J Chen, Q Liu, Y Guo - ACM MM 2023, 2023	66	2023
Long short-term memory recurrent neural network based segment features for music genre classification - J Dai, S Liang, W Xue, C Ni, W Liu - ISCSLP 2016, 2016	61	2016
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model - Z Ye, P Sun, J Lei, H Lin, X Tan, Z Dai, Q Kong, J Chen, J Pan, Q Liu, ... - AAAI 2025, 2025	57	2025
ComposerX: Multi-Agent Symbolic Music Composition with LLMs - Q Deng, Q Yang, R Yuan, Y Huang, Y Wang, X Liu, Z Tian, J Pan, ... - ISMIR 2024, 2024	56	2024
MARBLE: Music Audio Representation Benchmark for Universal Evaluation - R Yuan, Y Ma, Y Li, G Zhang, X Chen, H Yin, L Zhuo, Y Liu, J Huang, ... - NeurIPS 2023, 2023	55	2023
Llms meet multimodal generation and editing: A survey - Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu, R Yuan, Y Xing, W Wang, ... - arXiv preprint arXiv:2405.19334, 2024	54	2024
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix - Z Ma, Y Ma, Y Zhu, C Yang, YW Chao, R Xu, W Chen, Y Chen, Z Chen, ... - NeurIPS 2025, 2025	44	2025
Vidmuse: A simple video-to-music generation framework with long-short-term modeling - Z Tian, Z Liu, R Yuan, J Pan, Q Liu, X Tan, Q Chen, W Xue, Y Guo - CVPR 2025, 2025	43	2025
SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping - V Kothapally, W Xia, S Ghorbani, JHL Hansen, W Xue, J Huang - INTERSPEECH 2020, 2020	43	2020
PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion - P Li, W Zheng, Y Liu, T Yu, Y Li, X Qi, M Li, X Chi, S Xia, W Xue, W Luo, ... - CVPR 2025, 2025	42*	2025
Discovering Sparsity Allocation for Layer-wise Pruning of Large Language Models - L Li, P Dong, Z Tang, X Liu, Q Wang, W Luo, W Xue, Q Liu, X Chu, Y Guo - NeurIPS 2024, 2024	42	2024
The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio - S Liang, W Liu, W Jiang, W Xue - The Journal of the Acoustical Society of America 134 (5), EL452-EL458, 2013	38	2013
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs - P Dong, L Li, Y Zhong, D Du, R Fan, Y Chen, Z Tang, Q Wang, W Xue, ... - ICLR 2025, 2025	37	2025
DetKDS: Knowledge Distillation Search for Object Detectors - L Li, Y Bao, P Dong, C Yang, A Li, W Luo, Q Liu, W Xue, Y Guo - ICML 2024, 2024	37	2024
