[go: up one dir, main page]

Follow
Wei Xue
Title
Cited by
Cited by
Year
Chateval: Towards better llm-based evaluators through multi-agent debate
CM Chan, W Chen, Y Su, J Yu, W Xue, S Zhang, J Fu, Z Liu
ICLR 2024, 2024
8432024
RQ-RAG: Learning to refine queries for retrieval augmented generation
CM Chan, C Xu, R Yuan, H Luo, W Xue, Y Guo, J Fu
COLM 2024, 2024
2112024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ...
ACL Findings 2024, 2024
1072024
Spark-tts: An efficient llm-based text-to-speech model with single-stream decoupled speech tokens
X Wang, M Jiang, Z Ma, Z Zhang, S Liu, L Li, Z Liang, Q Zheng, R Wang, ...
arXiv preprint arXiv:2503.01710, 2025
942025
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
J Liu, S Yang, P Jia, M Lu, Y Guo, W Xue, S Zhang
ICLR 2024, 2024
822024
Llasa: Scaling train-time and inference-time compute for llama-based speech synthesis
Z Ye, X Zhu, CM Chan, X Wang, X Tan, J Lei, Y Peng, H Liu, Y Jin, Z Dai, ...
arXiv preprint arXiv:2502.04128, 2025
672025
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Z Ye, W Xue, X Tan, J Chen, Q Liu, Y Guo
ACM MM 2023, 2023
662023
Long short-term memory recurrent neural network based segment features for music genre classification
J Dai, S Liang, W Xue, C Ni, W Liu
ISCSLP 2016, 2016
612016
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Z Ye, P Sun, J Lei, H Lin, X Tan, Z Dai, Q Kong, J Chen, J Pan, Q Liu, ...
AAAI 2025, 2025
572025
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Q Deng, Q Yang, R Yuan, Y Huang, Y Wang, X Liu, Z Tian, J Pan, ...
ISMIR 2024, 2024
562024
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
R Yuan, Y Ma, Y Li, G Zhang, X Chen, H Yin, L Zhuo, Y Liu, J Huang, ...
NeurIPS 2023, 2023
552023
Llms meet multimodal generation and editing: A survey
Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu, R Yuan, Y Xing, W Wang, ...
arXiv preprint arXiv:2405.19334, 2024
542024
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Z Ma, Y Ma, Y Zhu, C Yang, YW Chao, R Xu, W Chen, Y Chen, Z Chen, ...
NeurIPS 2025, 2025
442025
Vidmuse: A simple video-to-music generation framework with long-short-term modeling
Z Tian, Z Liu, R Yuan, J Pan, Q Liu, X Tan, Q Chen, W Xue, Y Guo
CVPR 2025, 2025
432025
SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping
V Kothapally, W Xia, S Ghorbani, JHL Hansen, W Xue, J Huang
INTERSPEECH 2020, 2020
432020
PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
P Li, W Zheng, Y Liu, T Yu, Y Li, X Qi, M Li, X Chi, S Xia, W Xue, W Luo, ...
CVPR 2025, 2025
42*2025
Discovering Sparsity Allocation for Layer-wise Pruning of Large Language Models
L Li, P Dong, Z Tang, X Liu, Q Wang, W Luo, W Xue, Q Liu, X Chu, Y Guo
NeurIPS 2024, 2024
422024
The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio
S Liang, W Liu, W Jiang, W Xue
The Journal of the Acoustical Society of America 134 (5), EL452-EL458, 2013
382013
STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
P Dong, L Li, Y Zhong, D Du, R Fan, Y Chen, Z Tang, Q Wang, W Xue, ...
ICLR 2025, 2025
372025
DetKDS: Knowledge Distillation Search for Object Detectors
L Li, Y Bao, P Dong, C Yang, A Li, W Luo, Q Liu, W Xue, Y Guo
ICML 2024, 2024
372024
The system can't perform the operation now. Try again later.
Articles 1–20