Xuanjun Chen
Other names: 陳炫均, Victor Chen
Verified email at ntu.edu.tw
Title
Cited by
Year
Towards audio language modeling--an overview
H Wu, X Chen, YC Lin, K Chang, HL Chung, AH Liu, H Lee
arXiv preprint arXiv:2402.13236, 2024
Cited by 64, 2024
Codec-SUPERB: An in-depth analysis of sound codec models
H Wu, HL Chung, YC Lin, YK Wu, X Chen, YC Pai, HH Wang, KW Chang, ...
Findings of the Association for Computational Linguistics: ACL 2024, 2024
Cited by 51, 2024
Dynamic-SUPERB Phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks
C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ...
The Thirteenth International Conference on Learning Representations (ICLR), 2025
Cited by 49, 2025
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
KH Lu, Z Chen, SW Fu, CHH Yang, SF Huang, CK Yang, CE Yu, ...
IEEE Transactions on Audio, Speech, and Language Processing (TASLP, Submitted), 2025
Cited by 19, 2025
Singing Voice Graph Modeling for SingFake Detection
X Chen, H Wu, JSR Jang, H Lee
Proc. INTERSPEECH 2024, 2958-1796, 2024
Cited by 18, 2024
DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset
J Du, IM Lin, IH Chiu, X Chen, H Wu, W Ren, Y Tsao, H Lee, JSR Jang
The IEEE Spoken Language Technology Workshop (IEEE SLT), 2024
Cited by 17, 2024
A preliminary exploration with GPT-4o voice mode
YX Lin, CK Yang, WC Chen, CA Li, C Huang, X Chen, H Lee
arXiv preprint arXiv:2502.09940, 2025
Cited by 16, 2025
Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
H Wu, X Chen, YC Lin, K Chang, J Du, KH Lu, AH Liu, HL Chung, YK Wu, ...
The IEEE Spoken Language Technology Workshop (IEEE SLT), 2024
Cited by 16, 2024
CodecFake+: A Large-Scale Neural Audio Codec-Based Deepfake Speech Dataset
X Chen, J Du, H Wu, L Zhang, I Lin, I Chiu, W Ren, Y Tseng, Y Tsao, ...
IEEE Transactions on Audio, Speech, and Language Processing (TASLP, Submitted), 2025
Cited by 12*, 2025
Adversarial speaker distillation for countermeasure model on automatic speaker verification
YL Liao*, X Chen*, CC Wang, JSR Jang
The 2nd Symposium on Security and Privacy in Speech Communication (Satellite …, 2022
Cited by 11, 2022
Building a Taiwanese Mandarin spoken language model: A first attempt
CK Yang, YK Fu, CA Li, YC Lin, YX Lin, WC Chen, HL Chung, CY Kuan, ...
arXiv preprint arXiv:2411.07111, 2024
Cited by 10, 2024
Neural Codec-based Adversarial Sample Detection for Speaker Verification
X Chen, J Du, H Wu, JSR Jang, H Lee
Proc. INTERSPEECH 2024, 2958-1796, 2024
Cited by 9, 2024
Multimodal transformer distillation for audio-visual synchronization
X Chen, H Wu, CC Wang, H Lee, JSR Jang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech, and …, 2024
Cited by 6, 2024
Push-pull: Characterizing the adversarial robustness for audio-visual active speaker detection
X Chen, H Wu, H Meng, H Lee, JSR Jang
The IEEE Spoken Language Technology Workshop (IEEE SLT), 2023
Cited by 6, 2023
Towards Generalized Source Tracing for Codec-Based Deepfake Speech
X Chen, IM Lin, L Zhang, H Wu, H Lee, JSR Jang
🏆 IEEE ASRU Best Student Paper Nominee, at the IEEE Automatic Speech …, 2025
Cited by 3, 2025
Codec-Based Deepfake Source Tracing via Neural Audio Codec Taxonomy
X Chen, IM Lin, L Zhang, J Du, H Wu, H Lee, JSR Jang
Proc. INTERSPEECH 2025, 2958-1796, 2025
Cited by 3, 2025
Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
W Ren, H Wu, YC Lin, X Chen, R Chao, KH Hung, YJ Li, WY Ting, ...
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech, and …, 2025
Cited by 3, 2025
Exploring State-Space-Model based Language Model in Music Generation
WJ Lee, FC Hsieh, X Chen, FD Tsai, YH Yang
Proc. International Society for Music Information Retrieval (ISMIR) 2025 …, 2025
Cited by 2, 2025
Singer separation for karaoke content generation
HY Lin, X Chen, JSR Jang
The 27th International Conference on Oriental Coordination and …, 2024
Cited by 2, 2024
A Preliminary Study of RAG for Taiwanese Historical Archives
C Lin*, PH Feng*, X Chen*, TL Yang, H Lee, JSR Jang
🏆 ROCLING Best Paper Award, at the 37th Conference on Computational …, 2025
2025