Junjie Li
Other names: 李 俊杰
Ph.D., The Hong Kong Polytechnic University; Graduate Student, Tianjin University
Verified email at connect.polyu.hk
Title
Cited by
Year
Information bottleneck theory on convolutional neural networks
J Li, D Liu
Neural Processing Letters 53 (2), 1385-1400, 2021
Cited by: 25; Year: 2021
Wesep: A scalable and flexible toolkit towards generalizable target speaker extraction
S Wang, K Zhang, S Lin, J Li, X Wang, M Ge, J Yu, Y Qian, H Li
arXiv preprint arXiv:2409.15799, 2024
Cited by: 21; Year: 2024
Rethinking the visual cues in audio-visual speaker extraction
J Li, M Ge, R Cao, L Wang, J Dang, S Zhang
arXiv preprint arXiv:2306.02625, 2023
Cited by: 13; Year: 2023
VCSE: Time-domain visual-contextual speaker extraction network
J Li, M Ge, Z Pan, L Wang, J Dang
arXiv preprint arXiv:2210.06177, 2022
Cited by: 12; Year: 2022
Multi-level speaker representation for target speaker extraction
K Zhang, J Li, S Wang, Y Wei, Y Wang, Y Wang, H Li
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
Cited by: 11; Year: 2025
On the effectiveness of enrollment speech augmentation for target speaker extraction
J Li, K Zhang, S Wang, H Li, MW Mak, KA Lee
2024 IEEE Spoken Language Technology Workshop (SLT), 325-332, 2024
Cited by: 9; Year: 2024
Audio-visual active speaker extraction for sparsely overlapped multi-talker speech
J Li, R Tao, Z Pan, M Ge, S Wang, H Li
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 9; Year: 2024
Audio-visual target speaker extraction with selective auditory attention
R Tao, X Qian, Y Jiang, J Li, J Wang, H Li
IEEE Transactions on Audio, Speech and Language Processing, 2025
Cited by: 7; Year: 2025
Deep multi-task cascaded acoustic echo cancellation and noise suppression
J Li, M Ge, L Wang, J Dang
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 6; Year: 2022
Audio-visual target speaker extraction with reverse selective auditory attention
R Tao, X Qian, Y Jiang, J Li, J Wang, H Li
arXiv preprint arXiv:2404.18501, 2024
Cited by: 4; Year: 2024
Stream attention based U-Net for L3DAS23 challenge
H Wang, Y Fu, J Li, M Ge, L Wang, X Qian
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 4; Year: 2023
Momuse: Momentum multi-modal target speaker extraction for real-time scenarios with impaired visual cues
J Li, K Zhang, S Wang, KA Lee, MW Mak, H Li
2025 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2025
Cited by: 2; Year: 2025
Enhancing Speaker Extraction Through Rectifying Target Confusion
J Wang, S Wang, J Li, K Zhang, Y Qian, H Li
2024 IEEE Spoken Language Technology Workshop (SLT), 349-356, 2024
Cited by: 2; Year: 2024
MeMo: Attentional Momentum for Real-time Audio-visual Speaker Extraction under Impaired Visual Conditions
J Li, W Wu, S Wang, Z Pan, KA Lee, H Meng, H Li
arXiv preprint arXiv:2507.15294, 2025
Cited by: 1; Year: 2025
Listen to the Speaker in Your Gaze
H Yang, X Chen, J Li, H Huang, S Cai, H Li
2024 IEEE International Conference on Cybernetics and Intelligent Systems …, 2024
Cited by: 1; Year: 2024
QAMO: Quality-aware Multi-centroid One-class Learning For Speech Deepfake Detection
DT Truong, T Liu, R Tao, J Li, KA Lee, ES Chng
arXiv preprint arXiv:2509.20679, 2025
Year: 2025
Addressing Gradient Misalignment in Data-Augmented Training for Robust Speech Deepfake Detection
DT Truong, T Liu, J Li, R Tao, KA Lee, ES Chng
arXiv preprint arXiv:2509.20682, 2025
Year: 2025
Xi+: Uncertainty Supervision for Robust Speaker Embedding
J Li, KA Lee, DT Truong, T Liu, MW Mak
arXiv preprint arXiv:2509.05993, 2025
Year: 2025
Do We Really Need GNNs with Explicit Structural Modeling? MLPs Suffice for Language Model Representations
L Zhou, H Jiang, J Li, Z Zhao, F Jiang, W Chen, H Li
arXiv preprint arXiv:2506.21682, 2025
Year: 2025