[go: up one dir, main page]

Follow
Kai Li
Title
Cited by
Cited by
Year
Transformers and audio detection tasks: An overview
K Zaman, K Li, M Sah, C Direkoglu, S Okada, M Unoki
Digital Signal Processing 158, 104956, 2025
212025
Contributions of jitter and shimmer in the voice for fake audio detection
K Li, X Lu, M Akagi, M Unoki
IEEE Access 11, 84689-84698, 2023
172023
Analysis of spectro-temporal modulation representation for deep-fake speech detection
H Cheng, CO Mawalim, K Li, L Wang, M Unoki
2023 Asia Pacific Signal and Information Processing Association Annual …, 2023
102023
Segment-level effects of gender, nationality and emotion information on text-independent speaker verification
K Li, M Akagi, Y Wu
INTERSPEECH, 2020, pp.2987-2991, 2020
102020
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection
K Li, S Li, X Lu, M Akagi, M Liu, L Zhang, C Zeng, L Wang, J Dang, ...
INTERSPEECH, 2022, 2022
92022
Deepfake speech detection: approaches from acoustic features related to auditory perception to deep neural networks
M Unoki, K Li, A Chaiwongyen, QH Nguyen, K Zaman
IEICE Transactions on Information and Systems, 2024
82024
UNSUPERVISED ANOMALOUS SOUND DETECTION FOR MACHINE CONDITION MONITORING USING TEMPORAL MODULATION FEATURES ON GAMMATONE AUDITORY FILTERBANK
K Li, QH Nguyen, Y Ota, M Unoki
DCASE2022, 2022
72022
Advances in speech separation: Techniques, challenges, and future trends
K Li, G Chen, W Sang, Y Luo, Z Chen, S Wang, S He, ZQ Wang, A Li, ...
arXiv preprint arXiv:2508.10830, 2025
62025
Ability of human auditory perception to distinguish human-imitated speech
K Zaman, K Li, IJAM Samiul, Y Uezu, S Kidani, M Unoki
IEEE Access, 2025
62025
Automatic Mean Opinion Score Estimation with Temporal Modulation Features on Gammatone Filterbank for Speech Assessment
QH Nguyen, K Li, M Unoki
INTERSPEECH, 2022, 2022
62022
Analysis of Amplitude and Frequency Perturbation in the Voice for Fake Audio Detection
K Li, Y Wang, ML Nguyen, M Akagi, M Unoki
2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022
52022
Deep spectro-temporal artifacts for detecting synthesized speech
X Liu, M Liu, L Zhang, L Zhang, C Zeng, K Li, N Li, KA Lee, L Wang, ...
Proceedings of the 1st International Workshop on Deepfake Detection for …, 2022
52022
Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks
K Li, K Zaman, X Li, M Akagi, J Dang, M Unoki
IEEE Transactions on Audio, Speech and Language Processing, 2025
42025
Berp: A blind estimator of room acoustic and physical parameters for single-channel noisy speech signals
L Wang, Y Lu, Z Gao, K Li, J Huang, Y Kong, S Okada
arXiv preprint arXiv:2405.04476, 2024
42024
BERP: A Blind Estimator of Room Parameters for Single-Channel Noisy Speech Signals
L Wang, Y Lu, Z Gao, K Li, J Huang, Y Kong, S Okada
IEEE Transactions on Audio, Speech and Language Processing, 2025
32025
Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network
K Li, X Lu, M Akagi, J Dang, S Li, M Unoki
IEEE EUSIPCO, 2022, 2022
32022
Study on simultaneous estimation of glottal source and vocal tract parameters by armax-lf model for speech analysis/synthesis
K Li, M Unoki, Y Li, J Dang, M Akagi
2021 Asia-Pacific Signal and Information Processing Association Annual …, 2021
22021
Modeling and Estimation of Vocal Tract and Glottal Source Parameters Using ARMAX-LF Model
K Lia, M Akagia, Y Lib, M Unokia
arXiv preprint arXiv:2410.04704, 2024
12024
Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection
K Li, DK Tran, X Lu, M Akagi, M Unoki
2023 31st European Signal Processing Conference (EUSIPCO), 201-205, 2023
12023
Study on Ability of Human Auditory Perception to Distinguish Human-imitated Speech
K ZAMAN, IJAM SAMIUL, K LI, Y UEZU, S KIDANI, M UNOKI
2025
The system can't perform the operation now. Try again later.
Articles 1–20