Kai Li

Cited by

	All	Since 2021
Citations	128	128
h-index	7	7
i10-index	4	4

2021202220232024202520263 4 11 27 71 11

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Masashi UnokiJAISTVerified email at jaist.ac.jp
Masato AkagiProfessor of Japan Advanced Institute of Science and TechnologyVerified email at jaist.ac.jp
Khalid ZAMAN, PhDJapan Advance Institute of Science and TechnologyVerified email at jaist.ac.jp
Jianwu DangJAIST, Japan / SIAT, ChinaVerified email at jaist.ac.jp
Xugang LuNational Institute of Information and Communications Technology (NICT), JapanVerified email at nict.go.jp
Xingfeng LiCity University of MacauVerified email at cityu.edu.mo

Kai Li

Japan Advanced Institute of Science and Technology

Verified email at jaist.ac.jp

Speaker individuality Speaker anonymization Physiological parameters estimation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Transformers and audio detection tasks: An overview K Zaman, K Li, M Sah, C Direkoglu, S Okada, M Unoki Digital Signal Processing 158, 104956, 2025	21	2025
Contributions of jitter and shimmer in the voice for fake audio detection K Li, X Lu, M Akagi, M Unoki IEEE Access 11, 84689-84698, 2023	17	2023
Analysis of spectro-temporal modulation representation for deep-fake speech detection H Cheng, CO Mawalim, K Li, L Wang, M Unoki 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023	10	2023
Segment-level effects of gender, nationality and emotion information on text-independent speaker verification K Li, M Akagi, Y Wu INTERSPEECH, 2020, pp.2987-2991, 2020	10	2020
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection K Li, S Li, X Lu, M Akagi, M Liu, L Zhang, C Zeng, L Wang, J Dang, ... INTERSPEECH, 2022, 2022	9	2022
Deepfake speech detection: approaches from acoustic features related to auditory perception to deep neural networks M Unoki, K Li, A Chaiwongyen, QH Nguyen, K Zaman IEICE Transactions on Information and Systems, 2024	8	2024
UNSUPERVISED ANOMALOUS SOUND DETECTION FOR MACHINE CONDITION MONITORING USING TEMPORAL MODULATION FEATURES ON GAMMATONE AUDITORY FILTERBANK K Li, QH Nguyen, Y Ota, M Unoki DCASE2022, 2022	7	2022
Advances in speech separation: Techniques, challenges, and future trends K Li, G Chen, W Sang, Y Luo, Z Chen, S Wang, S He, ZQ Wang, A Li, ... arXiv preprint arXiv:2508.10830, 2025	6	2025
Ability of human auditory perception to distinguish human-imitated speech K Zaman, K Li, IJAM Samiul, Y Uezu, S Kidani, M Unoki IEEE Access, 2025	6	2025
Automatic Mean Opinion Score Estimation with Temporal Modulation Features on Gammatone Filterbank for Speech Assessment QH Nguyen, K Li, M Unoki INTERSPEECH, 2022, 2022	6	2022
Analysis of Amplitude and Frequency Perturbation in the Voice for Fake Audio Detection K Li, Y Wang, ML Nguyen, M Akagi, M Unoki 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022	5	2022
Deep spectro-temporal artifacts for detecting synthesized speech X Liu, M Liu, L Zhang, L Zhang, C Zeng, K Li, N Li, KA Lee, L Wang, ... Proceedings of the 1st International Workshop on Deepfake Detection for …, 2022	5	2022
Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks K Li, K Zaman, X Li, M Akagi, J Dang, M Unoki IEEE Transactions on Audio, Speech and Language Processing, 2025	4	2025
Berp: A blind estimator of room acoustic and physical parameters for single-channel noisy speech signals L Wang, Y Lu, Z Gao, K Li, J Huang, Y Kong, S Okada arXiv preprint arXiv:2405.04476, 2024	4	2024
BERP: A Blind Estimator of Room Parameters for Single-Channel Noisy Speech Signals L Wang, Y Lu, Z Gao, K Li, J Huang, Y Kong, S Okada IEEE Transactions on Audio, Speech and Language Processing, 2025	3	2025
Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network K Li, X Lu, M Akagi, J Dang, S Li, M Unoki IEEE EUSIPCO, 2022, 2022	3	2022
Study on simultaneous estimation of glottal source and vocal tract parameters by armax-lf model for speech analysis/synthesis K Li, M Unoki, Y Li, J Dang, M Akagi 2021 Asia-Pacific Signal and Information Processing Association Annual …, 2021	2	2021
Modeling and Estimation of Vocal Tract and Glottal Source Parameters Using ARMAX-LF Model K Lia, M Akagia, Y Lib, M Unokia arXiv preprint arXiv:2410.04704, 2024	1	2024
Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection K Li, DK Tran, X Lu, M Akagi, M Unoki 2023 31st European Signal Processing Conference (EUSIPCO), 201-205, 2023	1	2023
Study on Ability of Human Auditory Perception to Distinguish Human-imitated Speech K ZAMAN, IJAM SAMIUL, K LI, Y UEZU, S KIDANI, M UNOKI		2025

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors