Zhiyao Duan

Cited by

	All	Since 2021
Citations	6576	4882
h-index	35	29
i10-index	100	76

1300

650

325

975

20092010201120122013201420152016201720182019202020212022202320242025202617 23 42 48 76 94 96 106 140 225 322 463 612 821 1000 1195 1225 22

Public access

View all

66 articles

1 article

available

not available

Based on funding mandates

Co-authors

You ZhangDolby Laboratories, University of RochesterVerified email at rochester.edu
Chenliang XuAssociate Professor of Computer Science, University of RochesterVerified email at rochester.edu
Bryan PardoComputer Science, Northwestern UniversityVerified email at northwestern.edu
Changshui ZhangDept. Automation, Tsinghua University, Beijing, ChinaVerified email at mail.tsinghua.edu.cn
Ross K MaddoxUniversity of MichiganVerified email at umich.edu
Sefik Emre EskimezMicrosoftVerified email at microsoft.com
Ge ZhuAdobe Research, Music AIVerified email at adobe.com
Lele ChenResearch Scientist @ MetaVerified email at meta.com
Fei JiangTencent TechnologyVerified email at tencent.com
Rui LuTsinghua UniversityVerified email at tsinghua.org.cn
Yichi ZhangAppleVerified email at apple.com
Andrea Cogliati, PhDLighTopTech Corp.Verified email at lightoptech.com
Frank CwitkowitzPhD Student, University of RochesterVerified email at ur.rochester.edu
Gaurav SharmaUniversity of RochesterVerified email at rochester.edu
Mojtaba (Moji) HeydariApple, University of RochesterVerified email at apple.com
Yongyi ZangSmule, Inc.Verified email at smule.com
Wendi B HeinzelmanUniversity of RochesterVerified email at rochester.edu
Yapeng TianAssistant Professor, University of Texas at DallasVerified email at utdallas.edu
Jing ShiResearch Scientist, AdobeVerified email at adobe.com
Yujia YanUniversity of RochesterVerified email at rochester.edu

Zhiyao Duan

Professor of Electrical and Computer Engineering, University of Rochester

Verified email at rochester.edu - Homepage

Computer Audition Music Information Retrieval Speech Processing Audiovisual Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Audio-visual event localization in unconstrained videos Y Tian, J Shi, B Li, Z Duan, C Xu Proceedings of the European Conference on Computer Vision (ECCV), 247-263, 2018	696	2018
Hierarchical cross-modal talking face generation with dynamic pixel-wise loss L Chen, RK Maddox, Z Duan, C Xu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	563	2019
Automatic Music Transcription: An Overview E Benetos, S Dixon, Z Duan, S Ewert IEEE Signal Processing Magazine 36 (1), 20-30, 2018	429	2018
Lip movements generation at a glance L Chen, Z Li, RK Maddox, Z Duan, C Xu Proceedings of the European Conference on Computer Vision (ECCV), 520-535, 2018	351	2018
One-class learning towards synthetic voice spoofing detection Y Zhang, F Jiang, Z Duan IEEE Signal Processing Letters 28, 937-941, 2021	338	2021
Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications B Li, X Liu, K Dinesh, Z Duan, G Sharma IEEE Transactions on Multimedia 21 (2), 522-535, 2018	262	2018
Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications B Li, X Liu, K Dinesh, Z Duan, G Sharma IEEE Transactions on Multimedia 21 (2), 522-535, 2018	262	2018
Deep Cross-Modal Audio-Visual Generation L Chen, S Srivastava, Z Duan, C Xu Proceedings of the on Thematic Workshops of ACM Multimedia 2017, 349-357, 2017	259	2017
Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions Z Duan, B Pardo, C Zhang IEEE Transactions on Audio, Speech, and Language Processing 18 (8), 2121-2133, 2010	249	2010
Soundprism: An online system for score-informed source separation of music audio Z Duan, B Pardo IEEE Journal of Selected Topics in Signal Processing 5 (6), 1205-1215, 2011	146	2011
Speech driven talking face generation from a single image and an emotion condition SE Eskimez, Y Zhang, Z Duan IEEE Transactions on Multimedia 24, 3480-3490, 2021	130	2021
Unsupervised single-channel music source separation by average harmonic structure modeling Z Duan, Y Zhang, C Zhang, Z Shi IEEE Transactions on Audio, Speech, and Language Processing 16 (4), 766-778, 2008	128	2008
Bidirectional GRU for sound event detection R Lu, Z Duan Detection and Classification of Acoustic Scenes and Events, 1-3, 2017	92	2017
Unsupervised Learning Approach to Feature Analysis for Automatic Speech Emotion Recognition SE Eskimez, Z Duan, W Heinzelman 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	91	2018
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning N Jiang, S Jin, Z Duan, C Zhang arXiv preprint arXiv:2002.03082, 2020	87	2020
Generating talking face landmarks from speech SE Eskimez, RK Maddox, C Xu, Z Duan International conference on latent variable analysis and signal separation …, 2018	79	2018
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021 X Chen, Y Zhang, G Zhu, Z Duan arXiv preprint arXiv:2107.12018, 2021	77	2021
Siamese Style Convolutional Neural Networks for Sound Search by Vocal Imitation Y Zhang, B Pardo, Z Duan IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (2), 429-441, 2018	75	2018
Multi-pitch streaming of harmonic sound mixtures Z Duan, J Han, B Pardo IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (1), 138-150, 2014	72	2014
Singfake: Singing voice deepfake detection Y Zang, Y Zhang, M Heydari, Z Duan ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	68	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors