Zhiyao Duan
Professor of Electrical and Computer Engineering, University of Rochester
Verified email at rochester.edu
Title
Cited by
Year
Audio-visual event localization in unconstrained videos
Y Tian, J Shi, B Li, Z Duan, C Xu
Proceedings of the European Conference on Computer Vision (ECCV), 247-263, 2018
Cited by: 696; Year: 2018
Hierarchical cross-modal talking face generation with dynamic pixel-wise loss
L Chen, RK Maddox, Z Duan, C Xu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 563; Year: 2019
Automatic Music Transcription: An Overview
E Benetos, S Dixon, Z Duan, S Ewert
IEEE Signal Processing Magazine 36 (1), 20-30, 2018
Cited by: 429; Year: 2018
Lip movements generation at a glance
L Chen, Z Li, RK Maddox, Z Duan, C Xu
Proceedings of the European Conference on Computer Vision (ECCV), 520-535, 2018
Cited by: 351; Year: 2018
One-class learning towards synthetic voice spoofing detection
Y Zhang, F Jiang, Z Duan
IEEE Signal Processing Letters 28, 937-941, 2021
Cited by: 338; Year: 2021
Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications
B Li, X Liu, K Dinesh, Z Duan, G Sharma
IEEE Transactions on Multimedia 21 (2), 522-535, 2018
Cited by: 262; Year: 2018
Deep Cross-Modal Audio-Visual Generation
L Chen, S Srivastava, Z Duan, C Xu
Proceedings of the Thematic Workshops of ACM Multimedia 2017, 349-357, 2017
Cited by: 259; Year: 2017
Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions
Z Duan, B Pardo, C Zhang
IEEE Transactions on Audio, Speech, and Language Processing 18 (8), 2121-2133, 2010
Cited by: 249; Year: 2010
Soundprism: An online system for score-informed source separation of music audio
Z Duan, B Pardo
IEEE Journal of Selected Topics in Signal Processing 5 (6), 1205-1215, 2011
Cited by: 146; Year: 2011
Speech driven talking face generation from a single image and an emotion condition
SE Eskimez, Y Zhang, Z Duan
IEEE Transactions on Multimedia 24, 3480-3490, 2021
Cited by: 130; Year: 2021
Unsupervised single-channel music source separation by average harmonic structure modeling
Z Duan, Y Zhang, C Zhang, Z Shi
IEEE Transactions on Audio, Speech, and Language Processing 16 (4), 766-778, 2008
Cited by: 128; Year: 2008
Bidirectional GRU for sound event detection
R Lu, Z Duan
Detection and Classification of Acoustic Scenes and Events, 1-3, 2017
Cited by: 92; Year: 2017
Unsupervised Learning Approach to Feature Analysis for Automatic Speech Emotion Recognition
SE Eskimez, Z Duan, W Heinzelman
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 91; Year: 2018
RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning
N Jiang, S Jin, Z Duan, C Zhang
arXiv preprint arXiv:2002.03082, 2020
Cited by: 87; Year: 2020
Generating talking face landmarks from speech
SE Eskimez, RK Maddox, C Xu, Z Duan
International conference on latent variable analysis and signal separation …, 2018
Cited by: 79; Year: 2018
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021
X Chen, Y Zhang, G Zhu, Z Duan
arXiv preprint arXiv:2107.12018, 2021
Cited by: 77; Year: 2021
Siamese Style Convolutional Neural Networks for Sound Search by Vocal Imitation
Y Zhang, B Pardo, Z Duan
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (2), 429-441, 2018
Cited by: 75; Year: 2018
Multi-pitch streaming of harmonic sound mixtures
Z Duan, J Han, B Pardo
IEEE/ACM Transactions on Audio, Speech, and Language Processing 22 (1), 138-150, 2014
Cited by: 72; Year: 2014
Singfake: Singing voice deepfake detection
Y Zang, Y Zhang, M Heydari, Z Duan
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 68; Year: 2024