[go: up one dir, main page]

Follow
Matthew Wiesner
Matthew Wiesner
Research Scientist, Johns Hopkins University
Verified email at jhu.edu - Homepage
Title
Cited by
Cited by
Year
ESPnet: End-to-end speech processing toolkit
S Watanabe, T Hori, S Karita, T Hayashi, J Nishitoba, Y Unno, NEY Soplin, ...
arXiv preprint arXiv:1804.00015, 2018
19612018
The multilingual tedx corpus for speech recognition and translation
E Salesky, M Wiesner, J Bremerman, R Cattoni, M Negri, M Turchi, ...
arXiv preprint arXiv:2102.01757, 2021
1782021
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling
J Cho, MK Baskar, R Li, M Wiesner, SH Mallidi, N Yalta, M Karafiat, ...
2018 IEEE Spoken Language Technology Workshop (SLT), 521-527, 2018
1662018
Findings of the iwslt 2022 evaluation campaign
A Anastasopoulos, L Barrault, L Bentivogli, M Zanon-Boito, O Bojar, ...
Proceedings of the 19th international conference on spoken language …, 2022
1252022
Massively multilingual adversarial speech recognition
A Oliver, M Wiesner, S Watanabe, D Yarowsky
Proceedings of North American Chapter of the Association for Computational …, 2019
92*2019
The CHiME-7 DASR challenge: Distant meeting transcription with multiple devices in diverse scenarios
S Cornell, M Wiesner, S Watanabe, D Raj, X Chang, P Garcia, ...
arXiv preprint arXiv:2306.13734, 2023
882023
Multi-modal data augmentation for end-to-end ASR
A Renduchintala, S Ding, M Wiesner, S Watanabe
arXiv preprint arXiv:1803.10299, 2018
732018
The Kaldi OpenKWS System: Improving Low Resource Keyword Search.
J Trmal, M Wiesner, V Peddinti, X Zhang, P Ghahremani, Y Wang, ...
Interspeech, 3597-3601, 2017
502017
A corpus for large-scale phonetic typology
E Salesky, E Chodroff, T Pimentel, M Wiesner, R Cotterell, AW Black, ...
arXiv preprint arXiv:2005.13962, 2020
402020
The CHiME-8 DASR challenge for generalizable and array agnostic distant automatic speech recognition and diarization
S Cornell, T Park, S Huang, C Boeddeker, X Chang, M Maciejewski, ...
arXiv preprint arXiv:2407.16447, 2024
312024
Towards zero-shot code-switched speech recognition
B Yan, M Wiesner, O Klejch, P Jyothi, S Watanabe
ICASSP 2023-2023 IEEE International Conference On Acoustics, Speech And …, 2023
262023
Topic identification for speech without asr
C Liu, J Trmal, M Wiesner, C Harman, S Khudanpur
arXiv preprint arXiv:1703.07476, 2017
232017
Less peaky and more accurate ctc forced alignment by label priors
R Huang, X Zhang, Z Ni, L Sun, M Hira, J Hwang, V Manohar, V Pratap, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
202024
Pretraining by backtranslation for end-to-end ASR in low-resource settings
M Wiesner, A Renduchintala, S Watanabe, C Liu, N Dehak, S Khudanpur
arXiv preprint arXiv:1812.03919, 2018
20*2018
Analysis of multilingual sequence-to-sequence speech recognition systems
M Karafiát, MK Baskar, S Watanabe, T Hori, M Wiesner, J Černocký
arXiv preprint arXiv:1811.03451, 2018
182018
Automatic speech recognition and topic identification for almost-zero-resource languages
M Wiesner, C Liu, L Ondel, C Harman, V Manohar, J Trmal, Z Huang, ...
arXiv preprint arXiv:1802.08731, 2018
172018
Target speaker asr with whisper
A Polok, D Klement, M Wiesner, S Khudanpur, J Černocký, L Burget
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
162025
HLTCOE JHU submission to the Voice Privacy challenge 2024
HL Xinyuan, Z Cai, A Garg, K Duh, LP García-Perera, S Khudanpur, ...
arXiv preprint arXiv:2409.08913, 2024
162024
End-to-end ASR to jointly predict transcriptions and linguistic annotations
M Omachi, Y Fujita, S Watanabe, M Wiesner
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
162021
Bypass temporal classification: Weakly supervised automatic speech recognition with imperfect transcripts
D Gao, M Wiesner, H Xu, LP Garcia, D Povey, S Khudanpur
arXiv preprint arXiv:2306.01031, 2023
132023
The system can't perform the operation now. Try again later.
Articles 1–20