Daniel Galvez
Title
Cited by
Year
Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI.
D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar, X Na, Y Wang, ...
Interspeech, 2751-2755, 2016
Cited by: 1079; Year: 2016
The People's Speech: A large-scale diverse English speech recognition dataset for commercial usage
D Galvez, G Diamos, J Ciro, JF Cerón, K Achorn, A Gopi, D Kanter, M Lam, ...
arXiv preprint arXiv:2111.09344, 2021
Cited by: 132; Year: 2021
Multilingual spoken words corpus
M Mazumder, S Chitlangia, C Banbury, Y Kang, JM Ciro, K Achorn, ...
Thirty-fifth Conference on Neural Information Processing Systems Datasets …, 2021
Cited by: 80; Year: 2021
Xingyu Na, Yiming Wang, and Sanjeev Khudanpur
D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar
Purely sequence-trained neural networks for ASR based on lattice-free MMI, 2016
Cited by: 15; Year: 2016
Speed of light exact greedy decoding for RNN-T speech recognition models on GPU
D Galvez, V Bataev, H Xu, T Kaldewey
arXiv preprint arXiv:2406.03791, 2024
Cited by: 12; Year: 2024
EMMeTT: Efficient multimodal machine translation training
P Żelasko, Z Chen, M Wang, D Galvez, O Hrinchuk, S Ding, K Hu, J Balam, ...
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
Cited by: 7; Year: 2025
Training and Inference Efficiency of Encoder-Decoder Speech Models
P Żelasko, K Dhawan, D Galvez, KC Puvvada, A Pasad, NR Koluguri, ...
arXiv preprint arXiv:2503.05931, 2025
Cited by: 6; Year: 2025
Label-looping: Highly efficient decoding for transducers
V Bataev, H Xu, D Galvez, V Lavrukhin, B Ginsburg
2024 IEEE Spoken Language Technology Workshop (SLT), 7-13, 2024
Cited by: 6; Year: 2024
Speech Wikimedia: A 77 language multilingual speech dataset
RM Gómez, J Eusse, J Ciro, D Galvez, R Hileman, K Bollacker, D Kanter
arXiv preprint arXiv:2308.15710, 2023
Cited by: 6; Year: 2023
GPU-accelerated WFST beam search decoder for CTC-based speech recognition
D Galvez, T Kaldewey
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
Cited by: 4; Year: 2023
LSH methods for data deduplication in a Wikipedia artificial dataset
J Ciro, D Galvez, T Schlippe, D Kanter
arXiv preprint arXiv:2112.11478, 2021
Cited by: 4; Year: 2021
Multiple-instance, cascaded classification for keyword spotting in narrow-band audio
A Abdulkader, K Nassar, M Mahmoud, D Galvez, C Patil
arXiv preprint arXiv:1711.08058, 2017
Cited by: 3; Year: 2017
Purely sequence-trained neural networks for ASR based on lattice-free MMI (author’s manuscript)
D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar, X Na, Y Wang, ...
The Johns Hopkins University Baltimore United States, Tech. Rep, 2016
Cited by: 3; Year: 2016
Speech Wikimedia: A 77 Language Multilingual Speech Dataset
R Mosquera Gómez, J Eusse, J Ciro, D Galvez, R Hileman, K Bollacker, ...
arXiv e-prints, arXiv: 2308.15710, 2023
Year: 2023