| Statistical speech enhancement based on probabilistic integration of variational autoencoder and non-negative matrix factorization Y Bando, M Mimura, K Itoyama, K Yoshii, T Kawahara 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 168 | 2018 |
| Acoustic-to-word attention-based model complemented with character-level CTC-based model S Ueno, H Inaguma, M Mimura, T Kawahara 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 78 | 2018 |
| Distilling the knowledge of BERT for sequence-to-sequence ASR H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara arXiv preprint arXiv:2008.03822, 2020 | 77 | 2020 |
| Unsupervised speech enhancement based on multichannel NMF-informed beamforming for noise-robust automatic speech recognition K Shimada, Y Bando, M Mimura, K Itoyama, K Yoshii, T Kawahara IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (5), 960-971, 2019 | 74 | 2019 |
| Leveraging sequence-to-sequence speech synthesis for enhancing acoustic-to-word speech recognition M Mimura, S Ueno, H Inaguma, S Sakai, T Kawahara 2018 IEEE Spoken Language Technology Workshop (SLT), 477-484, 2018 | 73 | 2018 |
| Multi-speaker sequence-to-sequence speech synthesis for data augmentation in acoustic-to-word speech recognition S Ueno, M Mimura, S Sakai, T Kawahara ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 47 | 2019 |
| Bayesian learning of a language model from continuous speech G Neubig, M Mimura, S Mori, T Kawahara IEICE TRANSACTIONS on Information and Systems 95 (2), 614-625, 2012 | 47 | 2012 |
| Uyghur morpheme-based language models and ASR M Ablimit, G Neubig, M Mimura, S Mori, T Kawahara, A Hamdulla IEEE 10th International Conference on Signal Processing Proceedings, 581-584, 2010 | 47 | 2010 |
| Learning a language model from continuous speech. G Neubig, M Mimura, S Mori, T Kawahara INTERSPEECH, 1053-1056, 2010 | 46 | 2010 |
| Enhancing monotonic multihead attention for streaming asr H Inaguma, M Mimura, T Kawahara arXiv preprint arXiv:2005.09394, 2020 | 44 | 2020 |
| Cross-domain speech recognition using nonparallel corpora with cycle-consistent adversarial networks M Mimura, S Sakai, T Kawahara 2017 IEEE automatic speech recognition and understanding workshop (ASRU …, 2017 | 34 | 2017 |
| Continuous speech recognition consortium: an open repository for CSR tools and models A Lee, T Kawahara, K Takeda, M Mimura, A Yamada, A Ito, K Itou, ... | 31 | 2002 |
| Waveform-domain speech enhancement using spectrogram encoding for robust speech recognition H Shi, M Mimura, T Kawahara IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 3049-3060, 2024 | 30 | 2024 |
| Asr rescoring and confidence estimation with electra H Futami, H Inaguma, M Mimura, S Sakai, T Kawahara 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 29 | 2021 |
| Data augmentation for asr using tts via a discrete representation S Ueno, M Mimura, S Sakai, T Kawahara 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 68-75, 2021 | 29 | 2021 |
| Joint optimization of denoising autoencoder and dnn acoustic model based on multi-target learning for noisy speech recognition M Mimura, S Sakai, T Kawahara Proc. Interspeech 2016, 3803-3807, 2016 | 28 | 2016 |
| Speech dereverberation using long short-term memory M Mimura, S Sakai, T Kawahara Proc. Interspeech 2015, 2435-2439, 2015 | 28 | 2015 |
| Time-domain speech enhancement assisted by multi-resolution frequency encoder and decoder H Shi, M Mimura, L Wang, J Dang, T Kawahara ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 27 | 2023 |
| Fixed point properties and second bounded cohomology of universal lattices on Banach space M Mimura arXiv preprint arXiv:0904.4650, 2009 | 27 | 2009 |
| Automatic transcription system for meetings of the Japanese national congress Y Akita, M Mimura, T Kawahara Proc. InterSpeech 2009, 84-87, 2009 | 25 | 2009 |