Masato Mimura
NTT corporation
Verified email at sap.ist.i.kyoto-u.ac.jp
Title
Cited by
Year
Statistical speech enhancement based on probabilistic integration of variational autoencoder and non-negative matrix factorization
Y Bando, M Mimura, K Itoyama, K Yoshii, T Kawahara
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 168; Year: 2018
Acoustic-to-word attention-based model complemented with character-level CTC-based model
S Ueno, H Inaguma, M Mimura, T Kawahara
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 78; Year: 2018
Distilling the knowledge of BERT for sequence-to-sequence ASR
H Futami, H Inaguma, S Ueno, M Mimura, S Sakai, T Kawahara
arXiv preprint arXiv:2008.03822, 2020
Cited by: 77; Year: 2020
Unsupervised speech enhancement based on multichannel NMF-informed beamforming for noise-robust automatic speech recognition
K Shimada, Y Bando, M Mimura, K Itoyama, K Yoshii, T Kawahara
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (5), 960-971, 2019
Cited by: 74; Year: 2019
Leveraging sequence-to-sequence speech synthesis for enhancing acoustic-to-word speech recognition
M Mimura, S Ueno, H Inaguma, S Sakai, T Kawahara
2018 IEEE Spoken Language Technology Workshop (SLT), 477-484, 2018
Cited by: 73; Year: 2018
Multi-speaker sequence-to-sequence speech synthesis for data augmentation in acoustic-to-word speech recognition
S Ueno, M Mimura, S Sakai, T Kawahara
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 47; Year: 2019
Bayesian learning of a language model from continuous speech
G Neubig, M Mimura, S Mori, T Kawahara
IEICE TRANSACTIONS on Information and Systems 95 (2), 614-625, 2012
Cited by: 47; Year: 2012
Uyghur morpheme-based language models and ASR
M Ablimit, G Neubig, M Mimura, S Mori, T Kawahara, A Hamdulla
IEEE 10th International Conference on Signal Processing Proceedings, 581-584, 2010
Cited by: 47; Year: 2010
Learning a language model from continuous speech.
G Neubig, M Mimura, S Mori, T Kawahara
INTERSPEECH, 1053-1056, 2010
Cited by: 46; Year: 2010
Enhancing monotonic multihead attention for streaming ASR
H Inaguma, M Mimura, T Kawahara
arXiv preprint arXiv:2005.09394, 2020
Cited by: 44; Year: 2020
Cross-domain speech recognition using nonparallel corpora with cycle-consistent adversarial networks
M Mimura, S Sakai, T Kawahara
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
Cited by: 34; Year: 2017
Continuous speech recognition consortium: an open repository for CSR tools and models
A Lee, T Kawahara, K Takeda, M Mimura, A Yamada, A Ito, K Itou, ...
Cited by: 31; Year: 2002
Waveform-domain speech enhancement using spectrogram encoding for robust speech recognition
H Shi, M Mimura, T Kawahara
IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 3049-3060, 2024
Cited by: 30; Year: 2024
ASR rescoring and confidence estimation with ELECTRA
H Futami, H Inaguma, M Mimura, S Sakai, T Kawahara
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Cited by: 29; Year: 2021
Data augmentation for ASR using TTS via a discrete representation
S Ueno, M Mimura, S Sakai, T Kawahara
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 68-75, 2021
Cited by: 29; Year: 2021
Joint optimization of denoising autoencoder and DNN acoustic model based on multi-target learning for noisy speech recognition
M Mimura, S Sakai, T Kawahara
Proc. Interspeech 2016, 3803-3807, 2016
Cited by: 28; Year: 2016
Speech dereverberation using long short-term memory
M Mimura, S Sakai, T Kawahara
Proc. Interspeech 2015, 2435-2439, 2015
Cited by: 28; Year: 2015
Time-domain speech enhancement assisted by multi-resolution frequency encoder and decoder
H Shi, M Mimura, L Wang, J Dang, T Kawahara
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 27; Year: 2023
Fixed point properties and second bounded cohomology of universal lattices on Banach space
M Mimura
arXiv preprint arXiv:0904.4650, 2009
Cited by: 27; Year: 2009
Automatic transcription system for meetings of the Japanese national congress
Y Akita, M Mimura, T Kawahara
Proc. InterSpeech 2009, 84-87, 2009
Cited by: 25; Year: 2009