| Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025 | 1217 | 2025 |
| A novel scheme for speaker recognition using a phonetically-aware deep neural network Y Lei, N Scheffer, L Ferrer, M McLaren 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 811 | 2014 |
| Advances in deep neural network approaches to speaker recognition M McLaren, Y Lei, L Ferrer 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 171 | 2015 |
| Towards noise-robust speaker recognition using probabilistic linear discriminant analysis Y Lei, L Burget, L Ferrer, M Graciarena, N Scheffer 2012 IEEE international conference on acoustics, speech and signal …, 2012 | 125 | 2012 |
| Content-aware speaker recognition N Scheffer, Y Lei US Patent 9,336,781, 2016 | 119 | 2016 |
| Study of senone-based deep neural network approaches for spoken language recognition L Ferrer, Y Lei, M McLaren, N Scheffer IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (1), 105-116, 2015 | 104 | 2015 |
| Promoting robustness for speaker modeling in the community: the PRISM evaluation set L Ferrer, H Bratt, L Burget, H Cernocky, O Glembek, M Graciarena, ... Proceedings of NIST 2011 workshop, 1-7, 2011 | 98 | 2011 |
| Application of Convolutional Neural Networks to Language Identification in Noisy Conditions. Y Lei, L Ferrer, A Lawson, M McLaren, N Scheffer Odyssey, 2014 | 94 | 2014 |
| Application of convolutional neural networks to speaker recognition in noisy conditions. M McLaren, Y Lei, N Scheffer, L Ferrer INTERSPEECH, 686-690, 2014 | 92 | 2014 |
| A noise robust i-vector extractor using vector Taylor series for speaker recognition Y Lei, L Burget, N Scheffer | 92 | 2013 |
| Dialect classification via text-independent training and testing for Arabic, Spanish, and Chinese Y Lei, JHL Hansen IEEE Transactions on Audio, Speech, and Language Processing 19 (1), 85-96, 2010 | 86 | 2010 |
| ASR error detection using recurrent neural network language model and complementary ASR YC Tam, Y Lei, J Zheng, W Wang 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 79 | 2014 |
| A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation L Ferrer, M McLaren, N Scheffer, Y Lei, M Graciarena, V Mitra | 64 | 2013 |
| Evaluating robust features on deep neural networks for speech recognition in noisy and channel mismatched conditions. V Mitra, W Wang, H Franco, Y Lei, C Bartels, M Graciarena Interspeech, 895-899, 2014 | 57 | 2014 |
| All for one: feature combination for highly channel-degraded speech activity detection. M Graciarena, A Alwan, D Ellis, H Franco, L Ferrer, JHL Hansen, A Janin, ... INTERSPEECH, 709-713, 2013 | 55 | 2013 |
| Unscented transform for ivector-based noisy speaker recognition D Martinez, L Burget, T Stafylakis, Y Lei, P Kenny, E Lleida 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 37 | 2014 |
| Improving speaker identification robustness to highly channel-degraded speech through multiple system fusion M McLaren, N Scheffer, M Graciarena, L Ferrer, Y Lei 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 36 | 2013 |
| Dialect identification: Impact of differences between read versus spontaneous speech G Liu, Y Lei, JHL Hansen 2010 18th European Signal Processing Conference, 2003-2006, 2010 | 36 | 2010 |
| Robust feature front-end for speaker identification G Liu, Y Lei, JHL Hansen 2012 IEEE international conference on acoustics, speech and signal …, 2012 | 35 | 2012 |
| Multi-sample conversational voice verification N Scheffer, Y Lei, DA Bercow US Patent 9,251,792, 2016 | 34 | 2016 |