| TCD-TIMIT: An audio-visual corpus of continuous speech N Harte, E Gillen Multimedia, IEEE Transactions on 17 (5), 603-615, 2015 | 324 | 2015 |
| ViSQOL: an objective speech quality model A Hines, J Skoglund, AC Kokaram, N Harte EURASIP Journal on Audio, Speech, and Music Processing 2015 (1), 13, 2015 | 241 | 2015 |
| Speech intelligibility prediction using a neurogram similarity index measure A Hines, N Harte Speech Communication 54 (2), 306-320, 2012 | 121 | 2012 |
| Phoneme-to-viseme mapping for visual speech recognition L Cappelletta, N Harte International Conference on Pattern Recognition Applications and Methods 2 …, 2012 | 102 | 2012 |
| Attention-based audio-visual fusion for robust automatic speech recognition G Sterpu, C Saam, N Harte Proceedings of the 20th ACM International conference on Multimodal …, 2018 | 97 | 2018 |
| ViSQOL: The virtual speech quality objective listener A Hines, J Skoglund, A Kokaram, N Harte IWAENC 2012; international workshop on acoustic signal enhancement, 1-4, 2012 | 88 | 2012 |
| ViSQOLAudio: An objective audio quality metric for low bitrate codecs A Hines, E Gillen, D Kelly, J Skoglund, A Kokaram, N Harte The Journal of the Acoustical Society of America 137 (6), EL449-EL455, 2015 | 78 | 2015 |
| Objective assessment of perceptual audio quality using ViSQOLAudio C Sloan, N Harte, D Kelly, AC Kokaram, A Hines IEEE Transactions on Broadcasting 63 (4), 693-705, 2017 | 71 | 2017 |
| Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA A Hines, J Skoglund, A Kokaram, N Harte 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 69 | 2013 |
| Multimodal continuous turn-taking prediction using multiscale rnns M Roddy, G Skantze, N Harte Proceedings of the 20th ACM International Conference on Multimodal …, 2018 | 62 | 2018 |
| Viseme definitions comparison for visual-only speech recognition L Cappelletta, N Harte 2011 19th European Signal Processing Conference, 2109-2113, 2011 | 58 | 2011 |
| The effect of multimodal emotional expression and agent appearance on trust in human-agent interaction I Torre, E Carrigan, R McDonnell, K Domijan, K McCabe, N Harte Proceedings of the 12th ACM SIGGRAPH conference on motion, interaction and …, 2019 | 53 | 2019 |
| The limits of the mean opinion score for speech synthesis evaluation S Le Maguer, S King, N Harte Computer Speech & Language 84, 101577, 2024 | 51 | 2024 |
| TCD-VoIP, a research database of degraded speech for assessing quality in VoIP applications N Harte, E Gillen, A Hines 2015 Seventh International Workshop on Quality of Multimedia Experience …, 2015 | 51 | 2015 |
| Investigating Speech Features for Continuous Turn-Taking Prediction Using LSTMs M Roddy, G Skantze, N Harte Proc. Interspeech 2018, 586-590, 2018 | 49 | 2018 |
| Speaker verification in score-ageing-quality classification space F Kelly, A Drygajlo, N Harte Computer Speech & Language 27 (5), 1068-1084, 2013 | 49 | 2013 |
| Speaker verification with long-term ageing data F Kelly, A Drygajlo, N Harte 2012 5th IAPR international conference on biometrics (ICB), 478-483, 2012 | 48 | 2012 |
| Speech intelligibility from image processing A Hines, N Harte Speech Communication 52 (9), 736-752, 2010 | 44 | 2010 |
| How to teach DNNs to pay attention to the visual modality in speech recognition G Sterpu, C Saam, N Harte IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1052-1064, 2020 | 39 | 2020 |
| Multi-resolution cepstral features for phoneme recognition across speech sub-bands P McCourt, S Vaseght, N Harte Proceedings of the 1998 IEEE International Conference on Acoustics, Speech …, 1998 | 39 | 1998 |