[go: up one dir, main page]

Follow
Diego Castán
Diego Castán
Verified email at ibm.com
Title
Cited by
Cited by
Year
The speakers in the wild (SITW) speaker recognition database.
M McLaren, L Ferrer, D Castan, A Lawson
Interspeech, 818-822, 2016
3352016
How to train your speaker embeddings extractor
ML McLaren, D Castan, MK Nandwana, L Ferrer, E Yilmaz
Les Sables d'Olonne, France: ISCA, 2018
732018
The 2016 Speakers in the Wild Speaker Recognition Evaluation.
M McLaren, L Ferrer, D Castan, A Lawson
Interspeech, 823-827, 2016
672016
Virtual health assistant for promotion of well-being and independent living
D Vergyri, DC Lavilla, G Acharya, D Sahner, E Shriberg, JB Rogers, ...
US Patent 10,726,846, 2020
412020
Speaker detection in the wild: Lessons learned from JSALT 2019
P García, J Villalba, H Bredin, J Du, D Castan, A Cristia, L Bullock, L Guo, ...
arXiv preprint arXiv:1912.00938, 2019
392019
Tampered speaker inconsistency detection with phonetically aware audio-visual features
P Korshunov, M Halstead, D Castan, M Graciarena, M McLaren, B Burns, ...
International conference on machine learning, 2019
372019
Toward fail-safe speaker recognition: Trial-based calibration with a reject option
L Ferrer, MK Nandwana, M McLaren, D Castan, A Lawson
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (1), 140-153, 2018
322018
Audio segmentation-by-classification approach based on factor analysis in broadcast news domain
D Castán, A Ortega, A Miguel, E Lleida
EURASIP Journal on Audio, Speech, and Music Processing 2014 (1), 34, 2014
312014
Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains
D Castán, D Tavarez, P Lopez-Otero, J Franco-Pedroso, H Delgado, ...
EURASIP Journal on Audio, Speech, and Music Processing 2015 (1), 33, 2015
282015
ViVoLab and CVLab-MediaEval 2014: Violent Scenes Detection Affect Task.
D Castán, M Rodríguez, A Ortega, C Orrite, E Lleida
MediaEval, 2014
272014
Analysis of Critical Metadata Factors for the Calibration of Speaker Recognition Systems.
MK Nandwana, L Ferrer, M McLaren, D Castan, A Lawson
INTERSPEECH, 4325-4329, 2019
252019
Detecting synthetic speech manipulation in real audio recordings
MH Rahman, M Graciarena, D Castan, C Cobo-Kroenke, M McLaren, ...
2022 IEEE International Workshop on Information Forensics and Security (WIFS …, 2022
242022
Real-time class recognition for an audio stream
DC Lavilla, H Bratt, ML McLaren
US Patent 11,024,291, 2021
222021
The speakers in the wild speaker recognition challenge plan
M McLaren, A Lawson, L Ferrer, D Castan, M Graciarena
Proceedings of the Interspeech, 2016
192016
Language Recognition Using Triplet Neural Networks.
V Mingote, D Castan, M McLaren, MK Nandwana, AO Giménez, E Lleida, ...
INTERSPEECH, 4025-4029, 2019
152019
The Albayzin 2012 Audio Segmentation Evaluation
A Ortega, D Castan, A Miguel, E Lleida
152012
Approaches to Multi-domain Language Recognition.
M Mclaren, MK Nandwana, D Castán, L Ferrer
Odyssey, 90-97, 2018
132018
Context-aware communicator for all
P García, E Lleida, D Castán, JM Marcos, D Romero
International Conference on Universal Access in Human-Computer Interaction …, 2015
122015
Speaker-targeted synthetic speech detection
D Castan, MH Rahman, S Bakst, C Cobo-Kroenke, M McLaren, ...
Sandia National Lab.(SNL-NM), Albuquerque, NM (United States), 2022
112022
Inferring Stance from Prosody.
NG Ward, JC Carlson, O Fuentes, D Castan, E Shriberg, A Tsiartas
INTERSPEECH, 1447-1451, 2017
112017
The system can't perform the operation now. Try again later.
Articles 1–20