Juan Carlos Leon Alcazar

Cited by

	All	Since 2021
Citations	734	711
h-index	11	11
i10-index	12	12

240

120

180

2019202020212022202320242025202611 6 27 85 148 210 233 7

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Bernard GhanemProfessor, King Abdullah University of Science and TechnologyVerified email at kaust.edu.sa
Fabian Caba HeilbronResearch Assistant, King Abdullah University of Science and TechnologyVerified email at kaust.edu.sa
Alejandro PardoPhD StudentVerified email at kaust.edu.sa
Pablo ArbelaezUniversidad de los Andes, Bogota, ColombiaVerified email at uniandes.edu.co
Chen ZhaoIncoming Professor at Harbin Institute of Technology, Shenzhen; Research Scientist at KAUSTVerified email at kaust.edu.sa
Ali ThabetMeta Superintelligence LabVerified email at meta.com
Mattia SoldanSenior Research Engineer, AdobeVerified email at adobe.com
Kumail AlhamoudPhD Student, MITVerified email at mit.edu
Victor EscorciaSamsung AI Center - CambridgeVerified email at kaust.edu.sa
Maria A. BravoPost-Doctoral researcher at TUM and Helmholtz Munich, GermanyVerified email at cs.uni-freiburg.de
Thomas BroxUniversity of FreiburgVerified email at cs.uni-freiburg.de
Guillaume JeanneretPostdoc at ISIR lab - Sorbonne UniversitéVerified email at isir.upmc.fr

Juan Carlos Leon Alcazar

KAUST

Verified email at uniandes.edu.co

Computer Vision Video analysis Continual Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Mad: A scalable dataset for language grounding in videos from movie audio descriptions M Soldan, A Pardo, JL Alcázar, F Caba, C Zhao, S Giancola, B Ghanem Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	159	2022
Active speakers in context JL Alcázar, F Caba, L Mai, F Perazzi, JY Lee, P Arbeláez, B Ghanem Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	116	2020
Maas: Multi-modal assignation for active speaker detection JL Alcázar, F Caba, AK Thabet, B Ghanem Proceedings of the IEEE/CVF International Conference on Computer Vision, 265-274, 2021	78	2021
Pivot: Prompting for video continual learning A Villa, JL Alcázar, M Alfarra, K Alhamoud, J Hurtado, FC Heilbron, A Soto, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	77	2023
vclimb: A novel video class incremental learning benchmark A Villa, K Alhamoud, V Escorcia, F Caba, JL Alcázar, B Ghanem Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	59	2022
Multi-view dynamic facial action unit detection A Romero, J León, P Arbeláez Image and Vision Computing 122, 103723, 2022	48	2022
End-to-end active speaker detection JL Alcazar, M Cordes, C Zhao, B Ghanem European Conference on Computer Vision, 126-143, 2022	45	2022
Moviecuts: A new dataset and benchmark for cut type recognition A Pardo, FC Heilbron, JL Alcázar, A Thabet, B Ghanem European Conference on Computer Vision, 668-685, 2022	38	2022
Learning to cut by watching movies A Pardo, F Caba, JL Alcázar, AK Thabet, B Ghanem Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	29	2021
Behind the magic, merlim: Multi-modal evaluation benchmark for large image-language models A Villa, J Léon, A Soto, B Ghanem Proceedings of the Computer Vision and Pattern Recognition Conference, 492-502, 2025	23	2025
Just a glimpse: Rethinking temporal information for video continual learning L Alssum, JL Alcazar, M Ramazanova, C Zhao, B Ghanem Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	16	2023
Learning to read analog gauges from synthetic data J Leon-Alcazar, Y Alnumay, C Zheng, H Trigui, S Patel, B Ghanem Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024	10	2024
Opentad: A unified framework and comprehensive study of temporal action detection S Liu, C Zhao, F Zohra, M Soldan, A Pardo, M Xu, L Alssum, ... Proceedings of the Computer Vision and Pattern Recognition Conference, 2625-2635, 2025	7	2025
Compressed-language models for understanding compressed file formats: a jpeg exploration JC Pérez, A Pardo, M Soldan, H Itani, J Leon-Alcazar, B Ghanem arXiv preprint arXiv:2405.17146, 2024	7	2024
APES: Audiovisual person search in untrimmed video JL Alcazar, F Caba, L Mai, F Perazzi, JY Lee, P Arbelaez, B Ghanem Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	6	2021
EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models A Villa, JL Alcázar, M Alfarra, V Araujo, A Soto, B Ghanem arXiv preprint arXiv:2501.02699, 2025	4	2025
MAIN: Multi-Attention Instance Network for Video Segmentation JL Alcazar, MA Bravo, AK Thabet, G Jeanneret, T Brox, P Arbelaez, ... arXiv preprint arXiv:1904.05847, 2019	4	2019
Motion-Aware Concept Alignment for Consistent Video Editing T Zhang, JCL Alcazar, B Ghanem arXiv preprint arXiv:2506.01004, 2025	2	2025
Usability evaluation of a mobile tool to support prenatal examination JC Leon, A Aponte, S Vega, E Romero IX International Seminar On Medical Information Processing And Analysis 8922 …, 2013	2	2013
Markerless Analysis of Gait Patterns in the Parkinson's Disease L Alcázar, J Carlos Universidad Nacional de Colombia, 2012	2	2012

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors