Juan Carlos Leon Alcazar
KAUST
Verified email at uniandes.edu.co
Title
Cited by
Year
MAD: A scalable dataset for language grounding in videos from movie audio descriptions
M Soldan, A Pardo, JL Alcázar, F Caba, C Zhao, S Giancola, B Ghanem
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by 159, 2022
Active speakers in context
JL Alcázar, F Caba, L Mai, F Perazzi, JY Lee, P Arbeláez, B Ghanem
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
Cited by 116, 2020
MAAS: Multi-modal assignation for active speaker detection
JL Alcázar, F Caba, AK Thabet, B Ghanem
Proceedings of the IEEE/CVF International Conference on Computer Vision, 265-274, 2021
Cited by 78, 2021
PIVOT: Prompting for video continual learning
A Villa, JL Alcázar, M Alfarra, K Alhamoud, J Hurtado, FC Heilbron, A Soto, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 77, 2023
vCLIMB: A novel video class incremental learning benchmark
A Villa, K Alhamoud, V Escorcia, F Caba, JL Alcázar, B Ghanem
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by 59, 2022
Multi-view dynamic facial action unit detection
A Romero, J León, P Arbeláez
Image and Vision Computing 122, 103723, 2022
Cited by 48, 2022
End-to-end active speaker detection
JL Alcazar, M Cordes, C Zhao, B Ghanem
European Conference on Computer Vision, 126-143, 2022
Cited by 45, 2022
MovieCuts: A new dataset and benchmark for cut type recognition
A Pardo, FC Heilbron, JL Alcázar, A Thabet, B Ghanem
European Conference on Computer Vision, 668-685, 2022
Cited by 38, 2022
Learning to cut by watching movies
A Pardo, F Caba, JL Alcázar, AK Thabet, B Ghanem
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by 29, 2021
Behind the magic, MERLIM: Multi-modal evaluation benchmark for large image-language models
A Villa, J León, A Soto, B Ghanem
Proceedings of the Computer Vision and Pattern Recognition Conference, 492-502, 2025
Cited by 23, 2025
Just a glimpse: Rethinking temporal information for video continual learning
L Alssum, JL Alcazar, M Ramazanova, C Zhao, B Ghanem
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 16, 2023
Learning to read analog gauges from synthetic data
J Leon-Alcazar, Y Alnumay, C Zheng, H Trigui, S Patel, B Ghanem
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
Cited by 10, 2024
OpenTAD: A unified framework and comprehensive study of temporal action detection
S Liu, C Zhao, F Zohra, M Soldan, A Pardo, M Xu, L Alssum, ...
Proceedings of the Computer Vision and Pattern Recognition Conference, 2625-2635, 2025
Cited by 7, 2025
Compressed-language models for understanding compressed file formats: a JPEG exploration
JC Pérez, A Pardo, M Soldan, H Itani, J Leon-Alcazar, B Ghanem
arXiv preprint arXiv:2405.17146, 2024
Cited by 7, 2024
APES: Audiovisual person search in untrimmed video
JL Alcazar, F Caba, L Mai, F Perazzi, JY Lee, P Arbelaez, B Ghanem
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by 6, 2021
EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models
A Villa, JL Alcázar, M Alfarra, V Araujo, A Soto, B Ghanem
arXiv preprint arXiv:2501.02699, 2025
Cited by 4, 2025
MAIN: Multi-Attention Instance Network for Video Segmentation
JL Alcazar, MA Bravo, AK Thabet, G Jeanneret, T Brox, P Arbelaez, ...
arXiv preprint arXiv:1904.05847, 2019
Cited by 4, 2019
Motion-Aware Concept Alignment for Consistent Video Editing
T Zhang, JCL Alcazar, B Ghanem
arXiv preprint arXiv:2506.01004, 2025
Cited by 2, 2025
Usability evaluation of a mobile tool to support prenatal examination
JC Leon, A Aponte, S Vega, E Romero
IX International Seminar On Medical Information Processing And Analysis 8922 …, 2013
Cited by 2, 2013
Markerless Analysis of Gait Patterns in the Parkinson's Disease
L Alcázar, J Carlos
Universidad Nacional de Colombia, 2012
Cited by 2, 2012