Muhammad Uzair Khattak

Cited by

	All	Since 2021
Citations	2537	2536
h-index	9	9
i10-index	9	9

Citations per year
Year	Citations
2023	199
2024	919
2025	1376
2026	34

Public access (based on funding mandates)
2 articles available
0 articles not available

Co-authors

Salman Khan	MBZUAI, Australian National University	Verified email at anu.edu.au
Fahad Shahbaz Khan	MBZUAI, Linköping University Sweden	Verified email at cvc.uab.es
Muzammal Naseer	Khalifa University, University of Western Australia	Verified email at ku.ac.ae
Muhammad Maaz	PhD Computer Vision at MBZUAI	Verified email at mbzuai.ac.ae
Hanoona Abdul Rasheed	PhD Computer Vision Student at MBZUAI	Verified email at mbzuai.ac.ae
Syed Talal Wasim	University of Bonn	Verified email at mbzuai.ac.ae
Federico Tombari	Google, TU Munich	Verified email at in.tum.de
Ming-Hsuan Yang	University of California at Merced; Google DeepMind	Verified email at ucmerced.edu
Muhammad Ferjad Naeem	Research Scientist, Google	Verified email at google.com
Hanan Gani	University of California San Diego; Mohamed Bin Zayed University of Artificial Intelligence	Verified email at mbzuai.ac.ae
Noor Hazim Hussein	Michigan State University	Verified email at msu.edu
Jameel Hassan	Johns Hopkins University	Verified email at jh.edu
Luc Van Gool	Professor of computer vision, INSAIT Sofia University, em. KU Leuven, em. ETHZ, Toyota Lab TRACE	Verified email at insait.ai
Mubarak Shah	Trustee Chair Professor of Computer Science, University of Central Florida	Verified email at crcv.ucf.edu
Saran Khaliq	National University of Sciences and Technology NUST	Verified email at seecs.edu.pk
Wajahat Hussain	National University of Sciences and Technology, Islamabad	Verified email at seecs.edu.pk
Muhammad Latif Anjum	Assistant Professor, SEECS, NUST, Islamabad	Verified email at seecs.edu.pk

Muhammad Uzair Khattak
EPFL
Verified email at epfl.ch
Research interests: Computer Vision, Multi-modal Learning, Video Processing


Title	Authors	Venue	Cited by	Year
MaPLe: Multi-modal prompt learning	MU Khattak, H Rasheed, M Maaz, S Khan, FS Khan	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	1365	2023
Self-regulating prompts: Foundational model adaptation without forgetting	MU Khattak, ST Wasim, M Naseer, S Khan, MH Yang, FS Khan	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	386	2023
Fine-tuned CLIP models are efficient video learners	H Rasheed, MU Khattak, M Maaz, S Khan, FS Khan	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	290	2023
Bridging the gap between object and image-level representations for open-vocabulary detection	H Bangalath, M Maaz, MU Khattak, SH Khan, F Shahbaz Khan	Advances in Neural Information Processing Systems 35, 33781-33794, 2022	219	2022
Align your prompts: Test-time prompting with distribution alignment for zero-shot generalization	J Abdul Samadh, MH Gani, N Hussein, MU Khattak, MM Naseer, ...	Advances in Neural Information Processing Systems 36, 2024	137	2024
Learning to prompt with text only supervision for vision-language models	MU Khattak, MF Naeem, M Naseer, L Van Gool, F Tombari	Proceedings of the AAAI Conference on Artificial Intelligence 39 (4), 4230-4238, 2025	45	2025
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition	ST Wasim, MU Khattak, M Naseer, S Khan, M Shah, FS Khan	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	40	2023
UniMed-CLIP: Towards a unified image-text pretraining paradigm for diverse medical imaging modalities	MU Khattak, S Kunhimon, M Naseer, S Khan, FS Khan	arXiv preprint arXiv:2412.10372, 2024	29	2024
How Good is my Video-LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs	MU Khattak, MF Naeem, J Hassan, M Naseer, F Tombari, FS Khan, ...	IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - WFM, 2025	21*	2025
System and method of bridging the gap between object and image-level representations for open-vocabulary detection	HAR BANGALATH, M Muhammad, MU KHATTAK, S Khan, FS Khan	US Patent 12,288,372, 2025	3	2025
Promptception: How Sensitive Are Large Multimodal Models to Prompts?	MI Ismithdeen, MU Khattak, S Khan	arXiv preprint arXiv:2509.03986, 2025	1	2025
Investigating and Improving Common Loop Closure Failures in Visual SLAM	S Khaliq, ML Anjum, W Hussain, MU Khattak, M Rasool	Autonomous Robots, 2022	1	2022
Multi-modal prompt learning for representation transfer on image recognition tasks	MU KHATTAK, HAR BANGALATH, S Khan, FS Khan	US Patent 12,493,741, 2025		2025
System and method for modeling local and global spatio-temporal context in video for video recognition	ST WASIM, MU KHATTAK, M NASEER, S Khan, FS Khan	US Patent App. 18/411,928, 2025		2025
Method and system for adapting a vision-language machine learning model for image recognition tasks	MU KHATTAK, ST WASIM, M NASEER, S Khan, FS Khan	US Patent App. 18/541,666, 2025		2025
Transferability of Vision-Language models with Prompt Learning	MU Khattak			2023
