[go: up one dir, main page]

Follow
Muhammad Uzair Khattak
Muhammad Uzair Khattak
Verified email at epfl.ch - Homepage
Title
Cited by
Cited by
Year
Maple: Multi-modal prompt learning
MU Khattak, H Rasheed, M Maaz, S Khan, FS Khan
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
13652023
Self-regulating prompts: Foundational model adaptation without forgetting
MU Khattak, ST Wasim, M Naseer, S Khan, MH Yang, FS Khan
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
3862023
Fine-tuned clip models are efficient video learners
H Rasheed*, MU Khattak*, M Maaz, S Khan, FS Khan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
2902023
Bridging the gap between object and image-level representations for open-vocabulary detection
H Bangalath, M Maaz, MU Khattak, SH Khan, F Shahbaz Khan
Advances in Neural Information Processing Systems 35, 33781-33794, 2022
2192022
Align your prompts: Test-time prompting with distribution alignment for zero-shot generalization
J Abdul Samadh, MH Gani, N Hussein, MU Khattak, MM Naseer, ...
Advances in Neural Information Processing Systems 36, 2024
1372024
Learning to prompt with text only supervision for vision-language models
MU Khattak, MF Naeem, M Naseer, L Van Gool, F Tombari
Proceedings of the AAAI Conference on Artificial Intelligence 39 (4), 4230-4238, 2025
452025
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
ST Wasim, MU Khattak, M Naseer, S Khan, M Shah, FS Khan
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
402023
Unimed-clip: Towards a unified image-text pretraining paradigm for diverse medical imaging modalities
MU Khattak, S Kunhimon, M Naseer, S Khan, FS Khan
arXiv preprint arXiv:2412.10372, 2024
292024
How Good is my Video-LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs
MU Khattak, MF Naeem, J Hassan, M Naseer, F Tombari, FS Khan, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - WFM, 2025
21*2025
System and method of bridging the gap between object and image-level representations for open-vocabulary detection
HAR BANGALATH, M Muhammad, MU KHATTAK, S Khan, FS Khan
US Patent 12,288,372, 2025
32025
Promptception: How Sensitive Are Large Multimodal Models to Prompts?
MI Ismithdeen, MU Khattak, S Khan
arXiv preprint arXiv:2509.03986, 2025
12025
Investigating and Improving Common Loop Closure Failures in Visual SLAM
S Khaliq, ML Anjum, W Hussain, MU Khattak, M Rasool
Autonomous Robots, 2022
12022
Multi-modal prompt learning for representation transfer on image recognition tasks
MU KHATTAK, HAR BANGALATH, S Khan, FS Khan
US Patent 12,493,741, 2025
2025
System and method for modeling local and global spatio-temporal context in video for video recognition
ST WASIM, MU KHATTAK, M NASEER, S Khan, FS Khan
US Patent App. 18/411,928, 2025
2025
Method and system for adapting a vision-language machine learning model for image recognition tasks
MU KHATTAK, ST WASIM, M NASEER, S Khan, FS Khan
US Patent App. 18/541,666, 2025
2025
Transferability of Vision-Language models with Prompt Learning
MU Khattak
2023
The system can't perform the operation now. Try again later.
Articles 1–16