Muhammad Maaz

Cited by

	All	Since 2021
Citations	5155	5153
h-index	13	13
i10-index	13	13

3000

1500

750

2250

2022202320242025202617 396 1674 2952 103

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Salman KhanMBZUAI, Australian National UniversityVerified email at anu.edu.au
Fahad Shahbaz KhanMBZUAI, Linköping University SwedenVerified email at cvc.uab.es
Hanoona Abdul RasheedPhD Computer Vision Student at MBZUAIVerified email at mbzuai.ac.ae
Ming-Hsuan YangUniversity of California at Merced; Google DeepMindVerified email at ucmerced.edu
Abdelrahman ShakerPhD in Computer Vision, Mohamed Bin Zayed University of Artificial IntelligenceVerified email at mbzuai.ac.ae
Rao Muhammad AnwerMohamed bin Zayed University of Artificial Intelligence (MBZUAI)Verified email at aalto.fi
Muhammad Uzair KhattakEPFLVerified email at epfl.ch
Hisham CholakkalMohamed bin Zayed University of Artificial Intelligence (MBZUAI)Verified email at mbzuai.ac.ae
Syed Waqas ZamirSr. Research Scientist @ Microsoft AI for Good LabVerified email at microsoft.com
Eric XingPresident at Mohamed bin Zayed University of AI, Professor of Computer Science, Carnegie Mellon UVerified email at cs.cmu.edu
Sahal Shaji MullappillyPhD Computer Vision Student, MBZUAIVerified email at mbzuai.ac.ae
Michael FelsbergProfessor of Computer Vision, Linköping UniversityVerified email at liu.se
Timothy BaldwinMBZUAI and The University of MelbourneVerified email at unimelb.edu.au
Mubarak ShahTrustee Chair Professor of Computer Science, University of Central FloridaVerified email at crcv.ucf.edu
Shehan MunasingheMBZUAIVerified email at mbzuai.ac.ae
Rusiru ThusharaJohns Hopkins UniversityVerified email at jhu.edu
Dhanalaxmi GaddamMasters in Machine LearningVerified email at mbzuai.ac.ae
Hamid RezatofighiAssociate Professor, Monash University, Melbourne, AustraliaVerified email at monash.edu
Chenhui GouPhD candidate, Monash University;Verified email at monash.edu
Christoph FeichtenhoferMetaVerified email at fb.com

Muhammad Maaz

PhD Computer Vision at MBZUAI

Verified email at mbzuai.ac.ae - Homepage

Computer Vision Deep Learning Vision-Language Generative AI


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Maple: Multi-modal prompt learning MU Khattak, H Rasheed, M Maaz, S Khan, FS Khan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	1366	2023
Video-chatgpt: Towards detailed video understanding via large vision and language models M Maaz, H Rasheed, S Khan, FS Khan Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	1300	2024
UNETR++: delving into efficient and accurate 3D medical image segmentation A Shaker, M Maaz, H Rasheed, S Khan, MH Yang, FS Khan IEEE Transactions on Medical Imaging 43 (9), 3377-3390, 2024	450	2024
Edgenext: efficiently amalgamated cnn-transformer architecture for mobile vision applications M Maaz, A Shaker, H Cholakkal, S Khan, SW Zamir, RM Anwer, ... European conference on computer vision, 3-20, 2022	435	2022
Glamm: Pixel grounding large multimodal model H Rasheed, M Maaz, S Shaji, A Shaker, S Khan, H Cholakkal, RM Anwer, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	424	2024
Fine-tuned clip models are efficient video learners H Rasheed, MU Khattak, M Maaz, S Khan, FS Khan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	290	2023
Swiftformer: Efficient additive attention for transformer-based real-time mobile vision applications A Shaker, M Maaz, H Rasheed, S Khan, MH Yang, FS Khan Proceedings of the IEEE/CVF international conference on computer vision …, 2023	286	2023
Bridging the gap between object and image-level representations for open-vocabulary detection H Bangalath, M Maaz, MU Khattak, SH Khan, F Shahbaz Khan Advances in Neural Information Processing Systems 35, 33781-33794, 2022	219	2022
Class-agnostic object detection with multi-modal transformer M Maaz, H Rasheed, S Khan, FS Khan, RM Anwer, MH Yang European conference on computer vision, 512-531, 2022	162*	2022
Videogpt+: Integrating image and video encoders for enhanced video understanding M Maaz, H Rasheed, S Khan, F Khan arXiv preprint arXiv:2406.09418, 2024	96	2024
Pg-video-llava: Pixel grounding large video-language models S Munasinghe, R Thushara, M Maaz, HA Rasheed, S Khan, M Shah, ... arXiv preprint arXiv:2311.13435, 2023	52	2023
Perceptionlm: Open-access data and models for detailed visual understanding JH Cho, A Madotto, E Mavroudi, T Afouras, T Nagarajan, M Maaz, Y Song, ... Advances in Neural Information Processing Systems (NeurIPS Spotlight), 2025	33	2025
Palo: A polyglot large multimodal model for 5b people H Rasheed, M Maaz, A Shaker, S Khan, H Cholakal, RM Anwer, ... 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV …, 2025	31*	2025
Videomathqa: Benchmarking mathematical reasoning via multimodal understanding in videos H Rasheed, A Shaker, A Tang, M Maaz, MH Yang, S Khan, FS Khan arXiv preprint arXiv:2506.05349, 2025	5	2025
Self-supervised learning for fine-grained visual categorization M Maaz, HA Rasheed, D Gaddam arXiv preprint arXiv:2105.08788, 2021	3	2021
A culturally-diverse multilingual multimodal video benchmark & model BS Shafique, A Vayani, M Maaz, HA Rasheed, D Dissanayake, ... Proceedings of the 2025 Conference on Empirical Methods in Natural Language …, 2025	2	2025
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model A Shaker, M Maaz, C Gou, H Rezatofighi, S Khan, FS Khan arXiv preprint arXiv:2503.21782, 2025	1	2025
Video-CoM: Interactive Video Reasoning via Chain of Manipulations H Rasheed, M Zumri, M Maaz, MH Yang, FS Khan, S Khan arXiv preprint arXiv:2511.23477, 2025		2025
Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models M Maaz, H Rasheed, FS Khan, S Khan arXiv preprint arXiv:2511.23478, 2025		2025

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors