| Multiscale vision transformers H Fan*, B Xiong*, K Mangalam*, Y Li*, Z Yan, J Malik, C Feichtenhofer* IEEE International Conference on Computer Vision, 2021 | 2034 | 2021 |
| Ego4D: Around the world in 3,000 hours of egocentric video K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ... IEEE Conference on Computer Vision and Pattern Recognition, 2022 | 1686 | 2022 |
| MViTv2: Improved multiscale vision transformers for classification and detection Y Li, CY Wu, H Fan, K Mangalam, B Xiong, J Malik, C Feichtenhofer Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 1201 | 2022 |
| It is not the journey but the destination: Endpoint conditioned trajectory prediction K Mangalam, H Girase, S Agarwal, KH Lee, E Adeli, J Malik, A Gaidon European Conference on Computer Vision, 2020 | 711 | 2020 |
| EgoSchema: A diagnostic benchmark for very long-form video language understanding K Mangalam, R Akshulakov, J Malik Advances in Neural Information Processing Systems 36, 46212-46244, 2023 | 512 | 2023 |
| From goals, waypoints & paths to long term human trajectory forecasting K Mangalam, Y An, H Girase, J Malik IEEE International Conference on Computer Vision, 2021 | 427 | 2021 |
| Long-term human motion prediction with scene context Z Cao, H Gao, K Mangalam, QZ Cai, M Vo, J Malik European Conference on Computer Vision, 2020 | 343 | 2020 |
| MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition CY Wu*, Y Li*, K Mangalam, H Fan, B Xiong, J Malik, C Feichtenhofer* arXiv preprint arXiv:2201.08383, 2022 | 333 | 2022 |
| Sequential modeling enables scalable learning for large vision models Y Bai, X Geng, K Mangalam, A Bar, AL Yuille, T Darrell, J Malik, AA Efros Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 287 | 2024 |
| Future person localization in first-person videos T Yagi, K Mangalam, R Yonetani, Y Sato IEEE Conference on Computer Vision and Pattern Recognition, 2018 | 254 | 2018 |
| Squeezeformer: An efficient transformer for automatic speech recognition S Kim, A Gholami, A Shaw, N Lee, K Mangalam, J Malik, MW Mahoney, ... Advances in Neural Information Processing Systems 35, 9361-9373, 2022 | 219 | 2022 |
| Speculative decoding with big little decoder S Kim, K Mangalam, S Moon, J Malik, MW Mahoney, A Gholami, ... Advances in Neural Information Processing Systems 36, 39236-39256, 2023 | 148 | 2023 |
| LOKI: Long Term and Key Intentions for Trajectory Prediction H Girase, H Gang, S Malla, J Li, A Kanehara, K Mangalam, C Choi IEEE International Conference on Computer Vision, 2021 | 133 | 2021 |
| Object-region video transformers R Herzig, E Ben-Avraham, K Mangalam, A Bar, G Chechik, A Rohrbach, ... IEEE Conference on Computer Vision and Pattern Recognition, 2022 | 123 | 2022 |
| Diffusion models as masked autoencoders C Wei, K Mangalam, PY Huang, Y Li, H Fan, H Xu, H Wang, C Xie, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 112 | 2023 |
| Reversible vision transformers K Mangalam, H Fan, Y Li, CY Wu, B Xiong, C Feichtenhofer, J Malik Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 94 | 2022 |
| LLM2LLM: Boosting LLMs with novel iterative data enhancement N Lee, T Wattanawong, S Kim, K Mangalam, S Shen, G Anumanchipalli, ... Findings of the Association for Computational Linguistics: ACL 2024, 6498-6526, 2024 | 88 | 2024 |
| Disentangling human dynamics for pedestrian locomotion forecasting with noisy supervision K Mangalam, E Adeli, KH Lee, A Gaidon, JC Niebles IEEE Winter Conference on Applications of Computer Vision, 2020 | 88 | 2020 |
| Do deep neural networks learn shallow learnable examples first? K Mangalam, VU Prabhu Understanding Deep Phenomena, International Conference on Machine Learning, 2019 | 55 | 2019 |
| Re2TAL: Rewiring pretrained video backbones for reversible temporal action localization C Zhao, S Liu, K Mangalam, B Ghanem Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 44 | 2023 |