| Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces J Yang*, S Yang*, AW Gupta*, R Han*, L Fei-Fei, S Xie CVPR 2025 Oral, 2024 | 333 | 2024 |
| Sean 2.0: Formalizing and generating social situations for robot navigation N Tsoi, A Xiang, P Yu, SS Sohn, G Schwartz, S Ramesh, M Hussein, ... IROS 2022, 2022 | 77 | 2022 |
| Conversational group detection with graph neural networks S Thompson, A Gupta, AW Gupta, A Chen, M Vázquez ICMI 2021, 2021 | 33 | 2021 |
| Test-Time Adaptation for Depth Completion H Park, A Gupta, A Wong CVPR 2024, 2024 | 29 | 2024 |