Gabriel Sarch

Cited by

	All	Since 2021
Citations	330	329
h-index	8	8
i10-index	8	8

180

135

2021202220232024202520263 14 36 93 177 6

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Katerina FragkiadakiAssociate Professor, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Michael J. TarrDepartment of Psychology, Neuroscience Institute, Machine Learning, Carnegie Mellon UniversityVerified email at cmu.edu
Adam W. HarleyResearch Scientist, MetaVerified email at meta.com
Ayush JainPhD Student in Robotics, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Zhaoyuan FangCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Jude MitchellUniversity of RochesterVerified email at ur.rochester.edu
William W. CohenCMU and Google DeepMindVerified email at google.com
Kenneth MarinoUniversity of UtahVerified email at utah.edu
Jacob L YatesUC BerkeleyVerified email at berkeley.edu
Saurabh GuptaUniversity of Illinois Urbana-ChampaignVerified email at illinois.edu
Aviral KumarCarnegie Mellon UniversityVerified email at andrew.cmu.edu
Andrew WilsonMicrosoft ResearchVerified email at microsoft.com
Yue WuPhD Student, CMUVerified email at andrew.cmu.edu

Gabriel Sarch

Carnegie Mellon University

Verified email at andrew.cmu.edu - Homepage

Artificial Intelligence Reinforcement Learning Computer Vision Neuroscience


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Tidee: Tidying up novel rooms using visuo-semantic commonsense priors G Sarch, Z Fang, AW Harley, P Schydlo, MJ Tarr, S Gupta, K Fragkiadaki European Conference on Computer Vision (ECCV) 2022, 480-496, 2022	65	2022
Open-ended instructable embodied agents with memory-augmented large language models G Sarch, Y Wu, MJ Tarr, K Fragkiadaki Empirical Methods in Natural Language Processing (EMNLP) Findings 2023, 2023	52	2023
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought GH Sarch, L Jang, MJ Tarr, WW Cohen, K Marino, K Fragkiadaki The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024	46*	2024
Move to see better: Self-improving embodied object detection Z Fang, A Jain, G Sarch, AW Harley, K Fragkiadaki British Machine Vision Conference (BMVC) 2021, 2020	43*	2020
Detailed characterization of neural selectivity in free viewing primates JL Yates, SH Coop, GH Sarch, RJ Wu, DA Butts, M Rucci, JF Mitchell Nature Communications 14 (1), 3656, 2023	39*	2023
Odin: A single model for 2d and 3d segmentation A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	30*	2024
Grounded Reinforcement Learning for Visual Reasoning G Sarch, S Saha, N Khandelwal, A Jain, MJ Tarr, A Kumar, K Fragkiadaki arXiv preprint arXiv:2505.23678, 2025	23	2025
Brain Dissection: fMRI-trained Networks Reveal Spatial Selectivity in the Processing of Natural Images GH Sarch, MJ Tarr, K Fragkiadaki, L Wehbe Conference on Neural Information Processing Systems (NeurIPS) 2023, 2023	13	2023
Reanimating Images using Neural Representations of Dynamic Stimuli J Yeung, AF Luo, G Sarch, MM Henderson, D Ramanan, MJ Tarr Proceedings of the Computer Vision and Pattern Recognition Conference, 5331-5343, 2025	7	2025
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models G Sarch, S Somani, R Kapoor, MJ Tarr, K Fragkiadaki ICLR 2024 LLMAgents Workshop, 0	6*
3d view prediction models of the dorsal visual stream G Sarch, HYF Tung, A Wang, J Prince, M Tarr Conference on Cognitive Computational Neuroscience (CCN) 2023, 2023	3	2023
Out of sight, not out of context? egocentric spatial reasoning in vlms across disjoint frames S Ravi, GH Sarch, V Vineet, AD Wilson, BT Kumaravel Proceedings of the 2025 Conference on Empirical Methods in Natural Language …, 2025	2	2025
Grounding Task Assistance with Multimodal Cues from a Single Demonstration GH Sarch, BT Kumaravel, S Ravi, V Vineet, AD Wilson Findings of the Association for Computational Linguistics: ACL 2025, 12807-12833, 2025	1	2025
Laminar and cell class distinctions for pre-saccadic attention in marmoset MT/MTC A Bucklaew, S Coop, G Sarch, J Mitchell Journal of Vision 23 (11), 39-39, 2023		2023
Laminar Organization of Pre-Saccadic Attention in Marmoset Area MT SH Coop, GH Sarch, A Bucklaew, JL Yates, JF Mitchell Journal of Vision 22 (14), 3969-3969, 2022		2022
From Observation to Abstractions: Efficient In-Context Learning from Human Feedback and Visual Demonstrations for VLM Agents GH Sarch, L Jang, MJ Tarr, WW Cohen, K Marino, K Fragkiadaki Workshop on Training Agents with Foundation Models at RLC 2024, 0
ODIN: A Single Model for 2D and 3D Segmentation Supplementary Materials A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ...
Embodied Symbiotic Assistants that See, Act, Infer and Chat Y Cao, N Pande, A Jain, S Sharma, G Sarch, N Gkanatsios, X Zhou, ...

The system can't perform the operation now. Try again later.

Articles 1–18

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors