[go: up one dir, main page]

Follow
Gabriel Sarch
Title
Cited by
Cited by
Year
Tidee: Tidying up novel rooms using visuo-semantic commonsense priors
G Sarch, Z Fang, AW Harley, P Schydlo, MJ Tarr, S Gupta, K Fragkiadaki
European Conference on Computer Vision (ECCV) 2022, 480-496, 2022
652022
Open-ended instructable embodied agents with memory-augmented large language models
G Sarch, Y Wu, MJ Tarr, K Fragkiadaki
Empirical Methods in Natural Language Processing (EMNLP) Findings 2023, 2023
522023
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought
GH Sarch, L Jang, MJ Tarr, WW Cohen, K Marino, K Fragkiadaki
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
46*2024
Move to see better: Self-improving embodied object detection
Z Fang, A Jain, G Sarch, AW Harley, K Fragkiadaki
British Machine Vision Conference (BMVC) 2021, 2020
43*2020
Detailed characterization of neural selectivity in free viewing primates
JL Yates, SH Coop, GH Sarch, RJ Wu, DA Butts, M Rucci, JF Mitchell
Nature Communications 14 (1), 3656, 2023
39*2023
Odin: A single model for 2d and 3d segmentation
A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
30*2024
Grounded Reinforcement Learning for Visual Reasoning
G Sarch, S Saha, N Khandelwal, A Jain, MJ Tarr, A Kumar, K Fragkiadaki
arXiv preprint arXiv:2505.23678, 2025
232025
Brain Dissection: fMRI-trained Networks Reveal Spatial Selectivity in the Processing of Natural Images
GH Sarch, MJ Tarr, K Fragkiadaki, L Wehbe
Conference on Neural Information Processing Systems (NeurIPS) 2023, 2023
132023
Reanimating Images using Neural Representations of Dynamic Stimuli
J Yeung, AF Luo, G Sarch, MM Henderson, D Ramanan, MJ Tarr
Proceedings of the Computer Vision and Pattern Recognition Conference, 5331-5343, 2025
72025
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
G Sarch, S Somani, R Kapoor, MJ Tarr, K Fragkiadaki
ICLR 2024 LLMAgents Workshop, 0
6*
3d view prediction models of the dorsal visual stream
G Sarch, HYF Tung, A Wang, J Prince, M Tarr
Conference on Cognitive Computational Neuroscience (CCN) 2023, 2023
32023
Out of sight, not out of context? egocentric spatial reasoning in vlms across disjoint frames
S Ravi, GH Sarch, V Vineet, AD Wilson, BT Kumaravel
Proceedings of the 2025 Conference on Empirical Methods in Natural Language …, 2025
22025
Grounding Task Assistance with Multimodal Cues from a Single Demonstration
GH Sarch, BT Kumaravel, S Ravi, V Vineet, AD Wilson
Findings of the Association for Computational Linguistics: ACL 2025, 12807-12833, 2025
12025
Laminar and cell class distinctions for pre-saccadic attention in marmoset MT/MTC
A Bucklaew, S Coop, G Sarch, J Mitchell
Journal of Vision 23 (11), 39-39, 2023
2023
Laminar Organization of Pre-Saccadic Attention in Marmoset Area MT
SH Coop, GH Sarch, A Bucklaew, JL Yates, JF Mitchell
Journal of Vision 22 (14), 3969-3969, 2022
2022
From Observation to Abstractions: Efficient In-Context Learning from Human Feedback and Visual Demonstrations for VLM Agents
GH Sarch, L Jang, MJ Tarr, WW Cohen, K Marino, K Fragkiadaki
Workshop on Training Agents with Foundation Models at RLC 2024, 0
ODIN: A Single Model for 2D and 3D Segmentation Supplementary Materials
A Jain, P Katara, N Gkanatsios, AW Harley, G Sarch, K Aggarwal, ...
Embodied Symbiotic Assistants that See, Act, Infer and Chat
Y Cao, N Pande, A Jain, S Sharma, G Sarch, N Gkanatsios, X Zhou, ...
The system can't perform the operation now. Try again later.
Articles 1–18