Meera Hahn
Research Scientist, Google
Verified email at google.com
Title
Cited by
Year
Videopoet: A large language model for zero-shot video generation
D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, G Schindler, R Hornung, ...
arXiv preprint arXiv:2312.14125, 2023
Cited by: 412, Year: 2023
Photorealistic video generation with diffusion models
A Gupta, L Yu, K Sohn, X Gu, M Hahn, FF Li, I Essa, L Jiang, J Lezama
European Conference on Computer Vision, 393-411, 2024
Cited by: 287, Year: 2024
Tripping through time: Efficient Localization of Activities in Videos
M Hahn, A Kadav, JM Rehg, HP Graf
British Machine Vision Conference (BMVC), 2020
Cited by: 112, Year: 2020
No RL, No Simulation: Learning to Navigate without Navigating
M Hahn, D Chaplot, S Tulsiani, M Mukadam, JM Rehg, A Gupta
Advances in Neural Information Processing Systems, 2021
Cited by: 110, Year: 2021
Action2Vec: A Crossmodal Embedding Approach to Action Learning
M Hahn, A Silva, JM Rehg
The IEEE Conference on Computer Vision and Pattern Recognition …, 2018
Cited by: 76, Year: 2018
Where are you? localization from embodied dialog
M Hahn, J Krantz, D Batra, D Parikh, JM Rehg, S Lee, P Anderson
Empirical Methods in Natural Language Processing (EMNLP), 2020
Cited by: 35, Year: 2020
Situated bayesian reasoning framework for robots operating in diverse everyday environments
S Chernova, V Chu, A Daruna, H Garrison, M Hahn, P Khante, W Liu, ...
Robotics Research: The 18th International Symposium ISRR, 353-369, 2019
Cited by: 33, Year: 2019
Deep tracking: Visual tracking using deep convolutional networks
M Hahn, S Chen, A Dehghan
arXiv preprint arXiv:1512.03993, 2015
Cited by: 12, Year: 2015
Videopoet: A large language model for zero-shot video generation, 2024
D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, G Schindler, R Hornung, ...
URL https://arxiv.org/abs/2312.14125 1, 2024
Cited by: 11*, Year: 2024
FineStyle: Fine-grained Controllable Style Personalization for Text-to-image Models
G Zhang, K Sohn, M Hahn, H Shi, I Essa
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
Cited by: 11, Year: 2024
Learning to localize and align fine-grained actions to sparse instructions
M Hahn, N Ruiz, JB Alayrac, I Laptev, JM Rehg
The IEEE Conference on Computer Vision and Pattern Recognition …, 2017
Cited by: 10, Year: 2017
Transformer-based Localization from Embodied Dialog with Large-scale Pre-training
M Hahn, JM Rehg
Conference of the Asia-Pacific Association for Computational Linguistics …, 2022
Cited by: 9, Year: 2022
Proactive agents for multi-turn text-to-image generation under uncertainty
M Hahn, W Zeng, N Kannen, R Galt, K Badola, B Kim, Z Wang
arXiv preprint arXiv:2412.06771, 2024
Cited by: 8, Year: 2024
Efficient and fine-grained video retrieval
A Kadav, I Melvin, HP Graf, M Hahn
US Patent 11,568,247, 2023
Cited by: 7, Year: 2023
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
S Yu, M Hahn, D Kondratyuk, J Shin, A Gupta, J Lezama, I Essa, D Ross, ...
arXiv preprint arXiv:2502.12632, 2025
Cited by: 5, Year: 2025
SiRoK: Situated Robot Knowledge-Understanding the Balance Between Situated Knowledge and Variability.
AA Daruna, V Chu, W Liu, M Hahn, P Khante, S Chernova, A Thomaz
AAAI Spring Symposia, 2018
Cited by: 4, Year: 2018
Which way is right?: Uncovering limitations of Vision-and-Language Navigation model
M Hahn, A Raj, JM Rehg
arXiv preprint arXiv:2312.00151, 2023
Cited by: 3, Year: 2023
Learning a visually grounded memory assistant
M Hahn, K Carlberg, R Desai, J Hillis
arXiv preprint arXiv:2210.03787, 2022
Cited by: 2, Year: 2022
Text and Click inputs for unambiguous open vocabulary instance segmentation
N Warner, M Hahn, J Huang, I Essa, V Birodkar
arXiv preprint arXiv:2311.14822, 2023
Cited by: 1, Year: 2023
Generating temporal sequences using diffusion transformer neural networks
S Yu, MS Hahn, A Gupta, JLT de la Llosa, IA Essa, DA Ross, J Huang
US Patent App. 19/216,518, 2025
Year: 2025