[go: up one dir, main page]

Follow
David Ross
David Ross
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Incremental learning for robust visual tracking
DA Ross, J Lim, RS Lin, MH Yang
International journal of computer vision 77 (1), 125-141, 2008
41592008
Ava: A video dataset of spatio-temporally localized atomic visual actions
C Gu, C Sun, DA Ross, C Vondrick, C Pantofaru, Y Li, ...
Proceedings of the IEEE conference on computer vision and pattern …, 2018
14622018
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261, 2025
13202025
Rethinking the faster r-cnn architecture for temporal action localization
YW Chao, S Vijayanarasimhan, B Seybold, DA Ross, J Deng, ...
Proceedings of the IEEE conference on computer vision and pattern …, 2018
9102018
Ai choreographer: Music conditioned 3d dance generation with aist++
R Li, S Yang, DA Ross, A Kanazawa
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
7782021
Language Model Beats Diffusion--Tokenizer is Key to Visual Generation
L Yu, J Lezama, NB Gundavarapu, L Versari, K Sohn, D Minnen, Y Cheng, ...
arXiv preprint arXiv:2310.05737, 2023
5652023
Incremental learning for visual tracking
J Lim, D Ross, RS Lin, MH Yang
Advances in neural information processing systems 17, 2004
4232004
Videopoet: A large language model for zero-shot video generation
D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, G Schindler, R Hornung, ...
arXiv preprint arXiv:2312.14125, 2023
4182023
Pillar-based object detection for autonomous driving
Y Wang, A Fathi, A Kundu, DA Ross, C Pantofaru, T Funkhouser, ...
European Conference on Computer Vision, 18-34, 2020
2972020
D3d: Distilled 3d networks for video action recognition
J Stroud, D Ross, C Sun, J Deng, R Sukthankar
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2020
2602020
Virtual multi-view fusion for 3d semantic segmentation
A Kundu, X Yin, A Fathi, D Ross, B Brewington, T Funkhouser, ...
European conference on computer vision, 518-535, 2020
2292020
Adaptive probabilistic visual tracking with incremental subspace update
D Ross, J Lim, MH Yang
European conference on computer vision, 470-482, 2004
2162004
The ava-kinetics localized human actions video dataset
A Li, M Thotakuri, DA Ross, J Carreira, A Vostrikov, A Zisserman
arXiv preprint arXiv:2005.00214, 2020
1862020
Reveal: Retrieval-augmented visual-language pre-training with multi-source multimodal knowledge memory
Z Hu, A Iscen, C Sun, Z Wang, KW Chang, Y Sun, C Schmid, DA Ross, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
1782023
The power of comparative reasoning
J Yagnik, D Strelow, DA Ross, R Lin
2011 International Conference on Computer Vision, 2431-2438, 2011
1772011
An lstm approach to temporal 3d object detection in lidar point clouds
R Huang, W Zhang, A Kundu, C Pantofaru, DA Ross, T Funkhouser, ...
European Conference on Computer Vision, 266-282, 2020
1492020
Adaptive discriminative generative model and its applications
RS Lin, D Ross, J Lim, MH Yang
Advances in neural information processing systems 17, 2004
1302004
Learn to dance with aist++: Music conditioned 3d dance generation
R Li, S Yang, DA Ross, A Kanazawa
arXiv preprint arXiv:2101.08779 2 (3), 2021
1242021
Scenecraft: An llm agent for synthesizing 3d scenes as blender code
Z Hu, A Iscen, A Jain, T Kipf, Y Yue, DA Ross, C Schmid, A Fathi
Forty-first International Conference on Machine Learning, 2024
1022024
Unloc: A unified framework for video localization tasks
S Yan, X Xiong, A Nagrani, A Arnab, Z Wang, W Ge, D Ross, C Schmid
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
1002023
The system can't perform the operation now. Try again later.
Articles 1–20