David Ross

Cited by

	All	Since 2021
Citations	13993	8030
h-index	38	29
i10-index	61	45

3400

1700

850

2550

20062007200820092010201120122013201420152016201720182019202020212022202320242025202665 77 83 143 195 256 334 448 569 655 680 516 572 602 622 857 947 1098 1630 3304 190

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Ming-Hsuan YangUniversity of California at Merced; Google DeepMindVerified email at ucmerced.edu
Ruei-Sung LinPAII IncVerified email at paii-labs.com
Jongwoo LimProfessor, Seoul National University, Dept. of Mechanical EngineeringVerified email at snu.ac.kr
Sudheendra VijayanarasimhanGoogle Inc.Verified email at cs.utexas.edu
Cordelia SchmidResearch director INRIA Verified email at inria.fr
Alireza FathiGoogle DeepMindVerified email at cs.stanford.edu
Rahul SukthankarGoogle DeepMindVerified email at google.com
Caroline Rebecca PantofaruGoogleVerified email at google.com
Chen SunAssistant Professor, Brown UniversityVerified email at brown.edu
Bryan SeyboldGoogle IncVerified email at google.com
Abhijit KunduGoogle DeepMindVerified email at google.com
Richard ZemelProfessor of Computer Science, University of TorontoVerified email at cs.toronto.edu

David Ross

Google DeepMind

Verified email at google.com - Homepage

computer vision video understanding machine learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Incremental learning for robust visual tracking DA Ross, J Lim, RS Lin, MH Yang International journal of computer vision 77 (1), 125-141, 2008	4159	2008
Ava: A video dataset of spatio-temporally localized atomic visual actions C Gu, C Sun, DA Ross, C Vondrick, C Pantofaru, Y Li, ... Proceedings of the IEEE conference on computer vision and pattern …, 2018	1462	2018
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025	1320	2025
Rethinking the faster r-cnn architecture for temporal action localization YW Chao, S Vijayanarasimhan, B Seybold, DA Ross, J Deng, ... Proceedings of the IEEE conference on computer vision and pattern …, 2018	910	2018
Ai choreographer: Music conditioned 3d dance generation with aist++ R Li, S Yang, DA Ross, A Kanazawa Proceedings of the IEEE/CVF international conference on computer vision …, 2021	778	2021
Language Model Beats Diffusion--Tokenizer is Key to Visual Generation L Yu, J Lezama, NB Gundavarapu, L Versari, K Sohn, D Minnen, Y Cheng, ... arXiv preprint arXiv:2310.05737, 2023	565	2023
Incremental learning for visual tracking J Lim, D Ross, RS Lin, MH Yang Advances in neural information processing systems 17, 2004	423	2004
Videopoet: A large language model for zero-shot video generation D Kondratyuk, L Yu, X Gu, J Lezama, J Huang, G Schindler, R Hornung, ... arXiv preprint arXiv:2312.14125, 2023	418	2023
Pillar-based object detection for autonomous driving Y Wang, A Fathi, A Kundu, DA Ross, C Pantofaru, T Funkhouser, ... European Conference on Computer Vision, 18-34, 2020	297	2020
D3d: Distilled 3d networks for video action recognition J Stroud, D Ross, C Sun, J Deng, R Sukthankar Proceedings of the IEEE/CVF winter conference on applications of computer …, 2020	260	2020
Virtual multi-view fusion for 3d semantic segmentation A Kundu, X Yin, A Fathi, D Ross, B Brewington, T Funkhouser, ... European conference on computer vision, 518-535, 2020	229	2020
Adaptive probabilistic visual tracking with incremental subspace update D Ross, J Lim, MH Yang European conference on computer vision, 470-482, 2004	216	2004
The ava-kinetics localized human actions video dataset A Li, M Thotakuri, DA Ross, J Carreira, A Vostrikov, A Zisserman arXiv preprint arXiv:2005.00214, 2020	186	2020
Reveal: Retrieval-augmented visual-language pre-training with multi-source multimodal knowledge memory Z Hu, A Iscen, C Sun, Z Wang, KW Chang, Y Sun, C Schmid, DA Ross, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	178	2023
The power of comparative reasoning J Yagnik, D Strelow, DA Ross, R Lin 2011 International Conference on Computer Vision, 2431-2438, 2011	177	2011
An lstm approach to temporal 3d object detection in lidar point clouds R Huang, W Zhang, A Kundu, C Pantofaru, DA Ross, T Funkhouser, ... European Conference on Computer Vision, 266-282, 2020	149	2020
Adaptive discriminative generative model and its applications RS Lin, D Ross, J Lim, MH Yang Advances in neural information processing systems 17, 2004	130	2004
Learn to dance with aist++: Music conditioned 3d dance generation R Li, S Yang, DA Ross, A Kanazawa arXiv preprint arXiv:2101.08779 2 (3), 2021	124	2021
Scenecraft: An llm agent for synthesizing 3d scenes as blender code Z Hu, A Iscen, A Jain, T Kipf, Y Yue, DA Ross, C Schmid, A Fathi Forty-first International Conference on Machine Learning, 2024	102	2024
Unloc: A unified framework for video localization tasks S Yan, X Xiong, A Nagrani, A Arnab, Z Wang, W Ge, D Ross, C Schmid Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	100	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors