Rajat Koner

Cited by

	All	Since 2021
Citations	487	476
h-index	10	10
i10-index	10	10

Citations per year

Year	Citations
2020	9
2021	27
2022	73
2023	101
2024	133
2025	139
2026	2

Public access

3 articles available, 0 articles not available (based on funding mandates)

Co-authors

Volker Tresp: Ludwig-Maximilians-Universität München (LMU Munich). Verified email at dbs.ifi.lmu.de
Stephan Günnemann: Professor of Computer Science, Technical University of Munich. Verified email at in.tum.de
Sahand Sharifzadeh: Google DeepMind. Verified email at deepmind.com
Hang Li: Ludwig Maximilian University of Munich, Siemens AG. Verified email at campus.lmu.de
Marcel Hildebrandt: Siemens Technology, LMU. Verified email at siemens.com
Max Berrendorf: DeepL. Verified email at deepl.com
Prateek Jain: Google Research India. Verified email at google.com
Alois Knoll: Technische Universität München. Verified email at in.tum.de
Sujoy Paul: Research Scientist at Google DeepMind. Verified email at ucr.edu

Rajat Koner

LMU, Munich, intern at DeepMind

Verified email at google.com

Computer Vision, Scene/Video/Multi-Modal Understandings


Title	Authors	Venue	Cited by	Year
Scene graph reasoning for visual question answering	M Hildebrandt, H Li, R Koner*, V Tresp, S Günnemann	ICML 2020: GRL+, 2020	101	2020
Relationformer: A Unified Framework for Image-to-Graph Generation	S Shit, R Koner, B Wittmann, J Paetzold, I Ezhov, H Li, J Pan, ...	ECCV, 2022	98	2022
Oodformer: Out-of-distribution detection transformer	R Koner, P Sinhamahapatra, K Roscher, S Günnemann, V Tresp	The British Machine Vision Conference (BMVC), 2021	64	2021
Graphhopper: Multi-hop scene graph reasoning for visual question answering	R Koner, H Li, M Hildebrandt, D Das, V Tresp, S Günnemann	International Semantic Web Conference, 111-127, 2021	51	2021
Relation transformer network	R Koner, S Shit, V Tresp	arXiv preprint arXiv:2004.06193, 2020	43	2020
Improving visual relation detection using depth maps	S Sharifzadeh, SM Baharlou, M Berrendorf, R Koner, V Tresp	2020 25th International Conference on Pattern Recognition (ICPR), 3597-3604, 2021	31	2021
Instanceformer: An online video instance segmentation framework	R Koner, VT T Hannan, S Shit, S Sharifzadeh, M Schubert, T Seidl	AAAI 2023, 2023	22*	2023
LookupViT: Compressing visual information to a limited number of tokens	R Koner, G Jain, P Jain, V Tresp, S Paul	ECCV, 2024	17	2024
Do dall-e and flamingo understand each other?	H Li, J Gu, R Koner, S Sharifzadeh, V Tresp	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	17	2023
GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation	T Hannan, R Koner, M Bernhard, S Shit, B Menze, V Tresp, M Schubert, ...	(ORAL) 2024 27th International Conference on Pattern Recognition (ICPR), 2023	10	2023
Is it all a cluster game?--Exploring Out-of-Distribution Detection based on Clustering in the Embedding Space	P Sinhamahapatra, R Koner, K Roscher, S Günnemann	SafeAI@AAAI, 2022	8	2022
Box Supervised Video Segmentation Proposal Network	T Hannan, R Koner, J Kobold, M Schubert	arXiv preprint arXiv:2202.07025, 2022	8	2022
Scenes and surroundings: Scene graph generation using relation transformer	R Koner, P Sinhamahapatra, V Tresp	ICML 2020, GNN & BEYOND, 2021	7	2021
Vesselformer: Towards complete 3d vessel graph generation from images	C Prabhakar, S Shit, JC Paetzold, I Ezhov, R Koner, H Li, FS Kofler, ...	Medical Imaging with Deep Learning, 320-331, 2024	5	2024
Perceive. Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries	R Amoroso, G Zhang, R Koner, L Baraldi, R Cucchiara, V Tresp	2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV …, 2025	2	2025
Random finite set based Bayesian filtering with OpenCL in a heterogeneous platform	B Hu, U Sharif, R Koner, G Chen, K Huang, F Zhang, W Stechele, A Knoll	Sensors 17 (4), 843, 2017	2	2017
EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models	W Zhu, X Chen, Z Wang, S Tang, S Ghosh, X Dong, R Koner, Y Wang	arXiv preprint arXiv:2508.11886, 2025	1	2025
SVAG-Bench: A Large-Scale Benchmark for Multi-Instance Spatio-temporal Video Action Grounding	T Hannan, S Wu, M Weber, S Shit, J Gu, R Koner, A Ošep, L Leal-Taixé, ...	arXiv preprint arXiv:2510.13016, 2025		2025
Local2Global query Alignment for Video Instance Segmentation	R Koner, Z Wang, S Parthasarathy, C Chen	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2025		2025

Articles 1–19
