| Scene graph reasoning for visual question answering M Hildebrandt*, H Li*, R Koner*, V Tresp, S Günnemann ICML 2020: GRL+, 2020 | 101 | 2020 |
| Relationformer: A Unified Framework for Image-to-Graph Generation S Shit*, R Koner*, B Wittmann, J Paetzold, I Ezhov, H Li, J Pan, ... ECCV, 2022 | 98 | 2022 |
| OODformer: Out-of-distribution detection transformer R Koner, P Sinhamahapatra, K Roscher, S Günnemann, V Tresp The British Machine Vision Conference (BMVC), 2021 | 64 | 2021 |
| Graphhopper: Multi-hop scene graph reasoning for visual question answering R Koner, H Li, M Hildebrandt, D Das, V Tresp, S Günnemann International Semantic Web Conference, 111-127, 2021 | 51 | 2021 |
| Relation transformer network R Koner, S Shit, V Tresp arXiv preprint arXiv:2004.06193, 2020 | 43 | 2020 |
| Improving visual relation detection using depth maps S Sharifzadeh, SM Baharlou, M Berrendorf, R Koner, V Tresp 2020 25th International Conference on Pattern Recognition (ICPR), 3597-3604, 2021 | 31 | 2021 |
| Instanceformer: An online video instance segmentation framework R Koner, VT T Hannan, S Shit, S Sharifzadeh, M Schubert, T Seidl AAAI, 2023 | 22 | 2023 |
| LookupViT: Compressing visual information to a limited number of tokens R Koner, G Jain, P Jain, V Tresp, S Paul ECCV, 2024 | 17 | 2024 |
| Do DALL-E and Flamingo understand each other? H Li, J Gu, R Koner, S Sharifzadeh, V Tresp Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 17 | 2023 |
| GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation T Hannan*, R Koner*, M Bernhard, S Shit, B Menze, V Tresp, M Schubert, ... (ORAL) 2024 27th International Conference on Pattern Recognition (ICPR), 2023 | 10 | 2023 |
| Is it all a cluster game?--Exploring Out-of-Distribution Detection based on Clustering in the Embedding Space P Sinhamahapatra, R Koner, K Roscher, S Günnemann SafeAI@AAAI, 2022 | 8 | 2022 |
| Box Supervised Video Segmentation Proposal Network T Hannan*, R Koner*, J Kobold, M Schubert arXiv preprint arXiv:2202.07025, 2022 | 8 | 2022 |
| Scenes and surroundings: Scene graph generation using relation transformer R Koner, P Sinhamahapatra, V Tresp ICML 2020, GNN & BEYOND, 2021 | 7 | 2021 |
| Vesselformer: Towards complete 3d vessel graph generation from images C Prabhakar, S Shit, JC Paetzold, I Ezhov, R Koner, H Li, FS Kofler, ... Medical Imaging with Deep Learning, 320-331, 2024 | 5 | 2024 |
| Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries R Amoroso, G Zhang, R Koner, L Baraldi, R Cucchiara, V Tresp 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV …, 2025 | 2 | 2025 |
| Random finite set based Bayesian filtering with OpenCL in a heterogeneous platform B Hu, U Sharif, R Koner, G Chen, K Huang, F Zhang, W Stechele, A Knoll Sensors 17 (4), 843, 2017 | 2 | 2017 |
| EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models W Zhu, X Chen, Z Wang, S Tang, S Ghosh, X Dong, R Koner, Y Wang arXiv preprint arXiv:2508.11886, 2025 | 1 | 2025 |
| SVAG-Bench: A Large-Scale Benchmark for Multi-Instance Spatio-temporal Video Action Grounding T Hannan, S Wu, M Weber, S Shit, J Gu, R Koner, A Ošep, L Leal-Taixé, ... arXiv preprint arXiv:2510.13016, 2025 | | 2025 |
| Local2Global query Alignment for Video Instance Segmentation R Koner, Z Wang, S Parthasarathy, C Chen Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2025 | | 2025 |