Sushant Gautam
SimulaMet (Simula Metropolitan Center for Digital Engineering)
Verified email at simula.no
Title
Cited by
Year
Kvasir-VQA: A Text-Image Pair GI Tract Dataset
S Gautam, AM Storås, C Midoglu, SA Hicks, V Thambawita, P Halvorsen, ...
Proceedings of the First International Workshop on Vision-Language Models …, 2024
Cited by: 35, Year: 2024
Soccer Game Summarization using Audio Commentary, Metadata, and Captions
S Gautam, C Midoglu, S Shafiee Sabet, DB Kshatri, P Halvorsen
Proceedings of the 1st Workshop on User-centric Narrative Summarization of …, 2022
Cited by: 35, Year: 2022
AI-Based Sports Highlight Generation for Social Media
C Midoglu, SS Sabet, MH Sarkhoosh, M Majidi, S Gautam, HM Solberg, ...
Proceedings of the 3rd Mile-High Video Conference, 7-13, 2024
Cited by: 29, Year: 2024
Bridging multimedia modalities: enhanced multimodal AI understanding and intelligent agents
S Gautam
Proceedings of the 25th International Conference on Multimodal Interaction …, 2023
Cited by: 23, Year: 2023
Multimodal AI-based summarization and storytelling for soccer on social media
MH Sarkhoosh, S Gautam, C Midoglu, SS Sabet, P Halvorsen
Proceedings of the 15th ACM multimedia systems conference, 485-491, 2024
Cited by: 21, Year: 2024
Assisting Soccer Game Summarization via Audio Intensity Analysis of Game Highlights
S Gautam, C Midoglu, SS Sabet, DB Kshatri, P Halvorsen
Proceedings of 12th IOE Graduate Conference 12, 25-32, 2022
Cited by: 21, Year: 2022
Soccer on social media
MH Sarkhoosh, SMM Dorcheh, S Gautam, C Midoglu, SS Sabet, ...
arXiv preprint arXiv:2310.12328, 2023
Cited by: 18, Year: 2023
ImageCLEF 2025: multimedia retrieval in medical, social media and content recommendation applications
B Ionescu, H Müller, DC Stanciu, A Idrissi-Yaghir, A Radzhabov, ...
European Conference on Information Retrieval, 398-406, 2025
Cited by: 13*, Year: 2025
PlayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips
H Solberg, MH Sarkhoosh, S Gautam, SS Sabet, P Halvorsen, C Midoglu
2024 International Symposium on Multimedia (ISM), 93-97, 2024
Cited by: 10, Year: 2024
SoccerNet-Echoes: A Soccer game audio commentary dataset
S Gautam, MH Sarkhoosh, J Held, C Midoglu, A Cioppa, S Giancola, ...
2024 International Symposium on Multimedia (ISM), 71-78, 2024
Cited by: 10, Year: 2024
FactGenius: Combining zero-shot prompting and fuzzy relation mining to improve fact verification with knowledge graphs
S Gautam, R Pop
Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER …, 2024
Cited by: 9, Year: 2024
The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch
MH Sarkhoosh, S Gautam, C Midoglu, SS Sabet, T Torjusen, P Halvorsen
Proceedings of the 15th ACM Multimedia Systems Conference, 353-359, 2024
Cited by: 8, Year: 2024
Prompt to Polyp: Medical Text-Conditioned Image Synthesis with Diffusion Models
M Chaichuk, S Gautam, S Hicks, E Tutubalina
arXiv preprint arXiv:2505.05573, 2025
Cited by: 7*, Year: 2025
Demo: Soccer Information Retrieval via Natural Queries using SoccerRAG
AT Strand, S Gautam, C Midoglu, P Halvorsen
2024 International Conference on Content-Based Multimedia Indexing (CBMI), 1-7, 2024
Cited by: 7*, Year: 2024
Enhancing structured-data retrieval with GraphRAG: Soccer data case study
Z Sepasdar, S Gautam, C Midoglu, MA Riegler, P Halvorsen
arXiv preprint arXiv:2409.17580, 2024
Cited by: 6, Year: 2024
Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy
S Gautam, M Riegler, P Halvorsen
Data Engineering in Medical Imaging: Third MICCAI Workshop, DEMI 2025, Held …, 2025
Cited by: 5, Year: 2025
Medico 2025: Visual Question Answering for Gastrointestinal Imaging
S Gautam, V Thambawita, M Riegler, P Halvorsen, S Hicks
arXiv preprint arXiv:2508.10869, 2025
Cited by: 5, Year: 2025
Soccer-GraphRAG: Applications of GraphRAG in Soccer
Z Sepasdar, S Gautam, C Midoglu, MA Riegler, P Halvorsen
International Workshop on Graph-Based Approaches in Information Retrieval, 1-10, 2024
Cited by: 5, Year: 2024
AI-based Soccer Game Summarization: From Video Highlights to Dynamic Text Summaries
S Gautam
Tribhuvan University, Institute of Engineering, Thapathali Campus, 2022
Cited by: 5, Year: 2022
Comparative analysis of audio feature extraction for real-time talking portrait synthesis
P Salehi, SA Sheshkal, V Thambawita, S Gautam, SS Sabet, D Johansen, ...
Big Data and Cognitive Computing 9 (3), 59, 2025
Cited by: 3, Year: 2025