| Data augmentation using GANs for speech emotion recognition. A Chatziagapi, G Paraskevopoulos, D Sgouropoulos, G Pantazopoulos, ... Interspeech, 171-175, 2019 | 200 | 2019 |
| Learning to see but forgetting to follow: Visual instruction tuning makes llms more prone to jailbreak attacks G Pantazopoulos, A Parekh, M Nikandrou, A Suglia Safety4ConvAI @ LREC-COLING 2024, 2024 | 16 | 2024 |
| Multitask multimodal prompted training for interactive embodied task completion G Pantazopoulos, M Nikandrou, A Parekh, B Hemanthage, A Eshghi, ... EMLP 2023, 2023 | 12 | 2023 |
| Task formulation matters when learning continually: A case study in visual question answering M Nikandrou, L Yu, A Suglia, I Konstas, V Rieser arXiv preprint arXiv:2210.00044, 2022 | 12 | 2022 |
| Quality-agnostic image captioning to safely assist people with vision impairment L Yu, M Nikandrou, J Jin, V Rieser IJCAI 2023, 2023 | 9 | 2023 |
| Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users A Karamolegkou, M Nikandrou, G Pantazopoulos, DS Villegas, P Rust, ... arXiv preprint arXiv:2503.22610, 2025 | 8 | 2025 |
| Going for GOAL: A resource for grounded football commentaries A Suglia, J Lopes, E Bastianelli, A Vanzo, S Agarwal, M Nikandrou, L Yu, ... arXiv preprint arXiv:2211.04534, 2022 | 8 | 2022 |
| ViCA: Combining visual, social, and task-oriented conversational AI in a healthcare setting G Pantazopoulos, J Bruyere, M Nikandrou, T Boissier, S Hemanthage, ... Proceedings of the 2021 International Conference on Multimodal Interaction …, 2021 | 8 | 2021 |
| Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation M Nikandrou, G Pantazopoulos, I Konstas, A Suglia ALVR @ ACL 2024, 2024 | 7 | 2024 |
| CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts M Nikandrou, G Pantazopoulos, N Vitsakis, I Konstas, A Suglia NAACL 2025, 2024 | 6 | 2024 |
| Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling G Pantazopoulos, M Nikandrou, A Suglia, O Lemon, A Eshghi EMNLP 2024, 2024 | 5 | 2024 |
| Demonstrating EMMA: Embodied MultiModal Agent for Language-guided Action Execution in 3D Simulated Environments A Suglia, B Hemanthage, M Nikandrou, G Pantazopoulos, A Parekh, ... Proceedings of the 23rd Annual Meeting of the Special Interest Group on …, 2022 | 2 | 2022 |
| EMMA: A Foundation Model for Embodied, Interactive, Multimodal Task Completion in 3D Environments A Parekh, M Nikandrou, G Pantazopoulos, B Hemanthage, A Eshghi, ... Alexa Prize SimBot Challenge Proceedings, 2023 | 1 | 2023 |