[go: up one dir, main page]

Follow
Georgios Pantazopoulos
Georgios Pantazopoulos
Other namesGeorge Pantazopoulos
Verified email at hw.ac.uk - Homepage
Title
Cited by
Cited by
Year
Data augmentation using GANs for speech emotion recognition.
A Chatziagapi, G Paraskevopoulos, D Sgouropoulos, G Pantazopoulos, ...
Interspeech, 171-175, 2019
2002019
Learning to see but forgetting to follow: Visual instruction tuning makes llms more prone to jailbreak attacks
G Pantazopoulos, A Parekh, M Nikandrou, A Suglia
arXiv preprint arXiv:2405.04403, 2024
162024
Combine to describe: Evaluating compositional generalization in image captioning
G Pantazopoulos, A Suglia, A Eshghi
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
162022
Multitask multimodal prompted training for interactive embodied task completion
G Pantazopoulos, M Nikandrou, A Parekh, B Hemanthage, A Eshghi, ...
arXiv preprint arXiv:2311.04067, 2023
122023
Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users
A Karamolegkou, M Nikandrou, G Pantazopoulos, DS Villegas, P Rust, ...
arXiv preprint arXiv:2503.22610, 2025
82025
Lost in space: Probing fine-grained spatial understanding in vision and language resamplers
G Pantazopoulos, A Suglia, O Lemon, A Eshghi
arXiv preprint arXiv:2404.13594, 2024
82024
ViCA: Combining visual, social, and task-oriented conversational AI in a healthcare setting
G Pantazopoulos, J Bruyere, M Nikandrou, T Boissier, S Hemanthage, ...
Proceedings of the 2021 International Conference on Multimodal Interaction …, 2021
82021
Enhancing continual learning in visual question answering with modality-aware feature distillation
M Nikandrou, G Pantazopoulos, I Konstas, A Suglia
arXiv preprint arXiv:2406.19297, 2024
72024
CROPE: Evaluating in-context adaptation of vision and language models to culture-specific concepts
M Nikandrou, G Pantazopoulos, N Vitsakis, I Konstas, A Suglia
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of …, 2025
62025
Shaking up vlms: Comparing transformers and structured state space models for vision & language modeling
G Pantazopoulos, M Nikandrou, A Suglia, O Lemon, A Eshghi
arXiv preprint arXiv:2409.05395, 2024
52024
Using Oliver API for emotion-aware movie content characterization
T Giannakopoulos, S Dimopoulos, G Pantazopoulos, A Chatziagapi, ...
2019 International Conference on Content-Based Multimedia Indexing (CBMI), 1-4, 2019
32019
Towards understanding visual grounding in visual language models
G Pantazopoulos, EB Özyiğit
arXiv preprint arXiv:2509.10345, 2025
22025
Demonstrating EMMA: Embodied MultiModal Agent for Language-guided Action Execution in 3D Simulated Environments
A Suglia, B Hemanthage, M Nikandrou, G Pantazopoulos, A Parekh, ...
Proceedings of the 23rd Annual Meeting of the Special Interest Group on …, 2022
22022
EMMA: A Foundation Model for Embodied, Interactive, Multimodal Task Completion in 3D Environments
A Parekh, M Nikandrou, G Pantazopoulos, B Hemanthage, A Eshghi, ...
12023
An Efficient Training Pipeline for Reasoning Graphical User Interface Agents
G Pantazopoulos, EB Özyiğit
arXiv preprint arXiv:2511.08172, 2025
2025
The system can't perform the operation now. Try again later.
Articles 1–15