[go: up one dir, main page]

Follow
Vasudev Lal
Vasudev Lal
Verified email at oracle.com - Homepage
Title
Cited by
Cited by
Year
xgen-mm (blip-3): A family of open large multimodal models
L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ...
arXiv preprint arXiv:2408.08872, 2024
1892024
Bridgetower: Building bridges between encoders in vision-language representation learning
X Xu, C Wu, S Rosenman, V Lal, W Che, N Duan
Proceedings of the AAAI Conference on Artificial Intelligence 37 (9), 10637 …, 2023
1102023
Vl-interpret: An interactive visualization tool for interpreting vision-language transformers
E Aflalo, M Du, SY Tseng, Y Liu, C Wu, N Duan, V Lal
Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2022
822022
Ldm3d: Latent diffusion model for 3d
GBM Stan, D Wofk, S Fox, A Redden, W Saxton, J Yu, E Aflalo, SY Tseng, ...
arXiv preprint arXiv:2305.10853, 2023
712023
Brain encoding models based on multimodal transformers can transfer across language and vision
J Tang, M Du, V Vo, V Lal, A Huth
Advances in Neural Information Processing Systems 36, 29654-29666, 2023
682023
Lvlm-intrepret: An interpretability tool for large vision-language models
G Ben Melech Stan, E Aflalo, RY Rohekar, A Bhiwandiwalla, SY Tseng, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
572024
Coco-counterfactuals: Automatically constructed counterfactual examples for image-text pairs
T Le, V Lal, P Howard
Advances in Neural Information Processing Systems 36, 71195-71221, 2023
462023
Socialcounterfactuals: Probing and mitigating intersectional social biases in vision-language models with counterfactual examples
P Howard, A Madasu, T Le, GL Moreno, A Bhiwandiwalla, V Lal
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
452024
Kd-vlp: Improving end-to-end vision-and-language pretraining with object knowledge distillation
Y Liu, C Wu, S Tseng, V Lal, X He, N Duan
Findings of the Association for Computational Linguistics: NAACL 2022, 1589-1600, 2022
382022
Neurocounterfactuals: Beyond minimal-edit counterfactuals for richer data augmentation
P Howard, G Singer, V Lal, Y Choi, S Swayamdipta
arXiv preprint arXiv:2210.12365, 2022
312022
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
A Chatterjee, GBM Stan, E Aflalo, S Paul, D Ghosh, T Gokhale, L Schmidt, ...
European Conference on Computer Vision, 204-222, 2024
292024
Is your paper being reviewed by an llm? investigating ai text detectability in peer review
S Yu, M Luo, A Madasu, V Lal, P Howard
arXiv preprint arXiv:2410.03019, 2024
252024
InterpreT: An interactive visualization tool for interpreting transformers
V Lal, A Ma, E Aflalo, P Howard, A Simoes, D Korat, O Pereg, G Singer, ...
Proceedings of the 16th Conference of the European Chapter of the …, 2021
232021
Cross-domain aspect extraction using transformers augmented with knowledge graphs
P Howard, A Ma, V Lal, AP Simoes, D Korat, O Pereg, M Wasserblat, ...
Proceedings of the 31st ACM International Conference on Information …, 2022
222022
Llava-gemma: Accelerating multimodal foundation models with a compact language model
M Hinck, ML Olson, D Cobbley, SY Tseng, V Lal
arXiv preprint arXiv:2404.01331, 2024
212024
LVLM-Interpret: an interpretability tool for large vision-language models
GBM Stan, E Aflalo, RY Rohekar, A Bhiwandiwalla, SY Tseng, ML Olson, ...
arXiv preprint arXiv:2404.03118, 2024
182024
Neuroprompts: An adaptive framework to optimize prompts for text-to-image generation
S Rosenman, V Lal, P Howard
Proceedings of the 18th Conference of the European Chapter of the …, 2024
172024
Opinion-based relational pivoting for cross-domain aspect term extraction
A Klein, O Pereg, D Korat, V Lal, M Wasserblat, I Dagan
Proceedings of the 12th workshop on computational approaches to subjectivity …, 2022
172022
Why do llava vision-language models reply to images in english?
M Hinck, C Holtermann, ML Olson, F Schneider, S Yu, A Bhiwandiwalla, ...
Findings of the Association for Computational Linguistics: EMNLP 2024, 13402 …, 2024
162024
Improving video retrieval using multilingual knowledge transfer
A Madasu, E Aflalo, G Ben Melech Stan, SY Tseng, G Bertasius, V Lal
European Conference on Information Retrieval, 669-684, 2023
162023
The system can't perform the operation now. Try again later.
Articles 1–20