Vasudev Lal

Cited by

	All	Since 2021
Citations	1103	1102
h-index	17	17
i10-index	25	25

660

330

165

495

2021202220232024202520266 22 96 314 644 18

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Phillip HowardAI Researcher, ThoughtworksVerified email at thoughtworks.com
Shao-Yen TsengIntel LabsVerified email at intel.com
Estelle AflaloIntel LabsVerified email at intel.com
Gabriela Ben Melech StanResearcher Intel LabsVerified email at intel.com
Avinash MadasuIntel Corporation, UNC Chapel HillVerified email at intel.com
Shachar RosenmanIntel LabsVerified email at intel.com
Anahita BhiwandiwallaNVIDIAVerified email at nvidia.com
Chenfei Wu(吴晨飞)Tongyi Lab, AlibabaVerified email at alibaba-inc.com
Nan DuanVice President of JD.Com (now) | StepFun | Microsoft ResearchVerified email at microsoft.com
Tiep LeIntel LabVerified email at intel.com
Man LuoAbridge AI IncVerified email at asu.edu
Sungduk YuOracleVerified email at yale.edu
Musashi HinckPoint72Verified email at princeton.edu
Gadi SingerVP and Lab Director, Intel CorpVerified email at intel.com
Raanan Y. Yehezkel RohekarAI Research Scientist, Intel LabsVerified email at intel.com
Yaniv GurwiczResearch Scientist, Intel LabsVerified email at intel.com
Diana WofkIntelVerified email at intel.com
Neale RatzlaffOracleVerified email at intel.com
Daniel KoratIntel LabsVerified email at intel.com
Meng DuUCLAVerified email at ucla.edu

Vasudev Lal

Oracle

Verified email at oracle.com - Homepage

AI Deep Learning CV NLP


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
xgen-mm (blip-3): A family of open large multimodal models L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ... arXiv preprint arXiv:2408.08872, 2024	189	2024
Bridgetower: Building bridges between encoders in vision-language representation learning X Xu, C Wu, S Rosenman, V Lal, W Che, N Duan Proceedings of the AAAI Conference on Artificial Intelligence 37 (9), 10637 …, 2023	110	2023
Vl-interpret: An interactive visualization tool for interpreting vision-language transformers E Aflalo, M Du, SY Tseng, Y Liu, C Wu, N Duan, V Lal Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2022	82	2022
Ldm3d: Latent diffusion model for 3d GBM Stan, D Wofk, S Fox, A Redden, W Saxton, J Yu, E Aflalo, SY Tseng, ... arXiv preprint arXiv:2305.10853, 2023	71	2023
Brain encoding models based on multimodal transformers can transfer across language and vision J Tang, M Du, V Vo, V Lal, A Huth Advances in Neural Information Processing Systems 36, 29654-29666, 2023	68	2023
Lvlm-intrepret: An interpretability tool for large vision-language models G Ben Melech Stan, E Aflalo, RY Rohekar, A Bhiwandiwalla, SY Tseng, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	57	2024
Coco-counterfactuals: Automatically constructed counterfactual examples for image-text pairs T Le, V Lal, P Howard Advances in Neural Information Processing Systems 36, 71195-71221, 2023	46	2023
Socialcounterfactuals: Probing and mitigating intersectional social biases in vision-language models with counterfactual examples P Howard, A Madasu, T Le, GL Moreno, A Bhiwandiwalla, V Lal Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	45	2024
Kd-vlp: Improving end-to-end vision-and-language pretraining with object knowledge distillation Y Liu, C Wu, S Tseng, V Lal, X He, N Duan Findings of the Association for Computational Linguistics: NAACL 2022, 1589-1600, 2022	38	2022
Neurocounterfactuals: Beyond minimal-edit counterfactuals for richer data augmentation P Howard, G Singer, V Lal, Y Choi, S Swayamdipta arXiv preprint arXiv:2210.12365, 2022	31	2022
Getting it Right: Improving Spatial Consistency in Text-to-Image Models A Chatterjee, GBM Stan, E Aflalo, S Paul, D Ghosh, T Gokhale, L Schmidt, ... European Conference on Computer Vision, 204-222, 2024	29	2024
Is your paper being reviewed by an llm? investigating ai text detectability in peer review S Yu, M Luo, A Madasu, V Lal, P Howard arXiv preprint arXiv:2410.03019, 2024	25	2024
InterpreT: An interactive visualization tool for interpreting transformers V Lal, A Ma, E Aflalo, P Howard, A Simoes, D Korat, O Pereg, G Singer, ... Proceedings of the 16th Conference of the European Chapter of the …, 2021	23	2021
Cross-domain aspect extraction using transformers augmented with knowledge graphs P Howard, A Ma, V Lal, AP Simoes, D Korat, O Pereg, M Wasserblat, ... Proceedings of the 31st ACM International Conference on Information …, 2022	22	2022
Llava-gemma: Accelerating multimodal foundation models with a compact language model M Hinck, ML Olson, D Cobbley, SY Tseng, V Lal arXiv preprint arXiv:2404.01331, 2024	21	2024
LVLM-Interpret: an interpretability tool for large vision-language models GBM Stan, E Aflalo, RY Rohekar, A Bhiwandiwalla, SY Tseng, ML Olson, ... arXiv preprint arXiv:2404.03118, 2024	18	2024
Neuroprompts: An adaptive framework to optimize prompts for text-to-image generation S Rosenman, V Lal, P Howard Proceedings of the 18th Conference of the European Chapter of the …, 2024	17	2024
Opinion-based relational pivoting for cross-domain aspect term extraction A Klein, O Pereg, D Korat, V Lal, M Wasserblat, I Dagan Proceedings of the 12th workshop on computational approaches to subjectivity …, 2022	17	2022
Why do llava vision-language models reply to images in english? M Hinck, C Holtermann, ML Olson, F Schneider, S Yu, A Bhiwandiwalla, ... Findings of the Association for Computational Linguistics: EMNLP 2024, 13402 …, 2024	16	2024
Improving video retrieval using multilingual knowledge transfer A Madasu, E Aflalo, G Ben Melech Stan, SY Tseng, G Bertasius, V Lal European Conference on Information Retrieval, 669-684, 2023	16	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors