Shao-Yen Tseng

Cited by

	All	Since 2021
Citations	888	691
h-index	16	12
i10-index	25	15

360

180

270

201020112012201320142015201620172018201920202021202220232024202520265 12 9 15 12 21 16 22 16 25 40 40 43 67 173 359 9

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Shao-Yen Tseng

Intel Labs

Verified email at intel.com - Homepage

Machine Learning Natural Language Processing Speech and Audio Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
xgen-mm (blip-3): A family of open large multimodal models L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ... arXiv preprint arXiv:2408.08872, 2024	189	2024
Vl-interpret: An interactive visualization tool for interpreting vision-language transformers E Aflalo, M Du, SY Tseng, Y Liu, C Wu, N Duan, V Lal Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2022	82	2022
Ldm3d: Latent diffusion model for 3d GBM Stan, D Wofk, S Fox, A Redden, W Saxton, J Yu, E Aflalo, SY Tseng, ... arXiv preprint arXiv:2305.10853, 2023	71	2023
LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models G Ben Melech Stan, R Yehezkel Rohekar, Y Gurwicz, ML Olson, ... arXiv e-prints, arXiv: 2404.03118, 2024	57	2024
Kd-vlp: Improving end-to-end vision-and-language pretraining with object knowledge distillation Y Liu, C Wu, S Tseng, V Lal, X He, N Duan Findings of the Association for Computational Linguistics: NAACL 2022, 1589-1600, 2022	38	2022
Design of heart rate variability processor for portable 3-lead ECG monitoring system-on-chip WC Fang, HC Huang, SY Tseng Expert Systems with Applications 40 (5), 1491-1504, 2013	38	2013
Advanced ECG processor with HRV analysis for real-time portable health monitoring CC Chou, SY Tseng, E Chua, YC Lee, WC Fang, HC Huang 2011 IEEE International Conference on Consumer Electronics-Berlin (ICCE …, 2011	29	2011
Multimodal embeddings from language models for emotion recognition in the wild SY Tseng, S Narayanan, P Georgiou IEEE Signal Processing Letters 28, 608-612, 2021	25	2021
Multiple instance deep learning for weakly supervised small-footprint audio event detection SY Tseng, J Li, Y Wang, J Szurley, F Metze, S Das arXiv preprint arXiv:1712.09673, 2017	25	2017
Couples Behavior Modeling and Annotation Using Low-Resource LSTM Language Models. SY Tseng, SN Chakravarthula, BR Baucom, PG Georgiou Interspeech, 898-902, 2016	22	2016
Llava-gemma: Accelerating multimodal foundation models with a compact language model M Hinck, ML Olson, D Cobbley, SY Tseng, V Lal arXiv preprint arXiv:2404.01331, 2024	21	2024
Approaching Human Performance in Behavior Estimation in Couples Therapy Using Deep Sentence Embeddings. SY Tseng, BR Baucom, PG Georgiou Interspeech, 3291-3295, 2017	19	2017
LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models GBM Stan, RY Rohekar, Y Gurwicz, ML Olson, A Bhiwandiwalla, E Aflalo, ... arXiv preprint arXiv:2404.03118, 2024	18	2024
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations SN Chakravarthula, M Nasir, SY Tseng, H Li, TJ Park, B Baucom, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	18	2020
A low power biomedical signal processing system-on-chip design for portable brain-heart monitoring systems WC Fang, CK Chen, E Chua, CC Fu, SY Tseng, S Kang The 2010 International Conference on Green Circuits and Systems, 18-23, 2010	17	2010
Why do llava vision-language models reply to images in english? M Hinck, C Holtermann, ML Olson, F Schneider, S Yu, A Bhiwandiwalla, ... Findings of the Association for Computational Linguistics: EMNLP 2024, 13402 …, 2024	16	2024
Improving video retrieval using multilingual knowledge transfer A Madasu, E Aflalo, G Ben Melech Stan, SY Tseng, G Bertasius, V Lal European Conference on Information Retrieval, 669-684, 2023	16	2023
Predicting behavior in cancer-afflicted patient and spouse interactions using speech and language SN Chakravarthula, H Li, SY Tseng, M Reblin, P Georgiou arXiv preprint arXiv:1908.00908, 2019	14	2019
A wireless biomedical sensor network using IEEE802. 15.4 SY Tseng, CH Tsai, YS Lai, WC Fang 2009 IEEE/NIH Life Science Systems and Applications Workshop, 183-186, 2009	14	2009
Calm: Contrastive aligned audio-language multirate and multimodal representations V Sachidananda, SY Tseng, E Marchi, S Kajarekar, P Georgiou arXiv preprint arXiv:2202.03587, 2022	13	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by