[go: up one dir, main page]

Follow
Shao-Yen Tseng
Shao-Yen Tseng
Intel Labs
Verified email at intel.com - Homepage
Title
Cited by
Cited by
Year
xgen-mm (blip-3): A family of open large multimodal models
L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ...
arXiv preprint arXiv:2408.08872, 2024
1892024
Vl-interpret: An interactive visualization tool for interpreting vision-language transformers
E Aflalo, M Du, SY Tseng, Y Liu, C Wu, N Duan, V Lal
Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2022
822022
Ldm3d: Latent diffusion model for 3d
GBM Stan, D Wofk, S Fox, A Redden, W Saxton, J Yu, E Aflalo, SY Tseng, ...
arXiv preprint arXiv:2305.10853, 2023
712023
LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models
G Ben Melech Stan, R Yehezkel Rohekar, Y Gurwicz, ML Olson, ...
arXiv e-prints, arXiv: 2404.03118, 2024
572024
Kd-vlp: Improving end-to-end vision-and-language pretraining with object knowledge distillation
Y Liu, C Wu, S Tseng, V Lal, X He, N Duan
Findings of the Association for Computational Linguistics: NAACL 2022, 1589-1600, 2022
382022
Design of heart rate variability processor for portable 3-lead ECG monitoring system-on-chip
WC Fang, HC Huang, SY Tseng
Expert Systems with Applications 40 (5), 1491-1504, 2013
382013
Advanced ECG processor with HRV analysis for real-time portable health monitoring
CC Chou, SY Tseng, E Chua, YC Lee, WC Fang, HC Huang
2011 IEEE International Conference on Consumer Electronics-Berlin (ICCE …, 2011
292011
Multimodal embeddings from language models for emotion recognition in the wild
SY Tseng, S Narayanan, P Georgiou
IEEE Signal Processing Letters 28, 608-612, 2021
252021
Multiple instance deep learning for weakly supervised small-footprint audio event detection
SY Tseng, J Li, Y Wang, J Szurley, F Metze, S Das
arXiv preprint arXiv:1712.09673, 2017
252017
Couples Behavior Modeling and Annotation Using Low-Resource LSTM Language Models.
SY Tseng, SN Chakravarthula, BR Baucom, PG Georgiou
Interspeech, 898-902, 2016
222016
Llava-gemma: Accelerating multimodal foundation models with a compact language model
M Hinck, ML Olson, D Cobbley, SY Tseng, V Lal
arXiv preprint arXiv:2404.01331, 2024
212024
Approaching Human Performance in Behavior Estimation in Couples Therapy Using Deep Sentence Embeddings.
SY Tseng, BR Baucom, PG Georgiou
Interspeech, 3291-3295, 2017
192017
LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models
GBM Stan, RY Rohekar, Y Gurwicz, ML Olson, A Bhiwandiwalla, E Aflalo, ...
arXiv preprint arXiv:2404.03118, 2024
182024
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations
SN Chakravarthula, M Nasir, SY Tseng, H Li, TJ Park, B Baucom, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
182020
A low power biomedical signal processing system-on-chip design for portable brain-heart monitoring systems
WC Fang, CK Chen, E Chua, CC Fu, SY Tseng, S Kang
The 2010 International Conference on Green Circuits and Systems, 18-23, 2010
172010
Why do llava vision-language models reply to images in english?
M Hinck, C Holtermann, ML Olson, F Schneider, S Yu, A Bhiwandiwalla, ...
Findings of the Association for Computational Linguistics: EMNLP 2024, 13402 …, 2024
162024
Improving video retrieval using multilingual knowledge transfer
A Madasu, E Aflalo, G Ben Melech Stan, SY Tseng, G Bertasius, V Lal
European Conference on Information Retrieval, 669-684, 2023
162023
Predicting behavior in cancer-afflicted patient and spouse interactions using speech and language
SN Chakravarthula, H Li, SY Tseng, M Reblin, P Georgiou
arXiv preprint arXiv:1908.00908, 2019
142019
A wireless biomedical sensor network using IEEE802. 15.4
SY Tseng, CH Tsai, YS Lai, WC Fang
2009 IEEE/NIH Life Science Systems and Applications Workshop, 183-186, 2009
142009
Calm: Contrastive aligned audio-language multirate and multimodal representations
V Sachidananda, SY Tseng, E Marchi, S Kajarekar, P Georgiou
arXiv preprint arXiv:2202.03587, 2022
132022
The system can't perform the operation now. Try again later.
Articles 1–20