[go: up one dir, main page]

Follow
Richard He Bai
Richard He Bai
Other namesRichard Bai, He Bai
Research Scientist, Apple Machine Learning Research
Verified email at apple.com - Homepage
Title
Cited by
Cited by
Year
Rephrasing the web: A recipe for compute and data-efficient language modeling
P Maini, S Seto, H Bai, D Grangier, Y Zhang, N Jaitly
ACL 2024, 2024
972024
AT: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
H Bai, R Zheng, J Chen, M Ma, X Li, L Huang
International Conference on Machine Learning, 1399-1411, 2022
612022
Xricl: Cross-lingual retrieval-augmented in-context learning for cross-lingual text-to-sql semantic parsing
P Shi, R Zhang, H Bai, J Lin
EMNLP 2022, 2022
542022
Cross-lingual training of dense retrievers for document retrieval
P Shi, R Zhang, H Bai, J Lin
Proceedings of the 1st Workshop on Multilingual Representation Learning, 251-253, 2021
36*2021
Divide-or-Conquer? Which Part Should You Distill Your LLM?
Z Wu, H Bai, A Zhang, J Gu, VG Vydiswaran, N Jaitly, Y Zhang
EMNLP 2024, 2024
352024
Segatron: Segment-Aware Transformer for Language Modeling and Understanding
H Bai, P Shi, J Lin, Y Xie, L Tan, K Xiong, W Gao, M Li
AAAI 2021, 2020
31*2020
How Far Are We from Intelligent Visual Deductive Reasoning?
Y Zhang, H Bai, R Zhang, J Gu, S Zhai, J Susskind, N Jaitly
COLM 2024, 2024
292024
Cross-lingual training of neural models for document ranking
P Shi, H Bai, J Lin
Findings of the Association for Computational Linguistics: EMNLP 2020, 2768-2773, 2020
282020
Better language model with hypernym class prediction
H Bai, T Wang, A Sordoni, P Shi
ACL 2022, 2022
192022
Source-Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language
H Bai, Y Zhou, J Zhang, L Zhao, MY Hwang, C Zong
COLING 2018, 2018
162018
dmel: Speech tokenization made simple
RH Bai, T Likhomanenko, R Zhang, Z Gu, Z Aldeneh, N Jaitly
arXiv preprint arXiv:2407.15835, 2024
122024
KGLens: A Parameterized Knowledge Graph Solution to Assess What an LLM Does and Doesn’t Know
S Zheng, H Bai, Y Zhang, Y Su, X Niu, N Jaitly
arXiv preprint arXiv:2312.11539, 2023
10*2023
Memory Consolidation for Contextual Spoken Language Understanding with Dialogue Logistic Inference
H Bai, Y Zhou, J Zhang, C Zong
ACL 2019, 2019
102019
Denoising lm: Pushing the limits of error correction models for speech recognition
Z Gu, T Likhomanenko, H Bai, E McDermott, R Collobert, N Jaitly
arXiv preprint arXiv:2405.15216, 2024
92024
Semantics of the unwritten: The effect of end of paragraph and sequence tokens on text generation with GPT2
H Bai, P Shi, J Lin, L Tan, K Xiong, W Gao, J Liu, M Li
ACL 2021, 2020
9*2020
Construction of Paired Knowledge Graph-Text Datasets Informed by Cyclic Evaluation
A Mousavi, X Zhan, H Bai, P Shi, T Rekatsinas, B Han, Y Li, J Pound, ...
COLING 2024, 2023
72023
Cross-lingual text-to-SQL semantic parsing with representation mixup
P Shi, L Song, L Jin, H Mi, RH Bai, J Lin, D Yu
Findings of the Association for Computational Linguistics: EMNLP 2022, 5296-5306, 2022
62022
Rephrasing the web: A recipe for compute and data-efficient language modeling, 2024
P Maini, S Seto, H Bai, D Grangier, Y Zhang, N Jaitly
URL https://arxiv. org/abs/2401.16380, 0
5
Visatronic: A multimodal decoder-only model for speech synthesis
A Gupta, T Likhomanenko, KD Yang, RH Bai, Z Aldeneh, N Jaitly
arXiv preprint arXiv:2411.17690, 2024
32024
Training bilingual lms with data constraints in the targeted language
S Seto, M Ter Hoeve, RH Bai, N Schluter, D Grangier
Findings of the Association for Computational Linguistics: ACL 2025, 19096-19122, 2025
22025
The system can't perform the operation now. Try again later.
Articles 1–20