[go: up one dir, main page]

Follow
Biao Zhang
Biao Zhang
Verified email at google.com
Title
Cited by
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
70112023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
34542024
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261, 2025
14222025
Root mean square layer normalization
B Zhang, R Sennrich
NeurIPS, 2019
14082019
Prompting large language model for machine translation: A case study
B Zhang, B Haddow, A Birch
International Conference on Machine Learning, 41092-41110, 2023
4522023
Improving massively multilingual neural machine translation and zero-shot translation
B Zhang, P Williams, I Titov, R Sennrich
ACL, 2020
4392020
Revisiting low-resource neural machine translation: A case study
R Sennrich, B Zhang
ACL, 2019
3172019
When scaling meets llm finetuning: The effect of data, model and finetuning method
B Zhang, Z Liu, C Cherry, O Firat
arXiv preprint arXiv:2402.17193, 2024
2872024
Variational neural machine translation
B Zhang, D Xiong, J Su, H Duan, M Zhang
EMNLP, 2016
2472016
Direct language model alignment from online ai feedback
S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ...
arXiv preprint arXiv:2402.04792, 2024
246*2024
Many-shot in-context learning
R Agarwal, A Singh, L Zhang, B Bohnet, L Rosias, S Chan, B Zhang, ...
Advances in Neural Information Processing Systems 37, 76930-76966, 2024
2162024
Madlad-400: A multilingual and document-level large audited dataset
S Kudugunta, I Caswell, B Zhang, X Garcia, D Xin, A Kusupati, R Stella, ...
Advances in Neural Information Processing Systems 36, 67284-67296, 2023
2162023
Neural machine translation with GRU-gated attention model
B Zhang, D Xiong, J Xie, J Su
TNNLS 31 (11), 4688-4698, 2020
172*2020
Neural machine translation with deep attention
B Zhang, D Xiong, J Su
TPAMI 42 (1), 154-163, 2018
1442018
Shallow convolutional neural network for implicit discourse relation recognition
B Zhang, J Su, D Xiong, Y Lu, H Duan, J Yao
EMNLP, 2230-2235, 2015
1402015
Accelerating neural transformer via an average attention network
B Zhang, D Xiong, J Su
ACL, 2018
1342018
Improving deep transformer with depth-scaled initialization and merged attention
B Zhang, I Titov, R Sennrich
EMNLP, 2019
1202019
Share or not? learning to schedule language-specific capacity for multilingual translation
B Zhang, A Bapna, R Sennrich, O Firat
ICLR, 2020
1022020
Variational recurrent neural machine translation
J Su, S Wu, D Xiong, Y Lu, X Han, B Zhang
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
952018
SLTUNET: A simple unified model for sign language translation
B Zhang, M Müller, R Sennrich
arXiv preprint arXiv:2305.01778, 2023
922023
The system can't perform the operation now. Try again later.
Articles 1–20