Rishabh Joshi
Google DeepMind, ex-Brain Team
Verified email at google.com
Title
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 6992, Year: 2023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 3439, Year: 2024
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261, 2025
Cited by: 1337, Year: 2025
SLiC-HF: Sequence likelihood calibration with human feedback
Y Zhao, R Joshi, T Liu, M Khalman, M Saleh, PJ Liu
arXiv preprint arXiv:2305.10425, 2023
Cited by: 407, Year: 2023
Aerobic bacterial isolates from burn wound infections and their antibiograms—a five-year study
N Agnihotri, V Gupta, RM Joshi
Burns 30 (3), 241-243, 2004
Cited by: 401, Year: 2004
RESIDE: Improving distantly-supervised neural relation extraction using side information
S Vashishth, R Joshi, SS Prayaga, C Bhattacharyya, P Talukdar
arXiv preprint arXiv:1812.04361, 2018
Cited by: 323, Year: 2018
Statistical rejection sampling improves preference optimization
T Liu, Y Zhao, R Joshi, M Khalman, M Saleh, PJ Liu, J Liu
arXiv preprint arXiv:2309.06657, 2023
Cited by: 310, Year: 2023
Calibrating sequence likelihood improves conditional language generation
Y Zhao, M Khalman, R Joshi, S Narayan, M Saleh, PJ Liu
arXiv preprint arXiv:2210.00045, 2022
Cited by: 176, Year: 2022
LiPO: Listwise preference optimization through learning-to-rank
T Liu, Z Qin, J Wu, J Shen, M Khalman, R Joshi, Y Zhao, M Saleh, ...
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of …, 2025
Cited by: 76, Year: 2025
Improving broad-coverage medical entity linking with semantic type prediction and large-scale datasets
S Vashishth, D Newman-Griffis, R Joshi, R Dutt, CP Rosé
Journal of biomedical informatics 121, 103880, 2021
Cited by: 59, Year: 2021
Human alignment of large language models through online preference optimisation
D Calandriello, D Guo, R Munos, M Rowland, Y Tang, BA Pires, ...
arXiv preprint arXiv:2403.08635, 2024
Cited by: 58, Year: 2024
Building math agents with multi-turn iterative preference learning
W Xiong, C Shi, J Shen, A Rosenberg, Z Qin, D Calandriello, M Khalman, ...
arXiv preprint arXiv:2409.02392, 2024
Cited by: 57, Year: 2024
DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues
R Joshi, V Balachandran, S Vashishth, AW Black, Y Tsvetkov
Proceedings of International Conference on Learning Representations 2021 (ICLR), 2021
Cited by: 48, Year: 2021
Analysing the extent of misinformation in cancer related tweets
R Bal, S Sinha, S Dutta, R Joshi, S Ghosh, R Dutt
Proceedings of the International AAAI Conference on Web and Social Media 14 …, 2020
Cited by: 42, Year: 2020
RRM: Robust reward model training mitigates reward hacking
T Liu, W Xiong, J Ren, L Chen, J Wu, R Joshi, Y Gao, J Shen, Z Qin, T Yu, ...
arXiv preprint arXiv:2409.13156, 2024
Cited by: 41, Year: 2024
Offline regularised reinforcement learning for large language models alignment
PH Richemond, Y Tang, D Guo, D Calandriello, MG Azar, R Rafailov, ...
arXiv preprint arXiv:2405.19107, 2024
Cited by: 30, Year: 2024
RESPER: Computationally modelling resisting strategies in persuasive conversations
R Dutt, S Sinha, R Joshi, SS Chakraborty, M Riggs, X Yan, H Bao, C Rose
Proceedings of the 16th Conference of the European Chapter of the …, 2021
Cited by: 27, Year: 2021
Fully automated sample preparation for pathogen detection performed in a microfluidic cassette
MT Taylor, P Belgrader, R Joshi, GA Kintz, MA Northrup
Micro Total Analysis Systems 2001: Proceedings of the µTAS 2001 Symposium …, 2001
Cited by: 20, Year: 2001
Keeping up appearances: Computational modeling of face acts in persuasion oriented discussions
R Dutt, R Joshi, C Rose
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020
Cited by: 18, Year: 2020
Evolving alignment via asymmetric self-play
Z Ye, R Agarwal, T Liu, R Joshi, S Velury, Q Tan, Y Liu
Cited by: 12, Year: 2024