[go: up one dir, main page]

Follow
Ziru Chen
Title
Cited by
Cited by
Year
Automatic evaluation of attribution by large language models
X Yue, B Wang, Z Chen, K Zhang, Y Su, H Sun
arXiv preprint arXiv:2305.06311, 2023
1422023
Scienceagentbench: Toward rigorous assessment of language agents for data-driven scientific discovery
Z Chen, S Chen, Y Ning, Q Zhang, B Wang, B Yu, Y Li, Z Liao, C Wei, Z Lu, ...
arXiv preprint arXiv:2410.05080, 2024
1022024
Exploring chain-of-thought style prompting for text-to-sql
CY Tai, Z Chen, T Zhang, X Deng, H Sun
arXiv preprint arXiv:2305.14215, 2023
942023
When is tree search useful for llm planning? it depends on the discriminator
Z Chen, M White, R Mooney, A Payani, Y Su, H Sun
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
472024
ecellm: Generalizing large language models for e-commerce from large-scale, high-quality instruction data
B Peng, X Ling, Z Chen, H Sun, X Ning
arXiv preprint arXiv:2402.08831, 2024
462024
Text-to-SQL error correction with language models of code
Z Chen, S Chen, M White, R Mooney, A Payani, J Srinivasa, Y Su, H Sun
arXiv preprint arXiv:2305.13073, 2023
272023
Error detection for text-to-sql semantic parsing
S Chen, Z Chen, H Sun, Y Su
arXiv preprint arXiv:2305.13683, 2023
172023
Tooling or not tooling? the impact of tools on language agents for chemistry problem solving
B Yu, FN Baker, Z Chen, G Herb, B Gou, D Adu-Ampratwum, X Ning, ...
Findings of the Association for Computational Linguistics: NAACL 2025, 7620-7640, 2025
132025
Holistic agent leaderboard: The missing infrastructure for ai agent evaluation
S Kapoor, B Stroebl, P Kirgis, N Nadgir, ZS Siegel, B Wei, T Xue, Z Chen, ...
arXiv preprint arXiv:2510.11977, 2025
42025
GeoAnalystBench: A GeoAI benchmark for assessing large language models for spatial analysis workflow and code generation
Q Zhang, S Gao, C Wei, Y Zhao, Y Nie, Z Chen, S Chen, Y Su, H Sun
Transactions in GIS 29 (7), e70135, 2025
32025
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
Y Li, HN Moussa, Z Chen, S Chen, B Yu, M Xue, B Burns, TY Chiu, V Dey, ...
arXiv preprint arXiv:2506.08140, 2025
32025
Roll up your sleeves: Working with a collaborative and engaging task-oriented dialogue system
L Mo, S Chen, Z Chen, X Deng, A Lewis, S Singh, S Stevens, CY Tai, ...
arXiv preprint arXiv:2307.16081, 2023
32023
Large Language Models Achieve Gold Medal Performance at the International Olympiad on Astronomy & Astrophysics (IOAA)
LCD Pinheiro, Z Chen, BC Piazza, N Shroff, Y Liang, YS Ting, H Sun
arXiv preprint arXiv:2510.05016, 2025
22025
Bootstrapping a User-Centered Task-Oriented Dialogue System
S Chen, Z Chen, X Deng, A Lewis, L Mo, S Stevens, Z Wang, X Yue, ...
arXiv preprint arXiv:2207.05223, 2022
22022
Error Detection for Interactive Text-to-SQL Semantic Parsing
S Chen, Z Chen, H Sun, Y Su
2*
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
Y Song, K Ramaneti, Z Sheikh, Z Chen, B Gou, T Xie, Y Xu, D Zhang, ...
arXiv preprint arXiv:2510.24702, 2025
12025
SalsaBot: Towards a Robust and Generalizable Embodied Agent
CH Song, J Wu, JS Byeon, Z Xu, V Pahuja, G Bajaj, S Stevens, Z Chen, ...
Towards a Robust and Generalizable Embodied Agent
CH Song, J Wu, JS Byeon, Z Xu, V Pahuja, G Bajaj, S Stevens, Z Chen, ...
The system can't perform the operation now. Try again later.
Articles 1–18