Ruo Yu Tao
Other names: David Tao
Verified email at brown.edu
Title
Cited by
Year
Textworld: A learning environment for text-based games
MA Côté, Á Kádár, X Yuan, B Kybartas, T Barnes, E Fine, J Moore, ...
arXiv preprint arXiv:1806.11532, 2018
Cited by: 440, Year: 2018
Novelty Search in representational space for sample efficient exploration
RY Tao, V François-Lavet, J Pineau
Advances in Neural Information Processing Systems 33, 2020
Cited by: 64, Year: 2020
Towards solving text-based games by producing adaptive action spaces
RY Tao, MA Côté, X Yuan, LE Asri
arXiv preprint arXiv:1812.00855, 2018
Cited by: 16, Year: 2018
Measuring and mitigating interference in reinforcement learning
V Liu, H Wang, RY Tao, K Javed, A White, M White
Conference on Lifelong Learning Agents, 781-795, 2023
Cited by: 9, Year: 2023
Mitigating partial observability in sequential decision processes via the lambda discrepancy
C Allen, A Kirtland, RY Tao, S Lobel, D Scott, N Petrocelli, O Gottesman, ...
Advances in Neural Information Processing Systems 37, 62988-63028, 2024
Cited by: 7, Year: 2024
Agent-state construction with auxiliary inputs
RY Tao, A White, MC Machado
arXiv preprint arXiv:2211.07805, 2022
Cited by: 5, Year: 2022
Benchmarking partial observability in reinforcement learning with a suite of memory-improvable domains
RY Tao, K Guo, C Allen, G Konidaris
arXiv preprint arXiv:2508.00046, 2025
Cited by: 3, Year: 2025
General value discrepancies mitigate partial observability in reinforcement learning
P Koepernik, RY Tao, R Parr, G Konidaris, C Allen
Finding the Frame Workshop at RLC 2025, 2025
Cited by: 1, Year: 2025
Spectral Collapse Drives Loss of Plasticity in Deep Continual Learning
N He, K Guo, A Prakash, S Tiwari, RY Tao, T Serapio, A Greenwald, ...
arXiv preprint arXiv:2509.22335, 2025
Year: 2025
RL: Generic reinforcement learning codebase in TensorFlow
BM Li, A Cowen-Rivers, P Kozakowski, D Tao, SR Kamalakara, ...
Journal of Open Source Software 4 (42), 1524, 2019
Year: 2019
Robust Linear Reinforcement Learning
S Lobel, RY Tao, T Akbulut
Resolving Partial Observability in Decision Processes via the Lambda Discrepancy
C Allen, AT Kirtland, RY Tao, D Scott, S Lobel, N Petrocelli, O Gottesman, ...