[go: up one dir, main page]

Follow
Wesley A Suttle
Wesley A Suttle
Distinguished Postdoctoral Fellow, U.S. Army Research Laboratory
Verified email at army.mil
Title
Cited by
Cited by
Year
A multi-agent off-policy actor-critic algorithm for distributed reinforcement learning
WA Suttle, Z Yang, K Zhang, Z Wang, T Başar, J Liu
IFAC-PapersOnLine 53 (2), 1549-1554, 2020
922020
Aime: Ai system optimization via multiple llm evaluators
B Patel, S Chakraborty, WA Suttle, M Wang, AS Bedi, D Manocha
arXiv preprint arXiv:2410.03131, 2024
272024
Beyond exponentially fast mixing in average-reward reinforcement learning via multi-level monte carlo actor-critic
WA Suttle, A Bedi, B Patel, BM Sadler, A Koppel, D Manocha
International Conference on Machine Learning, 33240-33267, 2023
212023
Lancar: Leveraging language for context-aware robot locomotion in unstructured environments
CL Shek, X Wu, WA Suttle, C Busart, E Zaroukian, D Manocha, P Tokekar, ...
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024
202024
Reinforcement learning for cost-aware Markov decision processes
WA Suttle, K Zhang, Z Yang, J Liu, D Kraemer
International Conference on Machine Learning, 9989-9999, 2021
142021
Deceptive path planning via reinforcement learning with graph neural networks
MY Fatemi, WA Suttle, BM Sadler
International Conference on Autonomous Agents and Multi-agent Systems, 2258-2260, 2024
72024
Sampling-based safe reinforcement learning for nonlinear dynamical systems
W Suttle, VK Sharma, KC Kosaraju, S Seetharaman, J Liu, V Gupta, ...
International Conference on Artificial Intelligence and Statistics, 4420-4428, 2024
62024
Reinforcement learning based distributed control of dissipative networked systems
KC Kosaraju, S Sivaranjani, W Suttle, V Gupta, J Liu
IEEE Transactions on Control of Network Systems 9 (2), 856-866, 2021
62021
PIPER: Primitive-informed preference-based hierarchical reinforcement learning via hindsight relabeling
U Singh, WA Suttle, BM Sadler, VP Namboodiri, AS Bedi
International Conference on Machine Learning (ICML), 2024
52024
Global optimality without mixing time oracles in average-reward rl via multi-level actor-critic
B Patel, WA Suttle, A Koppel, V Aggarwal, BM Sadler, AS Bedi, ...
CoRR, 2024
52024
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
WA Suttle, A Suresh, C Nieto-Granda
International Conference on Learning Representations (ICLR), 2025
42025
Occupancy information ratio: Infinite-horizon, information-directed, parameterized policy search
WA Suttle, A Koppel, J Liu
SIAM Journal on Control and Optimization 62 (6), 3145-3171, 2024
42024
Towards global optimality for practical average reward reinforcement learning without mixing time oracles
B Patel, WA Suttle, A Koppel, V Aggarwal, BM Sadler, AS Bedi, ...
International Conference on Machine Learning, 39889 - 39907, 2024
42024
Ada-nav: Adaptive trajectory-based sample efficient policy learning for robotic navigation
B Patel, K Weerakoon, WA Suttle, A Koppel, BM Sadler, AS Bedi, ...
arXiv preprint arXiv:2306.06192, 2023
32023
Value of Information-based Deceptive Path Planning Under Adversarial Interventions
WA Suttle, J Milzman, MO Karabag, BM Sadler, U Topcu
2025 IEEE Conference on Decision and Control (CDC), 2025
22025
A convergence result for regularized actor-critic methods
W Suttle, Z Yang, K Zhang, J Liu
arXiv preprint arXiv:1907.06138, 2019
22019
Scalable natural policy gradient for general-sum linear quadratic games with known parameters
MM Shibl, WA Suttle, V Gupta
Proceedings of Machine Learning Research vol 283, 1-14, 2025
12025
Policy Gradient for Ratio Optimization: A Case Study
WA Suttle, A Koppel, J Liu
2022 56th Annual Conference on Information Sciences and Systems (CISS), 281-286, 2022
12022
IMAS: Joint Agent Selection and Information-Theoretic Coordinated Perception In Dec-POMDPs
C Shi, WA Suttle, M Dorothy, J Fu
arXiv preprint arXiv:2510.20009, 2025
2025
Deceptive Exploration in Multi-armed Bandits
IA Vurankaya, MO Karabag, WA Suttle, J Milzman, D Fridovich-Keil, ...
arXiv preprint arXiv:2510.08794, 2025
2025
The system can't perform the operation now. Try again later.
Articles 1–20