Jongmin Lee
Verified email at yonsei.ac.kr
Title
Cited by
Year
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
J Lee, W Jeon, BJ Lee, J Pineau, KE Kim
ICML, 2021
Cited by: 153 | Year: 2021
DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations
GH Kim, S Seo, J Lee, W Jeon, HJ Hwang, H Yang, KE Kim
International Conference on Learning Representations (ICLR), 2022
Cited by: 128 | Year: 2022
Monte-Carlo Tree Search for Constrained POMDPs
J Lee, GH Kim, P Poupart, KE Kim
NeurIPS, 2018
Cited by: 95 | Year: 2018
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
J Lee, C Paduraru, DJ Mankowitz, N Heess, D Precup, KE Kim, A Guez
International Conference on Learning Representations (ICLR), 2022
Cited by: 90 | Year: 2022
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
Y Jang, J Lee, KE Kim
International Conference on Learning Representations (ICLR), 2022
Cited by: 75 | Year: 2022
Multi-view automatic lip-reading using neural network
D Lee, J Lee, KE Kim
Asian conference on computer vision, 290-302, 2016
Cited by: 73 | Year: 2016
Representation balancing offline model-based reinforcement learning
BJ Lee, J Lee, KE Kim
International Conference on Learning Representations (ICLR), 2021
Cited by: 63 | Year: 2021
Monte-Carlo Tree Search in Continuous Action Spaces with Value Gradients
J Lee, W Jeon, GH Kim, KE Kim
AAAI, 2020
Cited by: 42 | Year: 2020
LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation
GH Kim, J Lee, Y Jang, H Yang, KE Kim
Advances in Neural Information Processing Systems (NeurIPS), 2022
Cited by: 35* | Year: 2022
Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues
Y Jang, J Lee, KE Kim
AAAI, 2020
Cited by: 30 | Year: 2020
Batch Reinforcement Learning with Hyperparameter Gradients
BJ Lee, J Lee, P Vrancx, D Kim, KE Kim
ICML, 2020
Cited by: 25 | Year: 2020
Reinforcement Learning for Control with Multiple Frequencies
J Lee, BJ Lee, KE Kim
Advances in Neural Information Processing Systems (NeurIPS) 33, 2020
Cited by: 22 | Year: 2020
Hierarchically-partitioned Gaussian Process Approximation
BJ Lee, J Lee, KE Kim
Artificial Intelligence and Statistics (AISTATS), 822-831, 2017
Cited by: 21 | Year: 2017
Monte-carlo planning and learning with language action value estimates
Y Jang, S Seo, J Lee, KE Kim
International Conference on Learning Representations (ICLR), 2021
Cited by: 16 | Year: 2021
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
DE Matsunaga, J Lee, J Yoon, S Leonardos, P Abbeel, KE Kim
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023
Cited by: 12 | Year: 2023
PyOpenDial: a python-based domain-independent toolkit for developing spoken dialogue systems with probabilistic rules
Y Jang, J Lee, J Park, KH Lee, P Lison, KE Kim
Proceedings of the 2019 conference on empirical methods in natural language …, 2019
Cited by: 12 | Year: 2019
Constrained Bayesian Reinforcement Learning via Approximate Linear Programming
J Lee, Y Jang, P Poupart, KE Kim
IJCAI, 2088-2095, 2017
Cited by: 12 | Year: 2017
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
H Lee, J Lee, Y Choi, W Jeon, BJ Lee, YK Noh, KE Kim
Advances in Neural Information Processing Systems (NeurIPS), 2022
Cited by: 7 | Year: 2022
SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations
Y Jang, GH Kim, J Lee, S Sohn, B Kim, H Lee, M Lee
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023
Cited by: 6 | Year: 2023
Tempo Adaption in Non-stationary Reinforcement Learning
H Lee, Y Ding, J Lee, M Jin, J Lavaei, S Sojoudi
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023
Cited by: 6 | Year: 2023