[go: up one dir, main page]

Follow
Ilya Kostrikov
Ilya Kostrikov
OpenAI
Verified email at openai.com - Homepage
Title
Cited by
Cited by
Year
Gpt-4o system card
A Hurst, A Lerer, AP Goucher, A Perelman, A Ramesh, A Clark, AJ Ostrow, ...
arXiv preprint arXiv:2410.21276, 2024
36852024
Openai o1 system card
A Jaech, A Kalai, A Lerer, A Richardson, A El-Kishky, A Low, A Helyar, ...
arXiv preprint arXiv:2412.16720, 2024
15182024
Offline reinforcement learning with implicit q-learning
I Kostrikov, A Nair, S Levine
arXiv preprint arXiv:2110.06169, 2021
14332021
Image augmentation is all you need: Regularizing deep reinforcement learning from pixels
I Kostrikov*, D Yarats*, R Fergus
arXiv preprint arXiv:2004.13649, 2020
1094*2020
Training diffusion models with reinforcement learning
K Black, M Janner, Y Du, I Kostrikov, S Levine
arXiv preprint arXiv:2305.13301, 2023
6742023
Planet-photo geolocation with convolutional neural networks
T Weyand, I Kostrikov, J Philbin
European conference on computer vision, 37-55, 2016
6232016
Improving sample efficiency in model-free reinforcement learning from images
D Yarats, A Zhang, I Kostrikov, B Amos, J Pineau, R Fergus
Proceedings of the aaai conference on artificial intelligence 35 (12), 10674 …, 2021
5942021
Intrinsic motivation and automatic curricula via asymmetric self-play
S Sukhbaatar, Z Lin, I Kostrikov, G Synnaeve, A Szlam, R Fergus
arXiv preprint arXiv:1703.05407, 2017
4862017
Discriminator-actor-critic: Addressing sample inefficiency and reward bias in adversarial imitation learning
I Kostrikov, KK Agrawal, D Dwibedi, S Levine, J Tompson
arXiv preprint arXiv:1809.02925, 2018
4082018
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
I Kostrikov, J Tompson, R Fergus, O Nachum
arXiv preprint arXiv:2103.08050, 2021
4072021
Efficient Online Reinforcement Learning with Offline Data
PJ Ball*, L Smith*, I Kostrikov*, S Levine
arXiv preprint arXiv:2302.02948, 2023
3192023
Automatic data augmentation for generalization in deep reinforcement learning
R Raileanu, M Goldstein, D Yarats, I Kostrikov, R Fergus
arXiv preprint arXiv:2006.12862, 2020
303*2020
Algaedice: Policy gradient from arbitrary experience
O Nachum, B Dai, I Kostrikov, Y Chow, L Li, D Schuurmans
arXiv preprint arXiv:1912.02074, 2019
3002019
Imitation learning via off-policy distribution matching
I Kostrikov, O Nachum, J Tompson
arXiv preprint arXiv:1912.05032, 2019
2792019
Rvs: What is essential for offline rl via supervised learning?
S Emmons, B Eysenbach, I Kostrikov, S Levine
arXiv preprint arXiv:2112.10751, 2021
2762021
Pytorch implementations of reinforcement learning algorithms
I Kostrikov
GitHub repository: https://github.com/ikostrikov/pytorch-a2c-ppo-acktr-gail, 2018
2582018
Idql: Implicit q-learning as an actor-critic method with diffusion policies
P Hansen-Estruch, I Kostrikov, M Janner, JG Kuba, S Levine
arXiv preprint arXiv:2304.10573, 2023
2332023
An efficient convolutional network for human pose estimation.
U Rafi, B Leibe, J Gall, I Kostrikov
BMVC 1, 2, 2016
1882016
A walk in the park: Learning to walk in 20 minutes with model-free reinforcement learning
L Smith*, I Kostrikov*, S Levine
arXiv preprint arXiv:2208.07860, 2022
187*2022
Offline rl for natural language generation with implicit language q learning
C Snell, I Kostrikov, Y Su, M Yang, S Levine
arXiv preprint arXiv:2206.11871, 2022
1652022
The system can't perform the operation now. Try again later.
Articles 1–20