Krishnamurthy Dvijotham
Google DeepMind
Verified email at cs.washington.edu
Title
Cited by
Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 3439, Year: 2024
On the effectiveness of interval bound propagation for training verifiably robust models
S Gowal, K Dvijotham, R Stanforth, R Bunel, C Qin, J Uesato, ...
arXiv preprint arXiv:1810.12715, 2018
Cited by: 696, Year: 2018
Safe exploration in continuous action spaces
G Dalal, K Dvijotham, M Vecerik, T Hester, C Paduraru, Y Tassa
arXiv preprint arXiv:1801.08757, 2018
Cited by: 675, Year: 2018
A Dual Approach to Scalable Verification of Deep Networks.
K Dvijotham, R Stanforth, S Gowal, TA Mann, P Kohli
UAI 1 (2), 3, 2018
Cited by: 511, Year: 2018
Adversarial robustness through local linearization
C Qin, J Martens, S Gowal, D Krishnan, K Dvijotham, A Fawzi, S De, ...
Advances in neural information processing systems 32, 2019
Cited by: 389, Year: 2019
A fine-grained analysis on distribution shift
O Wiles, S Gowal, F Stimberg, S Alvise-Rebuffi, I Ktena, K Dvijotham, ...
arXiv preprint arXiv:2110.11328, 2021
Cited by: 321, Year: 2021
Real-time optimal power flow
Y Tang, K Dvijotham, S Low
IEEE Transactions on Smart Grid 8 (6), 2963-2973, 2017
Cited by: 305, Year: 2017
Scalable verified training for provably robust image classification
S Gowal, KD Dvijotham, R Stanforth, R Bunel, C Qin, J Uesato, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 245, Year: 2019
Inverse optimal control with linearly-solvable MDPs
K Dvijotham, E Todorov
Proceedings of the 27th International conference on machine learning (ICML …, 2010
Cited by: 224, Year: 2010
Achieving verified robustness to symbol substitutions via interval bound propagation
PS Huang, R Stanforth, J Welbl, C Dyer, D Yogatama, S Gowal, ...
arXiv preprint arXiv:1909.01492, 2019
Cited by: 207, Year: 2019
Training verified learners with learned verifiers
K Dvijotham, S Gowal, R Stanforth, R Arandjelovic, B O'Donoghue, ...
arXiv preprint arXiv:1805.10265, 2018
Cited by: 202, Year: 2018
Rich human feedback for text-to-image generation
Y Liang, J He, G Li, P Li, A Klimovskiy, N Carolan, J Sun, J Pont-Tuset, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 179, Year: 2024
Stealing part of a production language model
N Carlini, D Paleka, KD Dvijotham, T Steinke, J Hayase, AF Cooper, ...
arXiv preprint arXiv:2403.06634, 2024
Cited by: 159, Year: 2024
Learning optimal conformal classifiers
D Stutz, AT Cemgil, A Doucet
arXiv preprint arXiv:2110.09192, 2021
Cited by: 145, Year: 2021
Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming
S Dathathri, K Dvijotham, A Kurakin, A Raghunathan, J Uesato, RR Bunel, ...
Advances in Neural Information Processing Systems 33, 5318-5331, 2020
Cited by: 145, Year: 2020
Enhancing the reliability and accuracy of AI-enabled diagnosis via complementarity-driven deferral to clinicians
K Dvijotham, J Winkens, M Barsbey, S Ghaisas, R Stanforth, N Pawlowski, ...
Nature Medicine 29 (7), 1814-1820, 2023
Cited by: 130, Year: 2023
Helping or herding? reward model ensembles mitigate but do not eliminate reward hacking
J Eisenstein, C Nagpal, A Agarwal, A Beirami, A D'Amour, DJ Dvijotham, ...
arXiv preprint arXiv:2312.09244, 2023
Cited by: 126, Year: 2023
The autoencoding variational autoencoder
T Cemgil, S Ghaisas, K Dvijotham, S Gowal, P Kohli
Advances in Neural Information Processing Systems 33, 15077-15087, 2020
Cited by: 118, Year: 2020
Rigorous agent evaluation: An adversarial approach to uncover catastrophic failures
J Uesato, A Kumar, C Szepesvari, T Erez, A Ruderman, K Anderson, ...
arXiv preprint arXiv:1812.01647, 2018
Cited by: 105, Year: 2018
Interactive concept bottleneck models
K Chauhan, R Tiwari, J Freyberg, P Shenoy, K Dvijotham
Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 5948-5955, 2023
Cited by: 97, Year: 2023