[go: up one dir, main page]

Follow
Alex Robey
Alex Robey
Thinking Machines Lab
Verified email at thinkingmachines.ai - Homepage
Title
Cited by
Cited by
Year
Jailbreaking Black Box Large Language Models in Twenty Queries
P Chao, A Robey, E Dobriban, H Hassani, GJ Pappas, E Wong
SaTML 2025, 2023
11772023
Efficient and Accurate Estimation of Lipschitz Constants for Deep Neural Networks
M Fazlyab, A Robey, H Hassani, M Morari, G Pappas
NeurIPS 2019, 2019
6802019
SmoothLLM: Defending Large Language Models against Jailbreaking Attacks
A Robey, E Wong, H Hassani, GJ Pappas
TMLR 2025, 2023
4802023
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
P Chao*, E Debenedetti*, A Robey*, M Andriushchenko*, F Croce, ...
NeurIPS 2024, 2024
4712024
Learning Control Barrier Functions from Expert Demonstrations
A Robey*, H Hu*, L Lindemann, H Zhang, DV Dimarogonas, S Tu, ...
CDC 2020, 2020
3322020
Model-Based Domain Generalization
A Robey, GJ Pappas, H Hassani
NeurIPS 2021, 2021
1782021
Probable Domain Generalization via Quantile Risk Minimization
C Eastwood*, A Robey*, S Singh, J von Kügelgen, H Hassani, GJ Pappas, ...
NeurIPS 2022, 2022
982022
Model-Based Robust Deep Learning: Generalizing to Natural, Out-of-Distribution Data
A Robey, H Hassani, GJ Pappas
arXiv, 2020
922020
Learning Hybrid Control Barrier Functions from Data
L Lindemann, H Hu, A Robey, H Zhang, DV Dimarogonas, S Tu, N Matni
CoRL 2020, 2020
902020
Provable Tradeoffs in Adversarially Robust Classification
E Dobriban, H Hassani, D Hong, A Robey
Transactions on Information Theory 2023, 2020
862020
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
J Ji*, B Hou*, A Robey*, GJ Pappas, H Hassani, Y Zhang, E Wong, ...
IJCNLP-AACL 2025, 2025
772025
A Safe Harbor for AI Evaluation and Red Teaming
S Longpre, S Kapoor, K Klyman, A Ramaswami, R Bommasani, ...
ICML 2024, 2024
72*2024
Learning Robust Output Control Barrier Functions from Safe Expert Demonstrations
L Lindemann, A Robey, L Jiang, S Das, S Tu, N Matni
OJCSYS 2024, 2021
722021
Jailbreaking LLM-Controlled Robots
A Robey, Z Ravichandran, V Kumar, H Hassani, GJ Pappas
ICRA 2025, 2024
652024
On the Sample Complexity of Stability Constrained Imitation Learning
S Tu, A Robey, T Zhang, N Matni
L4DC 2022, 2022
64*2022
Adversarial Robustness with Semi-Infinite Constrained Learning
A Robey*, L Chamon*, GJ Pappas, H Hassani, A Ribeiro
NeurIPS 2021, 2021
572021
Probabilistically Robust Learning: Balancing Average- and Worst-case Performance
A Robey, LFO Chamon, GJ Pappas, H Hassani
ICML 2022, 2022
552022
Learning Robust Hybrid Control Barrier Functions for Uncertain Systems
A Robey*, L Lindemann*, S Tu, N Matni
ADHS 2021, 2021
502021
Toward Certified Robustness Against Real-World Distribution Shifts
H Wu*, T Tagomori*, A Robey*, F Yang*, N Matni, G Pappas, H Hassani, ...
SaTML 2023, 2022
362022
Optimal Algorithms for Submodular Maximization With Distributed Constraints
A Robey, A Adibi, B Schlotfeldt, H Hassani, GJ Pappas
L4DC 2021, 2021
262021
The system can't perform the operation now. Try again later.
Articles 1–20