Hadi Khalaf
Verified email at g.harvard.edu
Title
Cited by
Year
AI Alignment at Your Discretion
M Buyl*, H Khalaf*, C Mayrink Verdun*, L Monteiro Paes*, ...
ACM Conference on Fairness, Accountability, and Transparency, 2025
Cited by: 11, Year: 2025
Inference-Time Reward Hacking in Large Language Models
H Khalaf, CM Verdun, A Oesterling, H Lakkaraju, FP Calmon
NeurIPS 2025, 2025
Cited by: 3, Year: 2025