Soroush Pour
Harmony Intelligence
Verified email at soroushjp.com
Title
Cited by
Year
Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation
R Shah, Q Feuillade--Montixi, S Pour, A Tagade, S Casper, J Rando
arXiv preprint arXiv:2311.03348, 2023
Cited by: 184; Year: 2023
The AI Risk Repository: A Comprehensive Meta-Review, Database, and Taxonomy of Risks From Artificial Intelligence
P Slattery, AK Saeri, EAC Grundy, J Graham, M Noetel, R Uuk, J Dao, ...
arXiv preprint arXiv:2408.12622, 2024
Cited by: 133; Year: 2024