| Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models B An, S Zhu, R Zhang, MA Panaitescu-Liess, Y Xu, F Huang arXiv preprint arXiv:2409.00598, 2024 | 39 | 2024 |
| More Context, Less Distraction: Zero-shot Visual Classification by Inferring and Conditioning on Contextual Attributes B An, S Zhu, MA Panaitescu-Liess, CK Mummadi, F Huang The Twelfth International Conference on Learning Representations, 2024 | 36* | 2024 |
| Qu-ANTI-zation: Exploiting Quantization Artifacts for Achieving Adversarial Outcomes S Hong, MA Panaitescu-Liess, Y Kaya, T Dumitras Advances in Neural Information Processing Systems 34, 9303-9316, 2021 | 29 | 2021 |
| Can watermarking large language models prevent copyrighted text generation and hide training data? MA Panaitescu-Liess, Z Che, B An, Y Xu, P Pathmanathan, S Chakraborty, ... Proceedings of the AAAI Conference on Artificial Intelligence 39 (23), 25002 …, 2025 | 23* | 2025 |
| Self-supervised representation learning on document images A Cosma, M Ghidoveanu, M Panaitescu-Liess, M Popescu International Workshop on Document Analysis Systems, 103-117, 2020 | 18 | 2020 |
| AdvBDGen: A Robust Framework for Generating Adaptive and Stealthy Backdoors in LLM Alignment Attacks P Pathmanathan, UM Sehwag, MA Panaitescu-Liess, F Huang ICLR 2025 Workshop on Building Trust in Language Models and Applications | 4* | |
| PoisonedParrot: Subtle Data Poisoning Attacks to Elicit Copyright-Infringing Content from Large Language Models MA Panaitescu-Liess, P Pathmanathan, Y Kaya, Z Che, B An, S Zhu, ... NeurIPS Safe Generative AI Workshop 2024 | 3* | |
| RAGPart & RAGMask: Retrieval-Stage Defenses Against Corpus Poisoning in Retrieval-Augmented Generation P Pathmanathan, MA Panaitescu-Liess, CYJ Chiang, F Huang arXiv preprint arXiv:2512.24268, 2025 | 1 | 2025 |
| Like Oil and Water: Group Robustness Methods and Poisoning Defenses Don’t Mix MA Panaitescu-Liess, Y Kaya, S Zhu, F Huang, T Dumitras The Twelfth International Conference on Learning Representations, 2024 | | 2024 |
| Practical Memorization Tests for Detecting Copyrighted Data in Large Language Models MA Panaitescu-Liess, A Palnitkar, A Kambhamettu, Y Kaya, D Brown, ... | | |