[go: up one dir, main page]

Follow
Junlin Wu
Title
Cited by
Cited by
Year
RLHFPoison: Reward poisoning attack for reinforcement learning with human feedback in large language models
J Wang, J Wu, M Chen, Y Vorobeychik, C Xiao
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
51*2024
Neural lyapunov control for discrete-time systems
J Wu, A Clark, Y Kantaros, Y Vorobeychik
Advances in neural information processing systems 36, 2939-2955, 2023
462023
Axioms for ai alignment from human feedback
L Ge, D Halpern, E Micha, AD Procaccia, I Shapira, Y Vorobeychik, J Wu
Advances in Neural Information Processing Systems 37, 80439-80465, 2024
452024
Robust deep reinforcement learning through bootstrapped opportunistic curriculum
J Wu, Y Vorobeychik
International Conference on Machine Learning, 24177-24211, 2022
372022
Exact verification of relu neural control barrier functions
H Zhang, J Wu, Y Vorobeychik, A Clark
Advances in neural information processing systems 36, 5685-5705, 2023
362023
Preference poisoning attacks on reward model learning
J Wu, J Wang, C Xiao, C Wang, N Zhang, Y Vorobeychik
2025 IEEE Symposium on Security and Privacy (SP), 1622-1640, 2025
152025
Manipulating Elections by Changing Voter Perceptions
J Wu, A Estornell, L Kong, Y Vorobeychik
Proceedings of the Thirty-First International Joint Conference on Artificial …, 2022
82022
Verified safe reinforcement learning for neural network dynamic models
J Wu, H Zhang, Y Vorobeychik
Advances in Neural Information Processing Systems 37, 117762-117783, 2024
72024
Certifying safety in reinforcement learning under adversarial perturbation attacks
J Wu, H Sibai, Y Vorobeychik
2024 IEEE Security and Privacy Workshops (SPW), 57-67, 2024
32024
Learning generative deception strategies in combinatorial masking games
J Wu, C Kamhoua, M Kantarcioglu, Y Vorobeychik
International Conference on Decision and Game Theory for Security, 98-117, 2021
32021
From Personal to Collective: On the Role of Local and Global Memory in LLM Personalization
Z Wang, J Wu, ZH Tan, B Li, X Zhong, Z Liu, Q Zeng
arXiv preprint arXiv:2509.23767, 2025
12025
Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning
J Wu, X Zhong, J Sun, B Li, B Jin, J Han, Q Zeng
arXiv preprint arXiv:2510.15191, 2025
2025
DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning
Y Wang, B Li, J Wu, Z Tan, Z Liu, R Zhang, A Grama, Q Zeng
arXiv preprint arXiv:2510.02341, 2025
2025
Learning Vision-Based Neural Network Controllers with Semi-Probabilistic Safety Guarantees
X Ma, J Wu, H Sibai, Y Kantaros, Y Vorobeychik
arXiv preprint arXiv:2503.00191, 2025
2025
Trustworthy Autonomy Through Robust Control and Alignment
J Wu
Washington University in St. Louis, 2025
2025
The system can't perform the operation now. Try again later.
Articles 1–15