[go: up one dir, main page]

Follow
Jiayang Song
Title
Cited by
Cited by
Year
Look before you leap: An exploratory study of uncertainty measurement for large language models
Y Huang, J Song, Z Wang, S Zhao, H Chen, F Juefei-Xu, L Ma
arXiv preprint arXiv:2307.10236, 2023
1932023
Isr-llm: Iterative self-refined large language model for long-horizon sequential task planning
Z Zhou, J Song, K Yao, Z Shu, L Ma
2024 IEEE International Conference on Robotics and Automation (ICRA), 2081-2088, 2024
1342024
Self-refined large language model as automated reward function designer for deep reinforcement learning in robotics
J Song, Z Zhou, J Liu, C Fang, Z Shu, L Ma
arXiv preprint arXiv:2309.06687, 2023
572023
Look Before You Leap: An Exploratory Study of Uncertainty Analysis for Large Language Models
Y Huang, J Song, Z Wang, S Zhao, H Chen, F Juefei-Xu, L Ma
IEEE Transactions on Software Engineering, 2025
462025
Towards building AI-CPS with NVIDIA Isaac Sim: An industrial benchmark and case study for robotics manipulation
Z Zhou, J Song, X Xie, Z Shu, L Ma, D Liu, J Yin, S See
Proceedings of the 46th international conference on software engineering …, 2024
432024
When cyber-physical systems meet AI: a benchmark, an evaluation, and a way forward
J Song, D Lyu, Z Zhang, Z Wang, T Zhang, L Ma
Proceedings of the 44th International Conference on Software Engineering …, 2022
412022
Luna: A model-based universal analysis framework for large language models
D Song, X Xie, J Song, D Zhu, Y Huang, F Juefei-Xu, L Ma
IEEE Transactions on Software Engineering 50 (7), 1921-1948, 2024
262024
Towards understanding retrieval accuracy and prompt quality in rag systems
S Zhao, Y Huang, J Song, Z Wang, C Wan, L Ma
arXiv preprint arXiv:2411.19463, 2024
222024
Multilingual blending: Llm safety alignment evaluation with language mixture
J Song, Y Huang, Z Zhou, L Ma
arXiv preprint arXiv:2407.07342, 2024
172024
Mosaic: Model-based safety analysis framework for AI-enabled cyber-physical systems
X Xie, J Song, Z Zhou, F Zhang, L Ma
arXiv preprint arXiv:2305.03882, 2023
162023
Online safety analysis for llms: a benchmark, an assessment, and a path forward
X Xie, J Song, Z Zhou, Y Huang, D Song, L Ma
IEEE Transactions on Artificial Intelligence, 2025
152025
Autorepair: Automated repair for ai-enabled cyber-physical systems under safety-critical conditions
D Lyu, J Song, Z Zhang, Z Wang, T Zhang, L Ma, J Zhao
arXiv preprint arXiv:2304.05617, 2023
152023
: A Semantics-Guided Safety Enhancement Framework for AI-Enabled Cyber-Physical Systems
J Song, X Xie, L Ma
IEEE Transactions on Software Engineering 49 (8), 4058-4080, 2023
142023
Towards testing and evaluating vision-language-action models for robotic manipulation: An empirical study
Z Wang, Z Zhou, J Song, Y Huang, Z Shu, L Ma
arXiv e-prints, arXiv: 2409.12894, 2024
132024
VLATest: Testing and Evaluating Vision-Language-Action Models for Robotic Manipulation
Z Wang, Z Zhou, J Song, Y Huang, Z Shu, L Ma
Proceedings of the ACM on Software Engineering 2 (FSE), 1615-1638, 2025
102025
MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled Cyber-Physical Systems
R Wang, Z Zhou, J Song, X Xie, X Xie, L Ma
arXiv preprint arXiv:2408.03892, 2024
72024
Active testing of large language model via multi-stage sampling
Y Huang, J Song, Q Hu, F Juefei-Xu, L Ma
arXiv preprint arXiv:2408.03573, 2024
72024
Multilingual Blending: Large Language Model Safety Alignment Evaluation with Language Mixture
J Song, Y Huang, Z Zhou, L Ma
Findings of the Association for Computational Linguistics: NAACL 2025, 3433-3449, 2025
62025
LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation
Z Wang, Z Zhou, J Song, Y Huang, Z Shu, L Ma
arXiv preprint arXiv:2410.05191, 2024
62024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Z Zhou, X Xie, J Song, Z Shu, L Ma
IEEE Transactions on Neural Networks and Learning Systems, 2024
42024
The system can't perform the operation now. Try again later.
Articles 1–20