| End-to-end autonomous driving: Challenges and frontiers L Chen, P Wu, K Chitta, B Jaeger, A Geiger, H Li IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 753 | 2024 |
| Cambrian-1: A fully open, vision-centric exploration of multimodal llms P Tong, E Brown, P Wu, S Woo, AJV IYER, SC Akula, S Yang, J Yang, ... Advances in Neural Information Processing Systems 37, 87310-87356, 2024 | 709 | 2024 |
| St-p3: End-to-end vision-based autonomous driving via spatial-temporal feature learning S Hu, L Chen, P Wu, H Li, J Yan, D Tao European Conference on Computer Vision, 533-549, 2022 | 449 | 2022 |
| Trajectory-guided control prediction for end-to-end autonomous driving: A simple yet strong baseline P Wu, X Jia, L Chen, J Yan, H Li, Y Qiao Advances in Neural Information Processing Systems 35, 6119-6132, 2022 | 345 | 2022 |
| V*: Guided visual search as a core mechanism in multimodal llms P Wu, S Xie Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 286 | 2024 |
| Hdgt: Heterogeneous driving graph transformer for multi-agent trajectory prediction via scene encoding X Jia, P Wu, L Chen, Y Liu, H Li, J Yan IEEE transactions on pattern analysis and machine intelligence 45 (11 …, 2023 | 222 | 2023 |
| Think twice before driving: Towards scalable decoders for end-to-end autonomous driving X Jia, P Wu, L Chen, J Xie, C He, J Yan, H Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 189 | 2023 |
| Generalized predictive model for autonomous driving J Yang, S Gao, Y Qiu, L Chen, T Li, B Dai, K Chitta, P Wu, J Zeng, P Luo, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 142 | 2024 |
| Video-mmmu: Evaluating knowledge acquisition from multi-discipline professional videos K Hu, P Wu, F Pu, W Xiao, Y Zhang, X Yue, B Li, Z Liu arXiv preprint arXiv:2501.13826, 2025 | 92 | 2025 |
| Policy pre-training for autonomous driving via self-supervised geometric modeling P Wu, L Chen, H Li, X Jia, J Yan, Y Qiao arXiv preprint arXiv:2301.01006, 2023 | 49 | 2023 |
| Towards capturing the temporal dynamics for trajectory prediction: a coarse-to-fine approach X Jia, L Chen, P Wu, J Zeng, J Yan, H Li, Y Qiao Conference on Robot Learning, 910-920, 2023 | 45 | 2023 |
| Level 2 autonomous driving on a single device: Diving into the devils of openpilot L Chen, T Tang, Z Cai, Y Li, P Wu, H Li, J Shi, J Yan, Y Qiao arXiv preprint arXiv:2206.08176, 2022 | 24 | 2022 |
| Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning S Tian, R Wang, H Guo, P Wu, Y Dong, X Wang, J Yang, H Zhang, H Zhu, ... arXiv preprint arXiv:2506.13654, 2025 | 15 | 2025 |
| GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior P Wu, S Ma, B Wang, J Yu, L Lu, Z Liu arXiv preprint arXiv:2506.08012, 2025 | 11 | 2025 |
| Inharmonious region localization by magnifying domain discrepancy J Liang, L Niu, P Wu, F Guo, T Long Proceedings of the AAAI conference on artificial intelligence 36 (2), 1574-1582, 2022 | 9 | 2022 |
| Visual jigsaw post-training improves mllms P Wu, Y Zhang, H Diao, B Li, L Lu, Z Liu arXiv preprint arXiv:2509.25190, 2025 | 6 | 2025 |
| Video-mmmu: Evaluating knowledge acquisition from multi-discipline professional videos (2025) K Hu, P Wu, F Pu, W Xiao, Y Zhang, X Yue, B Li, Z Liu URL https://arxiv.org/abs/2501.13826 | 6 | |
| Inharmonious region localization with auxiliary style feature P Wu, L Niu, L Zhang arXiv preprint arXiv:2210.02029, 2022 | 5 | 2022 |
| Inharmonious Region Localization via Recurrent Self-Reasoning P Wu, L Niu, J Liang, L Zhang arXiv preprint arXiv:2210.02036, 2022 | 2 | 2022 |
| Streamline Without Sacrifice--Squeeze out Computation Redundancy in LMM P Wu, L Lu, Z Liu arXiv preprint arXiv:2505.15816, 2025 | 1 | 2025 |