Zhang et al., 2024 - Google Patents

Versatile navigation under partial observability via value-guided diffusion policy

Document ID: 2448210687823659187
Authors: Zhang G; Tang H; Yan Y
Publication year: 2024
Publication venue: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Snippet

Route planning for navigation under partial observability plays a crucial role in modern robotics and autonomous driving. Existing route planning approaches can be categorized into two main classes: traditional autoregressive and diffusion-based methods. The former …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6296—Graphical models, e.g. Bayesian networks
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks

Similar Documents

Publication	Publication Date	Title
Pertsch et al.	2021	Accelerating reinforcement learning with learned skill priors
Morales et al.	2021	A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
Qureshi et al.	2019	Motion planning networks
Chaplot et al.	2020	Learning to explore using active neural SLAM
Ebert et al.	2017	Self-Supervised Visual Planning with Temporal Skip Connections
Amarjyoti	2017	Deep reinforcement learning for robotic manipulation-the state of the art
Tai et al.	2016	A survey of deep network solutions for learning control in robotics: From reinforcement to imitation
Yu et al.	2022	Learning efficient multi-agent cooperative visual exploration
Wu et al.	2021	Reinforcement learning-based visual navigation with information-theoretic regularization
Zhang et al.	2024	Versatile navigation under partial observability via value-guided diffusion policy
CN114261400B (en)	2024-06-14	Automatic driving decision method, device, equipment and storage medium
Liu et al.	2022	Benchmarking constraint inference in inverse reinforcement learning
Bitzer et al.	2010	Using dimensionality reduction to exploit constraints in reinforcement learning
CN116360435A (en)	2023-06-30	Training method and system for multi-agent cooperative strategy based on episodic memory
CN118061186A (en)	2024-05-24	Robot planning method and system based on multi-mode large model predictive control
Monteiro et al.	2020	Augmented behavioral cloning from observation
Kumar et al.	2021	GCExp: Goal-conditioned exploration for object goal navigation
Messikommer et al.	2024	Contrastive initial state buffer for reinforcement learning
Antonyshyn et al.	2024	Deep model-based reinforcement learning for predictive control of robotic systems with dense and sparse rewards
Jurgenson et al.	2019	Sub-Goal Trees--a Framework for Goal-Directed Trajectory Prediction and Optimization
US20250162150A1 (en)	2025-05-22	Action planning for robot control
CN119714278A (en)	2025-03-28	Robot path planning method and system based on improved ant colony algorithm and DWA fusion
Wang et al.	2024	Guided cooperation in hierarchical reinforcement learning via model-based rollout
Parisotto	2021	Meta reinforcement learning through memory
Seo et al.	2024	PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation