Zhang et al., 2024 - Google Patents
Versatile navigation under partial observability via value-guided diffusion policyZhang et al., 2024
View PDF- Document ID
- 2448210687823659187
- Author
- Zhang G
- Tang H
- Yan Y
- Publication year
- Publication venue
- Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
External Links
Snippet
Route planning for navigation under partial observability plays a crucial role in modern robotics and autonomous driving. Existing route planning approaches can be categorized into two main classes: traditional autoregressive and diffusion-based methods. The former …
- 238000009792 diffusion process 0 title abstract description 83
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6296—Graphical models, e.g. Bayesian networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6279—Classification techniques relating to the number of classes
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Pertsch et al. | Accelerating reinforcement learning with learned skill priors | |
| Morales et al. | A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning | |
| Qureshi et al. | Motion planning networks | |
| Chaplot et al. | Learning to explore using active neural slam | |
| Ebert et al. | Self-Supervised Visual Planning with Temporal Skip Connections. | |
| Amarjyoti | Deep reinforcement learning for robotic manipulation-the state of the art | |
| Tai et al. | A survey of deep network solutions for learning control in robotics: From reinforcement to imitation | |
| Yu et al. | Learning efficient multi-agent cooperative visual exploration | |
| Wu et al. | Reinforcement learning-based visual navigation with information-theoretic regularization | |
| Zhang et al. | Versatile navigation under partial observability via value-guided diffusion policy | |
| CN114261400B (en) | Automatic driving decision method, device, equipment and storage medium | |
| Liu et al. | Benchmarking constraint inference in inverse reinforcement learning | |
| Bitzer et al. | Using dimensionality reduction to exploit constraints in reinforcement learning | |
| CN116360435A (en) | Training method and system for multi-agent cooperative strategy based on episodic memory | |
| CN118061186A (en) | Robot planning method and system based on multi-mode large model predictive control | |
| Monteiro et al. | Augmented behavioral cloning from observation | |
| Kumar et al. | Gcexp: Goal-conditioned exploration for object goal navigation | |
| Messikommer et al. | Contrastive initial state buffer for reinforcement learning | |
| Antonyshyn et al. | Deep model-based reinforcement learning for predictive control of robotic systems with dense and sparse rewards | |
| Jurgenson et al. | Sub-Goal Trees--a Framework for Goal-Directed Trajectory Prediction and Optimization | |
| US20250162150A1 (en) | Action planning for robot control | |
| CN119714278A (en) | Robot path planning method and system based on improved ant colony algorithm and DWA fusion | |
| Wang et al. | Guided cooperation in hierarchical reinforcement learning via model-based rollout | |
| Parisotto | Meta reinforcement learning through memory | |
| Seo et al. | PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation |