[go: up one dir, main page]

Zhang et al., 2024 - Google Patents

Versatile navigation under partial observability via value-guided diffusion policy

Zhang et al., 2024

View PDF
Document ID
2448210687823659187
Author
Zhang G
Tang H
Yan Y
Publication year
Publication venue
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

External Links

Snippet

Route planning for navigation under partial observability plays a crucial role in modern robotics and autonomous driving. Existing route planning approaches can be categorized into two main classes: traditional autoregressive and diffusion-based methods. The former …
Continue reading at openaccess.thecvf.com (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • G06N5/04Inference methods or devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6296Graphical models, e.g. Bayesian networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6267Classification techniques
    • G06K9/6268Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6267Classification techniques
    • G06K9/6279Classification techniques relating to the number of classes
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computer systems based on specific mathematical models
    • G06N7/005Probabilistic networks

Similar Documents

Publication Publication Date Title
Pertsch et al. Accelerating reinforcement learning with learned skill priors
Morales et al. A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
Qureshi et al. Motion planning networks
Chaplot et al. Learning to explore using active neural slam
Ebert et al. Self-Supervised Visual Planning with Temporal Skip Connections.
Amarjyoti Deep reinforcement learning for robotic manipulation-the state of the art
Tai et al. A survey of deep network solutions for learning control in robotics: From reinforcement to imitation
Yu et al. Learning efficient multi-agent cooperative visual exploration
Wu et al. Reinforcement learning-based visual navigation with information-theoretic regularization
Zhang et al. Versatile navigation under partial observability via value-guided diffusion policy
CN114261400B (en) Automatic driving decision method, device, equipment and storage medium
Liu et al. Benchmarking constraint inference in inverse reinforcement learning
Bitzer et al. Using dimensionality reduction to exploit constraints in reinforcement learning
CN116360435A (en) Training method and system for multi-agent cooperative strategy based on episodic memory
CN118061186A (en) Robot planning method and system based on multi-mode large model predictive control
Monteiro et al. Augmented behavioral cloning from observation
Kumar et al. Gcexp: Goal-conditioned exploration for object goal navigation
Messikommer et al. Contrastive initial state buffer for reinforcement learning
Antonyshyn et al. Deep model-based reinforcement learning for predictive control of robotic systems with dense and sparse rewards
Jurgenson et al. Sub-Goal Trees--a Framework for Goal-Directed Trajectory Prediction and Optimization
US20250162150A1 (en) Action planning for robot control
CN119714278A (en) Robot path planning method and system based on improved ant colony algorithm and DWA fusion
Wang et al. Guided cooperation in hierarchical reinforcement learning via model-based rollout
Parisotto Meta reinforcement learning through memory
Seo et al. PRESTO: Fast motion planning using diffusion models based on key-configuration environment representation