Efficient and optimal vehicle path-tracking control using model-based deep reinforcement learning: two actor-critic neural networks are trained to compute optimal steering actions for following an arbitrary path. To validate the optimality of the trained parameterized control policy, the actor network's solution is compared with the solution that MPC (IPOPT) provides for the corresponding optimal control problem.
Compared to the original repository, the objective functions of the actor-critic networks and of the MPC are kept exactly the same, and the networks are retrained with adapted hyperparameters. The retrained networks achieve a clear improvement over the baseline results obtained from the pretrained network provided in the original repository. The baseline exhibits an oscillatory response in both the heading-angle error and the steering action that is absent from the optimal MPC solution, which calls the optimality of the previously provided networks into question. With the retrained networks, the oscillatory behavior is almost eliminated, and both the heading-angle error and the steering control closely match the optimal MPC solution. This validates that the retrained networks provide a near-optimal solution to the corresponding OCP.
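For reference, the core training pattern (alternating policy evaluation and policy improvement against a differentiable model) can be sketched as follows. This is a minimal illustration only: `dynamics_step`, `utility`, the four-dimensional error state, and the network sizes are placeholder assumptions, not the repository's actual definitions in `dynamics.py`, `network.py`, or `train.py`.

```python
import torch
import torch.nn as nn

# Placeholder stand-ins for the repository's vehicle model and stage cost
# (the real definitions live in dynamics.py and config.py).
def dynamics_step(state, steer):
    # Toy linear error dynamics: NOT the bicycle model used by the repo.
    return state + 0.01 * torch.cat([state[:, 1:], steer], dim=1)

def utility(state, steer):
    # Quadratic tracking cost on the error state plus steering effort.
    return (state ** 2).sum(dim=1) + 0.1 * (steer ** 2).sum(dim=1)

actor = nn.Sequential(nn.Linear(4, 64), nn.ELU(), nn.Linear(64, 1), nn.Tanh())
critic = nn.Sequential(nn.Linear(4, 64), nn.ELU(), nn.Linear(64, 1))
opt_actor = torch.optim.Adam(actor.parameters(), lr=1e-4)
opt_critic = torch.optim.Adam(critic.parameters(), lr=1e-3)

for iteration in range(10000):
    state = torch.randn(256, 4)  # batch of sampled tracking-error states

    # Policy evaluation (PEV): fit V(s) to the one-step Bellman target.
    with torch.no_grad():
        steer = actor(state)
        target = utility(state, steer) + critic(dynamics_step(state, steer)).squeeze(1)
    critic_loss = ((critic(state).squeeze(1) - target) ** 2).mean()
    opt_critic.zero_grad()
    critic_loss.backward()
    opt_critic.step()

    # Policy improvement (PIM): minimize l(s, a) + V(f(s, a)) w.r.t. the actor.
    steer = actor(state)
    actor_loss = (utility(state, steer) + critic(dynamics_step(state, steer)).squeeze(1)).mean()
    opt_actor.zero_grad()
    actor_loss.backward()
    opt_actor.step()
```

Because the model is differentiable, the actor loss backpropagates through both the stage cost and the critic's evaluation of the successor state, which is what distinguishes this model-based scheme from model-free policy gradients.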
The retrained network is included in this repository; trained results are stored in `Results_dir`.
Many thanks to Haitong Ma for open-sourcing the repository below, which helped me understand ADP and actor-critic MBRL.
This is a modified version of the original repository.
The contents below are from the original repository.

---

Code demo for Chapter 8, *Reinforcement Learning and Control*.
Methods: Approximate Dynamic Programming, Model Predictive Control
Requirements: PyTorch 1.4.0
- To train an agent, follow the example code in `main.py` and tune the parameters. Change the `METHODS` variable to adjust the methods compared in the simulation stage.
- Simulations are executed automatically after training finishes. To start a simulation separately from trained results and compare the performance of ADP and MPC, run `simulation.py`. Change the `LOG_DIR` variable to set which trained results are loaded (see the sketch below).
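For example, the two variables might be set as follows. These values are illustrative only; the accepted method names and the results-folder layout are defined in `main.py` and `simulation.py`.

```python
# In main.py: choose the controllers to compare in the simulation stage.
METHODS = ['ADP', 'MPC']  # illustrative values; check main.py for the accepted names

# In simulation.py: point at a finished training run to load.
LOG_DIR = './Results_dir/<your_run>'  # illustrative path; check simulation.py
```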
Approximate-Dynamic-Programming
│  main.py - Main script
│  plot.py - Plots comparisons between ADP and MPC
│  train.py - Executes PEV and PIM
│  dynamics.py - Vehicle model
│  network.py - Network structure
│  solver.py - Solvers for MPC using CasADi (see the sketch after this tree)
│  config.py - Configuration for training and the vehicle model
│  simulation.py - Runs experiments to compare ADP and MPC
│  readme.md
│  requirements.txt
│
├─ Results_dir - Stores trained results
│
└─ Simulation_dir - Stores simulation data and plots
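The MPC baseline in `solver.py` formulates the tracking OCP in CasADi and solves it with IPOPT. The following is a minimal sketch of that pattern only: the horizon, the toy error dynamics, and the cost weights are assumptions for illustration and do not reproduce the repository's actual bicycle-model formulation.

```python
import casadi as ca

N, dt = 30, 0.05                       # illustrative horizon and step size
x = ca.SX.sym('x', 3)                  # toy error state (NOT the repo's bicycle model)
u = ca.SX.sym('u', 1)                  # steering input
x_next = x + dt * ca.vertcat(x[1], x[2], -2.0 * x[2] + 5.0 * u[0])
f = ca.Function('f', [x, u], [x_next])

X = ca.SX.sym('X', 3, N + 1)           # state trajectory (multiple shooting)
U = ca.SX.sym('U', 1, N)               # steering trajectory
x0 = ca.SX.sym('x0', 3)                # current tracking error (parameter)

cost = 0
g = [X[:, 0] - x0]                     # initial-state constraint
for k in range(N):
    cost += ca.sumsqr(X[:, k]) + 0.1 * ca.sumsqr(U[:, k])  # illustrative weights
    g.append(X[:, k + 1] - f(X[:, k], U[:, k]))            # shooting-gap constraints

nlp = {'x': ca.vertcat(ca.vec(X), ca.vec(U)),
       'p': x0, 'f': cost, 'g': ca.vertcat(*g)}
solver = ca.nlpsol('solver', 'ipopt', nlp)
sol = solver(x0=0, p=[0.5, 0.1, 0.0], lbg=0, ubg=0)
print(float(sol['f']))                 # optimal cost for this initial error
```

In a receding-horizon loop, only the first element of the optimized steering sequence would be applied before re-solving from the next measured state.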
References:
- *Reinforcement Learning and Control*. Tsinghua University Lecture Notes, 2020.
- Andersson, J. A. E., Gillis, J., Horn, G., Rawlings, J. B., and Diehl, M. CasADi: a software framework for nonlinear optimization and optimal control. *Mathematical Programming Computation*, 11(1):1–36, 2019.