MINERVA is an out-of-the-box GUI tool for offline deep reinforcement learning, designed for everyone including non-programmers to do reinforcement learning as a tool.
Documentation: https://minerva-ui.readthedocs.io
MINERVA only requires datasets to start offline deep reinforcement learning. Any combinations of vector observations and image observations with discrete actions and continuous actions are supported.
MINERVA provides designed with intuitive GUI to let everyone lerverage extremely powerful algorithms without barriers. The GUI is developed as a Single Page Application (SPA) to make it work in the eye-opening speed.
MINERVA is powered by d3rlpy, a powerful offline deep reinforcement learning library for Python, to provide extremely powerful algorithms in an out-of-the-box way. The trained policy can be exported as TorchScript and ONNX.
$ pip install minerva-ui
$ docker run -d --gpus all -p 9000:9000 --name minerva takuseno/minerva:latest
If you update MINERVA, the database schema should be also updated as follows:
$ pip install -U minerva-ui
$ minerva upgrade-db
At the first time, ~/.minerva
will be automatically created to store
database, uploaded datasets and training metrics.
$ minerva run
By default, you can access to MINERVA interface at http://localhost:9000 .
You can change the host and port with --host
and --port
arguments
respectively.
You can delete entire data (~/.minerva
) as follows:
$ minerva clean
$ npm install
$ npm run build
This repository is fully formatted with yapf and standard. You can format the entire scripts as follows:
$ ./scripts/format
This repository is fully analyzed with Pylint, ESLint and sass-lint. You can run analysis as follows:
$ ./scripts/lint
The unit tests are provided as much as possible.
This repository is using pytest-cov
instead of pytest
.
You can run the entire tests as follows:
$ ./scripts/test
This work is supported by Information-technology Promotion Agency, Japan (IPA), Exploratory IT Human Resources Project (MITOU Program) in the fiscal year 2020.
The concept of the GUI software for deep reinforcement learning is inspired by DeepAnalyzer from Ghelia inc. I'm showing the great respect to the team here.