AlwaysSafe

Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training" — Thiago D. Simão, Nils Jansen and Matthijs T. J. Spaan, published at AAMAS 2021.

modules

agents: model-based RL agents that interact with the environment.
planners: the planners the RL agents use to compute a policy in each episode.
scripts: one file per experiment from the paper.
tests: test scripts, mostly based on unittest.
util: shared scripts to train an RL agent and evaluate a policy.
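
To make the split between agents and planners concrete, here is a minimal sketch of an agent that replans once per episode from what it has observed so far. Every name in it (`toy_step`, `greedy_planner`, `run`) is hypothetical and does not mirror the repository's API. The toy environment and the greedy stand-in planner are assumptions for illustration only, and the sketch ignores safety constraints entirely.

```python
import random
from collections import defaultdict

# All names here are hypothetical; they do not mirror the repository's API.

def toy_step(state, action, rng):
    """Toy 2-state environment: action 1 usually leads to state 1, which pays reward 1."""
    next_state = 1 if (action == 1 and rng.random() < 0.9) else 0
    reward = 1.0 if next_state == 1 else 0.0
    return next_state, reward

def greedy_planner(counts, reward_sums, n_states, n_actions):
    """Stand-in planner: per state, pick the action with the best observed mean reward."""
    policy = {}
    for s in range(n_states):
        def mean_reward(a):
            n = counts[(s, a)]
            # Untried actions get an optimistic value so they are tried at least once.
            return reward_sums[(s, a)] / n if n else float("inf")
        policy[s] = max(range(n_actions), key=mean_reward)
    return policy

def run(episodes=50, horizon=10, seed=0):
    rng = random.Random(seed)
    counts = defaultdict(int)         # visit counts per (state, action)
    reward_sums = defaultdict(float)  # accumulated reward per (state, action)
    policy = {}
    for _ in range(episodes):
        # The agent asks the planner for a new policy at the start of each episode.
        policy = greedy_planner(counts, reward_sums, n_states=2, n_actions=2)
        state = 0
        for _ in range(horizon):
            action = policy[state]
            next_state, reward = toy_step(state, action, rng)
            # Update the estimates with the observed transition.
            counts[(state, action)] += 1
            reward_sums[(state, action)] += reward
            state = next_state
    return policy

print(run())
```

The planner is passed only the agent's estimates, so a different planner could be swapped in without touching the interaction loop.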

lp solver

By default, the code uses gurobipy if it is available; otherwise, it falls back to cvxpy.
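
To check which of the two packages would be picked up in your environment, a small sketch like the following may help. The helper `pick_solver` is hypothetical and is not part of this repository; it only tests whether each package is importable, in the preference order stated above, and may differ from how the code actually selects its solver.

```python
from importlib.util import find_spec

def pick_solver(candidates=("gurobipy", "cvxpy")):
    """Return the name of the first importable package in `candidates`, or None."""
    for name in candidates:
        # find_spec returns None when a top-level package cannot be found.
        if find_spec(name) is not None:
            return name
    return None

print(pick_solver())
```

If this prints `None`, neither package is importable in the current environment.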

usage

install dependencies
```
pipenv install
```
run tests
```
pipenv run python -m unittest
```

reproduce the experiments

pipenv run python -m scripts.simple
pipenv run python -m scripts.factored
pipenv run python -m scripts.cliff_walking

citing

@inproceedings{Simao2021alwayssafe,
  author    = {Sim{\~a}o, Thiago D. and Jansen, Nils and Spaan, Matthijs T. J.},
  title     = {AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training},
  year      = {2021},
  booktitle = {Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS)},
  publisher = {IFAAMAS},
  pages     = {1226--1235},
}
