Code implementation for the RA-L and ICRA 21 paper 'Learning from Imperfect Demonstrations from Agents with Varying Dynamics' arXiv. (Under development)

Installation

pip install -r requirements.txt

cd imperfect_envs

pip install -e .

Demonstrations

Download demonstrations here

Extract the demonstrations here:

tar -zxf codes.tar.gz

Training

Training commands are in train.sh. Each command is preceded by a comment giving the experiment name.
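To pick out a single experiment from train.sh, it can help to list each comment together with the command that follows it. The snippet below is a minimal sketch, not part of the repository: `list_experiments` is a hypothetical helper, and it assumes each experiment is a `#` comment line followed by a one-line command. The sample text is made up for illustration and is not the actual content of train.sh.

```python
# Hypothetical helper, not part of the repository: pair each comment line in a
# shell script with the command on the next non-empty line.
def list_experiments(script_text):
    """Return (experiment_name, command) pairs from shell-script text.

    Assumption: every experiment is written as a '#' comment line followed
    by its command on a single line (no backslash continuations).
    """
    experiments = []
    pending_name = None
    for raw_line in script_text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#!"):
            continue  # skip blank lines and a shebang line
        if line.startswith("#"):
            # Remember the most recent comment as the experiment name.
            pending_name = line.lstrip("#").strip()
        elif pending_name is not None:
            experiments.append((pending_name, line))
            pending_name = None
    return experiments


# Made-up sample in the assumed layout; the real train.sh may differ.
sample = (
    "# experiment-a\n"
    "python train_reacher.py\n"
    "\n"
    "# experiment-b\n"
    "python train_driving.py\n"
)

for name, command in list_experiments(sample):
    print(f"{name}: {command}")
```

To use it on the real script, read the file with `open("train.sh").read()` and pass the text to `list_experiments`. Commands that span several lines would need extra handling.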

Testing

Acknowledgement

The TRPO part is largely based on: https://github.com/ikostrikov/pytorch-trpo

