Code implementation for the RA-L and ICRA 21 paper 'Learning from Imperfect Demonstrations from Agents with Varying Dynamics' Arxiv. (Under developping)
pip install -r requirement.txt
cd imperfect_envs
pip ininstall -e .
Download demonstrations here
Extract the demonstrations here.
tar -zxf codes.tar.gz
Traning commands are in train.sh. We annotate the experiment name as a comment before the command.
The TRPO part is hugely based on: https://github.com/ikostrikov/pytorch-trpo