ppo_pytorch :

A simple implementation of clipped Proximal Policy Optimization (PPO) in PyTorch that runs in gym envs. This library also contains some weird additions, shortcuts and experimental stuff like truncated distributions, a fixed std on the policy network (which surprisingly works quite well) and full-episode rollouts, so it may not always marry up precisely with OpenAI Baselines.
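For anyone new to the "clipped" part, here is a rough, dependency-free sketch of the standard PPO clipped surrogate objective. It is plain Python rather than PyTorch so it runs anywhere, and it is not this library's code: the function names `clipped_surrogate` and `ppo_clip_loss` and the default `clip_eps=0.2` are assumptions made up for illustration.

```python
import math


def clipped_surrogate(logp_new, logp_old, advantage, clip_eps=0.2):
    """Clipped PPO surrogate for a single sample (higher is better).

    Hypothetical helper for illustration, not part of ppo_pytorch.
    """
    # Probability ratio between the updated policy and the policy
    # that collected the rollout.
    ratio = math.exp(logp_new - logp_old)
    # Restrict the ratio to the interval [1 - clip_eps, 1 + clip_eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Keep the more pessimistic of the clipped and unclipped estimates.
    return min(ratio * advantage, clipped_ratio * advantage)


def ppo_clip_loss(logps_new, logps_old, advantages, clip_eps=0.2):
    """Negative mean surrogate over a batch, so it can be minimized."""
    terms = [
        clipped_surrogate(new, old, adv, clip_eps)
        for new, old, adv in zip(logps_new, logps_old, advantages)
    ]
    return -sum(terms) / len(terms)
```

The point of the `min` is that a sample stops contributing extra gain once the new policy has moved more than `clip_eps` away from the old one in the direction the advantage favours, which keeps each update small.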

This guy has been training for 50409 16-episode rollouts; in the episode shown he scored 284.6.

Installation:

Ideally make yourself a virtualenv so I don't fuck up your torch install or whatever, and then do:

    git clone https://github.com/leaprovenzano/ppo_pytorch.git
    pip install -e ppo_pytorch

Super basic example :

COMING SOON ... a notebook or something, soz!
