DDPG / DQN Highly Unstable
Added:
* Reduced memory (RAM) usage by cleaning items on input.

Fixed:
* DDPG is now stable and works on Pendulum as expected.

Notes:
* Now that DDPG works as expected, we will move on to preparing the repo for version 1.0. This will involve testing / CI and passing the expected benchmarks.
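
For illustration only, here is a minimal sketch of what cleaning items on input could look like. It assumes "Memory" refers to a replay buffer of transitions. `clean_transition` and `ReplayMemory` are hypothetical names, not the repo's actual API, and the choice of 4-byte floats is an assumption.

```python
import random
from array import array
from collections import deque


def clean_transition(state, action, reward, next_state, done):
    """Hypothetical cleaning step: keep only what training needs, stored compactly."""
    return (
        array("f", state),       # 4-byte floats instead of boxed Python floats
        array("f", action),
        float(reward),
        array("f", next_state),
        bool(done),
    )


class ReplayMemory:
    """Fixed-capacity buffer that cleans every transition as it is inserted."""

    def __init__(self, capacity):
        # A bounded deque evicts the oldest transition once capacity is reached.
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append(
            clean_transition(state, action, reward, next_state, done)
        )

    def sample(self, batch_size):
        # Uniform sampling without replacement.
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)
```

Cleaning at insertion time means the buffer never holds the larger raw objects, so peak RAM is bounded by the capacity times the compact item size.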