Distributed RL platform with modified IMPALA architecture. Implements CLEAR, LASER V-trace modifications along with Attentive and Elite sampling experience replay methods.
machine-learning impala deep-reinforcement-learning pytorch policy-gradient arcade-learning-environment experience-replay distributed-reinforcement-learning actor-critic-with-experience-replay elite-sampling mixing-on-and-off-policy-data
-
Updated
Apr 8, 2022 - Python