num_parallel affecting learning results #229

spicytomatoes · 2021-12-14T17:09:53Z

Hi, I tried training on a 32-core machine, so naturally I set num_parallel to 32. However, the model does not seem to learn at all. Strangely, when I set num_parallel to 6, the model learns.
The rest of the config is exactly the same as the PubHRL config for hungry geese.

YuriCat · 2022-01-08T04:45:37Z

Thanks for your report!
We ran several experiments with 64 workers, and all the training was successful.
However, in this task it is not easy to learn which moves are illegal, and I am sure that training is unstable.

If there is one thing I can say, it is that the PubHRL experiment setup was decided on the first try, so I cannot recommend it with confidence.
As I mentioned in the discussion, I think forward_steps=1 is generally better for this kind of task. A larger entropy regularization coefficient would also help.
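
For anyone who wants to try these two changes, here is a minimal sketch of applying them as overrides on top of an existing set of training arguments. It assumes the key names `forward_steps`, `entropy_regularization`, and `worker.num_parallel` match your config file. The baseline numbers and the new entropy coefficient are placeholders, not the PubHRL values, and `apply_overrides` is a hypothetical helper rather than part of the library.

```python
import copy


def apply_overrides(base, overrides):
    """Return a copy of `base` with `overrides` merged in recursively."""
    merged = copy.deepcopy(base)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Merge nested sections instead of replacing them wholesale.
            merged[key] = apply_overrides(merged[key], value)
        else:
            merged[key] = value
    return merged


# Placeholder baseline: substitute the values from your own config file.
base_train_args = {
    "forward_steps": 16,
    "entropy_regularization": 0.01,
    "worker": {"num_parallel": 32},
}

# The two changes suggested above. The entropy value is only illustrative:
# "larger than the baseline", not a recommended setting.
suggested = {
    "forward_steps": 1,
    "entropy_regularization": 0.1,
}

train_args = apply_overrides(base_train_args, suggested)
print(train_args)
```

Keeping the overrides in a separate dict makes it easy to rerun the same experiment with num_parallel set to 6 and to 32 while changing nothing else, which is the comparison at issue here.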
