Oversized return value for get_action #5

rlipkis · 2021-01-14T00:49:14Z

The function get_action (for the DRL solvers) returns rand(distribution, policy.solver.action_size). Since distribution is already multivariate, this call allocates and fills a square matrix of samples, one sample per column, of which only the first is relevant or used. I think the intended return value is rand(distribution), which draws a single sample vector. I can submit a pull request if that's preferred.
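A minimal sketch of the size difference, assuming the distribution comes from Distributions.jl. The MvNormal stand-in, the covariance matrix, and action_size = 3 are illustrative assumptions, not taken from the solver code:

```julia
using Distributions

# Hypothetical stand-ins for policy.solver.action_size and the policy's distribution.
action_size = 3
Σ = [i == j ? 1.0 : 0.0 for i in 1:action_size, j in 1:action_size]  # identity covariance
distribution = MvNormal(zeros(action_size), Σ)

# Reported behaviour: asking for action_size samples from a multivariate
# distribution yields one column per sample, i.e. a square matrix here.
oversized = rand(distribution, action_size)

# Proposed behaviour: a single draw is already a vector of length action_size.
single = rand(distribution)

println(size(oversized))
println(size(single))
```

With these assumptions the first call produces a matrix with action_size columns and the second a single vector, so the extra columns are allocated and sampled without being needed.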
