Why does GAIL get lower rewards the more it is trained? #36

Open
ZXAXKL opened this issue Jun 25, 2023 · 1 comment

ZXAXKL commented Jun 25, 2023

Hi, thank you for the baseline code; it has helped me a lot. I have run into a problem when running it, though. I first sample data from a trained expert policy and then provide it to GAIL, but in the Ant-v2 and Hopper-v2 environments the rewards get lower and lower as the number of training iterations increases. My environment is mujoco.py=2.0.8 with mujoco200. I would be very grateful if you could take the time to look into this.
[Two attached images: 16571687510554_ pic, 16401687509779_ pic]

@Yangning-k

Hello, I have also encountered this problem. May I ask whether you have solved it? Thank you.
