We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
https://datawhalechina.github.io/easy-rl/#/chapter6/chapter6_questions&keywords
Description
The text was updated successfully, but these errors were encountered:
Thanks♪(・ω・)ノ
Sorry, something went wrong.
这个地方不是很理解:习题6-3(4)中:“所以对于时序差分方法来说,rr 是一个随机变量。”习题6-5中:“我们希望它们两个相减的损失值与 r_tr t 尽可能地接近。这也是网络的优化目标,我们称之为损失函数。”所以DQN的优化目标是拟合随机变量?
No branches or pull requests
https://datawhalechina.github.io/easy-rl/#/chapter6/chapter6_questions&keywords
Description
The text was updated successfully, but these errors were encountered: