We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
13.3 DDPG 代码实践中,在定义的DDPG类中,方法def take_action(self, state):的返回动作应该加上截断。 return action -> return np.clip(action, -self.action_bound, self.action_bound) 该动作会用于Q网络对当前时间步的q值估计,动作不应大于环境的限制(添加的噪声会导致这种情况发生,尽管概率很小)。
def take_action(self, state):
return action
return np.clip(action, -self.action_bound, self.action_bound)
The text was updated successfully, but these errors were encountered:
No branches or pull requests
13.3 DDPG 代码实践中,在定义的DDPG类中,方法
def take_action(self, state):
的返回动作应该加上截断。return action
->return np.clip(action, -self.action_bound, self.action_bound)
该动作会用于Q网络对当前时间步的q值估计,动作不应大于环境的限制(添加的噪声会导致这种情况发生,尽管概率很小)。
The text was updated successfully, but these errors were encountered: