-
Notifications
You must be signed in to change notification settings - Fork 453
memory for state #14
Comments
|
Thanks for question 1- I did not understand how to do it? |
@fi000 can you provide a link to the said paper.
|
@pskrunner14 You can refer to paper "Playing Atari with Deep Reinforcement Learning" section 4.1 last sentences of first paragraph! I have an state with 5 inputs but I have a problem in giving for instance 4 states in a frame! How we could do this in this code? |
thanks Keon for your great code!
I have two questions:
1- What does [0] means in self.model.predict(next_state)[0] and return np.argmin(act_values[0])? Does this mean that first element of batch?
2-If in addition to batch, I need that my state is the state from K times before, what is the necessary change in order to do this? I want to send the state=state[i-k+1]....state[i-1],state[i] not only one state! How I can do this?
Thanks again
The text was updated successfully, but these errors were encountered: