I notice that the demo DQN example has a hyperparameter action_interval, whose default value is 2. If so, doesn't this break the Markov Decision Process (MDP) assumption, i.e., the next state stored for each action is not the state that immediately follows it?
Also, what action_interval value would you suggest as good enough for this specific problem?
action_interval=2 means that the agent makes a decision every 2*10 = 20 seconds. You can try different integer values of action_interval (for example, 1, 2, ...), but please note that the minimum is action_interval = 1 (a decision every 10 seconds).
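To make the arithmetic concrete, here is a minimal sketch in Python. It is not the demo's actual code: `BASE_STEP_SECONDS`, `decision_period_seconds`, `CountingEnv`, and `run_decision` are hypothetical names, and the only fact taken from the reply above is that action_interval = 1 corresponds to one decision every 10 seconds. It also shows one common way to think about the MDP question, assuming the chosen action is held for the whole interval: the held steps are treated as a single transition at the decision timescale, with the reward summed over them.

```python
# From the reply: action_interval = 1 means one decision every 10 seconds.
BASE_STEP_SECONDS = 10


def decision_period_seconds(action_interval: int) -> int:
    """Seconds between two consecutive agent decisions."""
    if not isinstance(action_interval, int) or action_interval < 1:
        raise ValueError("action_interval must be an integer >= 1")
    return action_interval * BASE_STEP_SECONDS


class CountingEnv:
    """Hypothetical toy environment: the state is the number of base steps taken."""

    def __init__(self):
        self.t = 0

    def step(self, action: int):
        self.t += 1
        reward = float(action)  # placeholder reward, for illustration only
        return self.t, reward


def run_decision(env, action: int, action_interval: int):
    """Hold `action` for `action_interval` base steps and return one transition.

    Assumption: the agent only observes the state at the end of the interval,
    so (state, action, summed reward, next state) is a single transition of
    the decision-level process.
    """
    total_reward = 0.0
    next_state = None
    for _ in range(action_interval):
        next_state, reward = env.step(action)
        total_reward += reward
    return next_state, total_reward
```

For example, `decision_period_seconds(2)` gives 20, matching the 2*10 = 20 seconds above, and `run_decision(CountingEnv(), 1, 2)` advances the toy environment by two base steps before returning the next state the agent would see.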