Simple implementation of double Q-Learning based on MorvanZhou's RL code
https://papers.nips.cc/paper/2010/file/091d584fced301b442654dd8c23b3fc9-Paper.pdf
python double_q_learning.py
- You can adjust FRESH_TIME and SHOW_TIME to slow down the game process
- You can uncommon the print line to see how Q-table is iterated