Reinforcement Learning Implementation Inspired by Bilibili Professor Zhao Shiyu's Lecture at Westlake University
- Bellman Equation
- Value Iteration with Bellman
- Policy Iteration with Bellman
- Policy Iteration with Monte Carlo
- Stochastic Gradient Descent (SGD), Batch Gradient Descent (BGD), and Mini-Batch Gradient Descent (MBGD)
- Q-learning
- Deep Q-learning
- REINFORCE
Feel free to discuss any questions or ideas!
Original Code - Please Cite the Source if Referenced