This repo contains a Python implementation of the cliff walking problem from Example 6.6 of Reinforcement Learning: An Introduction by Sutton & Barto.
The purpose is to implement TD(0) for policy evaluation, as well as Q-Learning and Expected Sarsa for policy control.
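For reference, the two control methods differ only in the target of the one-step update. The following is a minimal tabular sketch, not necessarily the code in this repo; the array layout `Q[state, action]` and the parameter names `alpha`, `gamma`, `epsilon` are illustrative assumptions.

```python
import numpy as np

def q_learning_update(Q, s, a, r, s_next, alpha=0.5, gamma=1.0):
    """One tabular Q-Learning update: the target uses the greedy (max) action value."""
    target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])

def expected_sarsa_update(Q, s, a, r, s_next, alpha=0.5, gamma=1.0, epsilon=0.1):
    """One tabular Expected Sarsa update: the target averages Q over an
    epsilon-greedy policy at the next state instead of taking the max."""
    n_actions = Q.shape[1]
    probs = np.full(n_actions, epsilon / n_actions)   # exploration mass
    probs[np.argmax(Q[s_next])] += 1.0 - epsilon      # greedy action gets the rest
    target = r + gamma * np.dot(probs, Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])
```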
The environment is a standard undiscounted, episodic task with start and goal states, and the usual actions causing movement up, down,
right, and left. Reward is -1 on all transitions except those into the region marked "The Cliff."
Stepping into this region incurs a reward of -100 and sends the agent instantly back to the start.
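As a concrete reference for these dynamics, here is a minimal sketch of the usual 4x12 cliff-walking grid; the layout, function name, and action encoding are assumptions for illustration and may differ from the actual environment code in this repo.

```python
ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
ACTIONS = {0: (-1, 0), 1: (1, 0), 2: (0, 1), 3: (0, -1)}  # up, down, right, left

def step(state, action):
    """Apply one action; return (next_state, reward, done)."""
    r, c = state
    dr, dc = ACTIONS[action]
    r = min(max(r + dr, 0), ROWS - 1)   # clip movement to stay inside the grid
    c = min(max(c + dc, 0), COLS - 1)
    if r == 3 and 1 <= c <= 10:         # stepped into the cliff region
        return START, -100, False       # reward -100, sent back to the start
    if (r, c) == GOAL:
        return (r, c), -1, True         # episode ends at the goal
    return (r, c), -1, False            # usual step reward of -1
```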