Skip to content

Latest commit

 

History

History
8 lines (6 loc) · 413 Bytes

File metadata and controls

8 lines (6 loc) · 413 Bytes

Adapt the vacuum world (Chapter agents-chapter for reinforcement learning by including rewards for squares being clean. Make the world observable by providing suitable percepts. Now experiment with different reinforcement learning agents. Is function approximation necessary for success? What sort of approximator works for this application?