Notes for Sutton & Barto Book (2018)

Here's my notes and Python implementation on some classical and deep RL algorithms mentioned in the book.

Chapter 4: Dynamic Programming

Chapter 5: Monte Carlo Methods

Chapter 6: Temporal-Difference Learning

Chapter 7: n-step Bootstrapping

Chapter 8: Planning and Learning with Tabular Methods

Chapter 9: On-policy Prediction with Approximation

Chapter 9: On-policy Control with Approximation