notebooks

Q-Learning

Tiny Maze : A 6 cell grid as an intro to tabular Q-Learning
Frozen Lake : Temporal difference Q-Learning on FrozenLake 4x4 grid
Taxi : Temporal difference Q-Learning on the toy Taxi Problem

Deep Q-Learning

Space Invaders : Train a Deep Q-Learning Network to play Space Invaders

Policy Optimization

Lunar Lander : Train a simple policy optimization network to land on the moon
Cart Pole : Train a simple policy optimization network to to push the cart left or right so that the pole stays in the equilibrium