- Tiny Maze : A 6-cell grid as an intro to tabular Q-Learning
- Frozen Lake : Temporal-difference Q-Learning on the FrozenLake 4x4 grid
- Taxi : Temporal-difference Q-Learning on the toy Taxi problem
- Space Invaders : Train a Deep Q-Learning Network to play Space Invaders
- Lunar Lander : Train a simple policy optimization network to land on the moon
- Cart Pole : Train a simple policy optimization network to push the cart left or right so that the pole stays balanced upright
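
For orientation, the tabular Q-Learning idea behind the first three entries can be sketched in a few lines. This is only an illustrative sketch, not the code from the Tiny Maze project: the 2x3 layout, the reward values, the hyperparameters, and the helper names `step`, `train`, and `greedy_path` are all assumptions made for this example.

```python
import random

# Hypothetical 6-cell maze laid out as 2 rows x 3 columns (an assumption,
# not necessarily the layout used in the Tiny Maze project).
ROWS, COLS = 2, 3
START, GOAL = 0, 5
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply an action; moving into a wall leaves the agent in place."""
    r, c = divmod(state, COLS)
    dr, dc = ACTIONS[action]
    nr, nc = r + dr, c + dc
    if not (0 <= nr < ROWS and 0 <= nc < COLS):
        nr, nc = r, c
    next_state = nr * COLS + nc
    done = next_state == GOAL
    reward = 1.0 if done else -0.01  # small step penalty (assumed values)
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a Q-table with epsilon-greedy exploration and TD updates."""
    rng = random.Random(seed)
    q = [[0.0] * len(ACTIONS) for _ in range(ROWS * COLS)]
    for _ in range(episodes):
        state = START
        for _ in range(100):  # cap on episode length
            if rng.random() < epsilon:
                action = rng.randrange(len(ACTIONS))
            else:
                action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
            next_state, reward, done = step(state, action)
            # Q-Learning target: reward plus discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_path(q, max_steps=20):
    """Follow the greedy policy from START and return the visited cells."""
    state, path = START, [START]
    for _ in range(max_steps):
        action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
        state, _, done = step(state, action)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

The same update rule carries over to Frozen Lake and Taxi; only the environment's states, actions, and rewards change.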