Skip to content

Latest commit

 

History

History
16 lines (10 loc) · 722 Bytes

README.md

File metadata and controls

16 lines (10 loc) · 722 Bytes

notebooks

Q-Learning

  • Tiny Maze : A 6 cell grid as an intro to tabular Q-Learning
  • Frozen Lake : Temporal difference Q-Learning on FrozenLake 4x4 grid
  • Taxi : Temporal difference Q-Learning on the toy Taxi Problem

Deep Q-Learning

  • Space Invaders : Train a Deep Q-Learning Network to play Space Invaders

Policy Optimization

  • Lunar Lander : Train a simple policy optimization network to land on the moon
  • Cart Pole : Train a simple policy optimization network to to push the cart left or right so that the pole stays in the equilibrium