- Introduce Multi-Armed Bandit (MBA) problem to fellow students as a classic problem in reinforcement learning
- Implement several strategies for maximizing outcomes for MAB
- Compare performance for different strategies to analyze which one is more suitable given a situation