in4155-2017-james-lawton

Learning to reinforcement learn

A repository for the purpose of recreating and expanding upon the experiments mentioned in Learning to Reinforcement Learn (Wang, et al. 2016) Specifically, this repository focuses on the implementation of a series of bandit problems (easy, medium, and hard) for generalization purposes. Additionally, Meta-RL enables the agent to continue learning, even while the weights are frozen.

Implementation inspired from Arthur Juliani, please see blog post for further details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
Bandit Experiments - MetaRL.ipynb		Bandit Experiments - MetaRL.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

in4155-2017-james-lawton

About

Releases

Packages

Languages

JamesLawton/Meta-RL_Bandit-Problems

Folders and files

Latest commit

History

Repository files navigation

in4155-2017-james-lawton

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages