Skip to content

An implementation of Meta-RL utilizing the A3C architecture

Notifications You must be signed in to change notification settings

JamesLawton/Meta-RL_Bandit-Problems

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

in4155-2017-james-lawton

Learning to reinforcement learn

A repository for the purpose of recreating and expanding upon the experiments mentioned in Learning to Reinforcement Learn (Wang, et al. 2016) Specifically, this repository focuses on the implementation of a series of bandit problems (easy, medium, and hard) for generalization purposes. Additionally, Meta-RL enables the agent to continue learning, even while the weights are frozen.

Implementation inspired from Arthur Juliani, please see blog post for further details.

About

An implementation of Meta-RL utilizing the A3C architecture

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published