Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Standalone implementations of reinforcement learning algorithms #8

Open
jean72human opened this issue Apr 3, 2019 · 5 comments
Open

Comments

@jean72human
Copy link

Hello,
I would like to know what you think about having some standalone implementations as functions that take in the environment and other parameters and return the trained policy.

Here an example of how this could look like for deep Q learning:

policy = deepq.learn(env, network=q_function_approximator, lr=learning_rate, epsilon=exploration_rate, buffer_size=buffer_size)

I think this would make it easier to quickly get started with deep reinforcement learning with Flux

@tejank10
Copy link
Contributor

tejank10 commented Apr 4, 2019

It'd be great to have a sister package, like what baselines is to OpenAI Gym, showing off RL algorithms with Gym.jl

@jean72human
Copy link
Author

Should it be a seperate package or can it be included in Gym.jl?

@v-i-s-h
Copy link

v-i-s-h commented Apr 5, 2019

I'm also really interested to see this happening. Making it a separate package will be useful, I think.
Otherwise, users who want to use Gym.jl alone will also have to download the dependencies of this.

Further, which framework you propose will be best suited for this: Knet, Flux or Tensorflow.jl?

@tejank10
Copy link
Contributor

tejank10 commented Apr 5, 2019

Making separate package is useful, for the same reasons mentioned by @v-i-s-h . Any of the framework can be used, but I'll be biased towards Flux ;)

@jean72human
Copy link
Author

I think Flux would be good for that

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants