Standalone implementations of reinforcement learning algorithms #8

jean72human · 2019-04-03T01:16:05Z

Hello,
I would like to know what you think about having some standalone implementations as functions that take in the environment and other parameters and return the trained policy.

Here is an example of what this could look like for deep Q-learning:

policy = deepq.learn(env, network=q_function_approximator, lr=learning_rate, epsilon=exploration_rate, buffer_size=buffer_size)

I think this would make it easier to get started quickly with deep reinforcement learning in Flux.
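To make the shape of such an interface concrete, here is a minimal sketch of a standalone `learn` function that takes an environment plus hyperparameters and returns a trained policy. It is only an illustration under stated assumptions: it is written in Python rather than Julia, it swaps deep Q-learning for plain tabular Q-learning so that it needs no neural-network library, and `ChainEnv`, `learn`, and all of their parameters are hypothetical names, not part of Gym.jl, Flux, or baselines.

```python
import random


class ChainEnv:
    """Hypothetical toy environment: walk right along a chain to reach the goal."""

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.n_actions = 2  # 0 = move left, 1 = move right
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        if action == 1:
            self.state = min(self.state + 1, self.n_states - 1)
        else:
            self.state = max(self.state - 1, 0)
        done = self.state == self.n_states - 1
        reward = 1.0 if done else 0.0  # reward only on reaching the last state
        return self.state, reward, done


def learn(env, lr=0.1, epsilon=0.1, gamma=0.9, episodes=500, max_steps=100, seed=0):
    """Train with tabular Q-learning and return a greedy policy function."""
    rng = random.Random(seed)
    q = [[0.0] * env.n_actions for _ in range(env.n_states)]

    for _ in range(episodes):
        state = env.reset()
        for _ in range(max_steps):
            if rng.random() < epsilon:
                # Explore: pick a uniformly random action.
                action = rng.randrange(env.n_actions)
            else:
                # Exploit: pick a best-valued action, breaking ties at random.
                best = max(q[state])
                action = rng.choice([a for a, v in enumerate(q[state]) if v == best])

            next_state, reward, done = env.step(action)
            # One-step Q-learning target; no bootstrapping from a terminal state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += lr * (target - q[state][action])
            state = next_state
            if done:
                break

    def policy(s):
        return max(range(env.n_actions), key=lambda a: q[s][a])

    return policy


# Usage: the caller only supplies an environment and hyperparameters.
policy = learn(ChainEnv(), lr=0.1, epsilon=0.1)
print([policy(s) for s in range(4)])
```

A deep variant would presumably keep the same outer signature and replace the table `q` with the function approximator passed in as `network`, plus a replay buffer of size `buffer_size`.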

tejank10 · 2019-04-04T11:45:10Z

It'd be great to have a sister package, like what baselines is to OpenAI Gym, showing off RL algorithms with Gym.jl

jean72human · 2019-04-05T09:08:10Z

Should it be a separate package, or can it be included in Gym.jl?

v-i-s-h · 2019-04-05T12:00:18Z

I'm also really interested to see this happening. Making it a separate package will be useful, I think.
Otherwise, users who want to use Gym.jl alone will also have to download the dependencies of this.

Further, which framework do you think would be best suited for this: Knet, Flux or Tensorflow.jl?

tejank10 · 2019-04-05T18:47:59Z

Making it a separate package is useful, for the same reasons mentioned by @v-i-s-h. Any of the frameworks can be used, but I'll be biased towards Flux ;)

jean72human · 2019-04-07T02:25:57Z

I think Flux would be good for that.
