-
-
JaxMARL Public
Forked from FLAIROx/JaxMARLMulti-Agent Reinforcement Learning with JAX
-
pgx Public
Forked from sotetsuk/pgx🎲 Vectorized RL game environments written in JAX with end-to-end AlphaZero examples
-
MHRW Public
metropolis-hastings random walk with PySpark
-
Prioritized Experience Replay implementation with proportional prioritization
-
Minari Public
Forked from Farama-Foundation/MinariA standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
Python Other UpdatedMay 31, 2023 -
sac-n-jax Public
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
-
memory-maze Public
Forked from jurgisp/memory-mazeEvaluating long-term memory of reinforcement learning algorithms
-
faster-trajectory-transformer Public archive
Implementation of Trajectory Transformer with attention caching and batched beam search
-
link_pred Public
link prediction in social network based on node neighborhoods
Jupyter Notebook UpdatedSep 30, 2022 -
link_pred_spark Public
similarity between graph nodes based on local information with PySpark
-
chess_minimax Public
minimax algorithm for chess with alpha-beta pruning
-
d4rl Public
Forked from Farama-Foundation/D4RLA benchmark for offline reinforcement learning.
Python Apache License 2.0 UpdatedAug 17, 2022 -
cleanrl Public
Forked from vwxyzjn/cleanrlHigh-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
-
average_reward_ppo Public
Implementation of "Average-Reward Reinforcement Learning with Trust Region Methods" paper.
-
cic_gym Public
Adaptation of original "Contrastive Intrinsic Control for Unsupervised Skill Discovery" implementation to OpenAI Gym
-
-
halfcheetah_experts Public
expert policies for forward and backflip halfcheetah envs
-
vector-quantize-pytorch Public
Forked from lucidrains/vector-quantize-pytorchVector Quantization, in Pytorch
Python MIT License UpdatedMar 10, 2022 -
mujoco-py Public
Forked from openai/mujoco-pyMuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Cython Other UpdatedMar 1, 2022 -
cic Public
Forked from rll-research/cicCIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Python UpdatedFeb 24, 2022 -
robosuite Public
Forked from ARISE-Initiative/robosuiterobosuite: A Modular Simulation Framework and Benchmark for Robot Learning
Python MIT License UpdatedDec 27, 2021 -
-
TTS_HW Public
Forked from spolezhaev/TTS_HWPython BSD 3-Clause "New" or "Revised" License UpdatedDec 10, 2021 -
-
linear-transformer-experiments Public
Forked from idiap/linear-transformer-experimentsExperiments using fast linear transformer
Python Other UpdatedSep 26, 2021 -
hse_reinforcement_learning Public
HSE Reinforcement Learning course
Jupyter Notebook UpdatedSep 25, 2021 -
Predators-and-Preys Public
Forked from ArgentumWalker/Predators-and-Preys -
evolution_strategies_openai Public
implementation of "Evolution Strategies as a Scalable Alternative to Reinforcement Learning" OpenAI paper
-
autograd_but_smaller Public
Simple implementation of reverse-mode automatic differentiation on numpy arrays