Howuhh

💭

What shall I build or write against the fall of night?

Alexander Nikulin Howuhh

💭

What shall I build or write against the fall of night?

RL Researcher @ dunnolab

182 followers · 289 following

Achievements

x2 x2

Achievements

x2 x2

Organizations

Howuhh.github.io Public

HTML 3 Updated Feb 5, 2025
JaxMARL Public
Forked from FLAIROx/JaxMARL

Multi-Agent Reinforcement Learning with JAX

Python 1 Apache License 2.0 Updated Dec 4, 2023
pgx Public
Forked from sotetsuk/pgx

🎲 Vectorized RL game environments written in JAX with end-to-end AlphaZero examples

Python 1 Apache License 2.0 Updated Dec 4, 2023
MHRW Public

metropolis-hastings random walk with PySpark

python spark graph graph-algorithms random-walk graph-sampling metropolis-hastings-algorithm

Jupyter Notebook 7 Updated Aug 14, 2023
prioritized_experience_replay Public

Prioritized Experience Replay implementation with proportional prioritization

reinforcement-learning dqn prioritized-experience-replay

Python 75 10 MIT License Updated Jul 18, 2023
Minari Public
Forked from Farama-Foundation/Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python Other Updated May 31, 2023
sac-n-jax Public

Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch

reinforcement-learning flax equinox jax offline-reinforcement-learning d4rl

Python 49 3 MIT License Updated May 21, 2023
memory-maze Public
Forked from jurgisp/memory-maze

Evaluating long-term memory of reinforcement learning algorithms

Python 1 MIT License Updated May 8, 2023
faster-trajectory-transformer Public archive

Implementation of Trajectory Transformer with attention caching and batched beam search

reinforcement-learning transformer trajectory-transformer

Python 110 14 MIT License Updated Apr 27, 2023
link_pred Public

link prediction in social network based on node neighborhoods

Jupyter Notebook Updated Sep 30, 2022
link_pred_spark Public

similarity between graph nodes based on local information with PySpark

spark graph-algorithms pyspark edge-prediction similarity-measures

Python 9 1 Updated Sep 30, 2022
chess_minimax Public

minimax algorithm for chess with alpha-beta pruning

chess minimax minimax-search minimax-algorithm chess-database chess-ai minimax-chess

Jupyter Notebook 8 5 MIT License Updated Aug 23, 2022
d4rl Public
Forked from Farama-Foundation/D4RL

A benchmark for offline reinforcement learning.

Python Apache License 2.0 Updated Aug 17, 2022
cleanrl Public
Forked from vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 1 Other Updated Jul 6, 2022
average_reward_ppo Public

Implementation of "Average-Reward Reinforcement Learning with Trust Region Methods" paper.

Python 8 1 Updated Jun 21, 2022
cic_gym Public

Adaptation of original "Contrastive Intrinsic Control for Unsupervised Skill Discovery" implementation to OpenAI Gym

reinforcement-learning unsupervised-learning intrinsic-reward skill-discovery

Python 3 1 Updated Jun 14, 2022
dul_2021 Public
Forked from GrigoryBartosh/dul_2021

Jupyter Notebook Updated May 24, 2022
halfcheetah_experts Public

expert policies for forward and backflip halfcheetah envs

Python 2 Updated Apr 18, 2022
vector-quantize-pytorch Public
Forked from lucidrains/vector-quantize-pytorch

Vector Quantization, in Pytorch

Python MIT License Updated Mar 10, 2022
mujoco-py Public
Forked from openai/mujoco-py

MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

Cython Other Updated Mar 1, 2022
cic Public
Forked from rll-research/cic

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

Python Updated Feb 24, 2022
robosuite Public
Forked from ARISE-Initiative/robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Python MIT License Updated Dec 27, 2021
hse_recsys Public

hse recommender systems course

Jupyter Notebook Updated Dec 15, 2021
TTS_HW Public
Forked from spolezhaev/TTS_HW

Python BSD 3-Clause "New" or "Revised" License Updated Dec 10, 2021
hse_bayesian_ml Public

Jupyter Notebook Updated Nov 13, 2021
linear-transformer-experiments Public
Forked from idiap/linear-transformer-experiments

Experiments using fast linear transformer

Python Other Updated Sep 26, 2021
hse_reinforcement_learning Public

HSE Reinforcement Learning course

Jupyter Notebook Updated Sep 25, 2021
Predators-and-Preys Public
Forked from ArgentumWalker/Predators-and-Preys

Python 2 1 Updated Jun 10, 2021
evolution_strategies_openai Public

implementation of "Evolution Strategies as a Scalable Alternative to Reinforcement Learning" OpenAI paper

reinforcement-learning openai-gym gym evolutionary-algorithms evolution-strategies implementation-of-research-paper

Python 20 2 Updated Apr 18, 2021
autograd_but_smaller Public

Simple implementation of reverse-mode automatic differentiation on numpy arrays

autograd backpropagation autodifferentiation

Jupyter Notebook 2 Updated Apr 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Alexander Nikulin Howuhh

Achievements

Achievements

Organizations

Block or report Howuhh

Howuhh.github.io Public

JaxMARL Public

pgx Public

MHRW Public

prioritized_experience_replay Public

Minari Public

sac-n-jax Public

memory-maze Public

faster-trajectory-transformer Public archive

link_pred Public

link_pred_spark Public

chess_minimax Public

d4rl Public

cleanrl Public

average_reward_ppo Public

cic_gym Public

dul_2021 Public

halfcheetah_experts Public

vector-quantize-pytorch Public

mujoco-py Public

cic Public

robosuite Public

hse_recsys Public

TTS_HW Public

hse_bayesian_ml Public

linear-transformer-experiments Public

hse_reinforcement_learning Public

Predators-and-Preys Public

evolution_strategies_openai Public

autograd_but_smaller Public