Reinforcement-Learning-Playground

Mountain Car Problem - Continuous

Used the machin library to solve the Continuous Mountain Car problem with PPO and TD3. Implemented suitable actor-critic networks and tuned the hyperparameters.

Expected return per iteration.

Pendulum swing-up - Torques

Used RobotDART with OpenAI Gym spaces, created a custom reward function, and trained with PPO and TD3. Applied frame skipping. The initial position is x0 = [π], the observation is the vector [cos θ, sin θ, torque], and the reward function depends on the angle θ, the torque, and the command given to the robot.
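A minimal sketch of frame skipping, where one agent command is held for several simulator steps. It does not use RobotDART: `ToyPendulum` is a hypothetical stand-in simulator, and the names `step_with_frame_skip` and `make_observation`, the angle convention (θ = 0 upright), the dynamics, and the reward weights are all assumptions made for illustration. Only the observation layout [cos θ, sin θ, torque] and the initial position π come from the description above.

```python
import math
from typing import Callable, List, Tuple

# Hypothetical step signature: takes a command, returns (observation, reward, done).
StepFn = Callable[[float], Tuple[List[float], float, bool]]


def make_observation(theta: float, torque: float) -> List[float]:
    """Build the observation vector [cos θ, sin θ, torque]."""
    return [math.cos(theta), math.sin(theta), torque]


def step_with_frame_skip(step: StepFn, command: float, skip: int):
    """Apply the same command for `skip` consecutive simulator steps.

    Rewards are summed over the skipped frames (an assumption; averaging is
    another common choice). The loop stops early if the episode ends.
    """
    if skip < 1:
        raise ValueError("skip must be at least 1")
    total_reward = 0.0
    observation: List[float] = []
    done = False
    for _ in range(skip):
        observation, reward, done = step(command)
        total_reward += reward
        if done:
            break
    return observation, total_reward, done


class ToyPendulum:
    """Stand-in simulator (not RobotDART): unit mass and length, explicit Euler."""

    def __init__(self, dt: float = 0.01, max_steps: int = 1000):
        self.theta = math.pi  # initial position x0 = [π]
        self.velocity = 0.0
        self.dt = dt
        self.steps = 0
        self.max_steps = max_steps

    def step(self, command: float):
        torque = command  # toy model: the applied torque equals the command
        # Assumed convention: θ = 0 is upright, so gravity pushes away from 0.
        accel = 9.81 * math.sin(self.theta) + torque
        self.velocity += accel * self.dt
        self.theta += self.velocity * self.dt
        self.steps += 1
        # Wrap the angle to [-π, π] before penalising it.
        angle = math.atan2(math.sin(self.theta), math.cos(self.theta))
        reward = -abs(angle) - 0.001 * torque ** 2  # placeholder reward
        done = self.steps >= self.max_steps
        return make_observation(self.theta, torque), reward, done
```

With `env = ToyPendulum()`, a call such as `step_with_frame_skip(env.step, 0.5, 4)` advances the simulator four steps while the agent makes a single decision, which shortens the effective horizon the policy has to learn over.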

TD3 - Expected return per iteration.

PPO - Expected return per iteration.

Iiwa joint space RL-controller - Servo

Used RobotDART with OpenAI Gym spaces, created a custom reward function, and trained with PPO and TD3. The observation is a vector containing the positions and velocities of all the robot's joints, and the reward function is the norm of the difference between the final (target) and current joint positions.
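The observation and reward described above can be sketched in plain Python. This is an illustration under stated assumptions, not the repository's code: the function names are hypothetical, the norm is taken to be Euclidean, and the reward is negated so that it is highest at the target, which is a common choice but is not stated in the description.

```python
import math
from typing import List, Sequence


def make_observation(positions: Sequence[float], velocities: Sequence[float]) -> List[float]:
    """Concatenate joint positions and joint velocities into one vector."""
    if len(positions) != len(velocities):
        raise ValueError("positions and velocities must have the same length")
    return list(positions) + list(velocities)


def position_error_norm(target: Sequence[float], current: Sequence[float]) -> float:
    """Euclidean norm of (target - current) in joint space (assumed norm)."""
    if len(target) != len(current):
        raise ValueError("target and current must have the same length")
    return math.sqrt(sum((t - c) ** 2 for t, c in zip(target, current)))


def reward(target: Sequence[float], current: Sequence[float]) -> float:
    """Assumption: negate the norm so the reward is largest (zero) at the target."""
    return -position_error_norm(target, current)


# Usage with 7 joints, as on the iiwa arm; the values are made up.
target = [0.0] * 7
current = [3.0, 4.0, 0.0, 0.0, 0.0, 0.0, 0.0]
velocities = [0.0] * 7
observation = make_observation(current, velocities)
step_reward = reward(target, current)
```

Here `observation` has 14 entries (7 positions followed by 7 velocities), and the reward approaches zero as the joints approach the target configuration.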

TD3 - Expected return per iteration.

PPO - Expected return per iteration.

