README.md

Deep Reinforcement Learning Algorithms

This repository implements classic deep reinforcement learning algorithms. Its aim is to provide clear code for people learning deep reinforcement learning. More algorithms will be added in the future, and the existing code will be maintained.

Deep Q-Learning Network (DQN)
Double DQN (DDQN)
Deep Deterministic Policy Gradient (DDPG)
Dueling Network Architecture
Advantage Actor-Critic (A2C)
Trust Region Policy Optimization (TRPO)
Proximal Policy Optimization (PPO)
Actor Critic using Kronecker-Factored Trust Region (ACKTR)

Requirements

python-3.5.2
openai-gym
gym_ple
mujoco-py-1.50.1.56
pytorch-0.4.0
openai-baselines

Installation

install PyTorch

Please go to the official website to install it: https://pytorch.org/

We recommend using an Anaconda virtual environment to manage your packages.

install openai-baselines

# clone the openai baselines
git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .
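After installation, a quick check that the listed packages are visible to the interpreter can save a failed training run later. The following is a minimal sketch, not part of this repository: the import names (torch, gym, gym_ple, mujoco_py, baselines) are assumed to match the packages under Requirements, and check_requirements is a hypothetical helper.

```python
import importlib.util

# Import names assumed to correspond to the packages listed under Requirements.
REQUIRED_MODULES = ["torch", "gym", "gym_ple", "mujoco_py", "baselines"]


def check_requirements(modules=REQUIRED_MODULES):
    """Return a dict mapping each module name to True if it can be located."""
    status = {}
    for name in modules:
        try:
            # find_spec locates a top-level module without importing it.
            status[name] = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Treat any lookup failure as "not installed".
            status[name] = False
    return status


def format_report(status):
    """Build one human-readable line per module."""
    return "\n".join(
        "{:<12} {}".format(name, "found" if ok else "MISSING")
        for name, ok in status.items()
    )
```

Calling print(format_report(check_requirements())) inside the Anaconda environment lists which packages still need to be installed. This only checks that a module can be found; it does not verify the versions given under Requirements.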

Instructions

select a suitable algorithm

cd <the-rl-algorithm>

All of the parameters are defined in arguments.py; you can train your model with suitable hyper-parameters.
train the networks

python train_network.py --env-name=<env-name> --cuda --<other-flags>

Note: TRPO is the only algorithm that does not support the GPU, so omit --cuda when training TRPO.
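For readers unfamiliar with how command-line flags such as --env-name and --cuda are usually defined, here is a minimal sketch of what an arguments.py could look like. It is an illustration only and not the repository's actual file: the two flags above come from the training command, while --lr, --gamma, and all default values are hypothetical.

```python
import argparse


def get_args(argv=None):
    """Parse training flags; argv=None falls back to the real command line."""
    parser = argparse.ArgumentParser(description="illustrative RL training arguments")
    # Flags that appear in the training command of this README.
    # The default environment name is a hypothetical placeholder.
    parser.add_argument("--env-name", type=str, default="CartPole-v0",
                        help="name of the gym environment")
    parser.add_argument("--cuda", action="store_true",
                        help="train on the GPU if set")
    # Hypothetical hyper-parameters, shown only as examples of other flags.
    parser.add_argument("--lr", type=float, default=1e-4,
                        help="learning rate (assumed flag)")
    parser.add_argument("--gamma", type=float, default=0.99,
                        help="discount factor (assumed flag)")
    return parser.parse_args(argv)
```

With this layout, get_args(["--env-name=Pendulum-v0", "--cuda"]) yields an object whose env_name is "Pendulum-v0" and whose cuda is True. argparse converts the dash in --env-name to an underscore in the attribute name.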

test the networks

python demo.py --env-name=<env-name>

download the pre-trained models
Please download them from Google Drive, then put the saved_models folder under the corresponding algorithm's folder.

Acknowledgement:

Papers Related to the Deep Reinforcement Learning

[1] A Brief Survey of Deep Reinforcement Learning
[2] The Beta Policy for Continuous Control Reinforcement Learning
[3] Playing Atari with Deep Reinforcement Learning
[4] Deep Reinforcement Learning with Double Q-learning
[5] Dueling Network Architectures for Deep Reinforcement Learning
[6] Continuous control with deep reinforcement learning
[7] Continuous Deep Q-Learning with Model-based Acceleration
[8] Asynchronous Methods for Deep Reinforcement Learning
[9] Trust Region Policy Optimization
[10] Proximal Policy Optimization Algorithms
[11] Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation

GitHub - TianhongDai/reinforcement-learning-algorithms: This repository contains...
