11

GitHub - TianhongDai/reinforcement-learning-algorithms: This repository contains...

 5 years ago
source link: https://github.com/TianhongDai/reinforcement-learning-algorithms
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

README.md

Deep Reinforcement Learning Alogrithms

MIT License
demo
This repository will implement the classic deep reinforcement learning algorithms. The aim of this repository is to provide clear code for people to learn the deep reinforcement learning algorithm. In the future, more algorithms will be added and the existing codes will also be maintained.

Requirements

Installation

  1. install the pytorch
plase go to official webisite to install it: https://pytorch.org/

Recommend use Anaconda Virtual Environment to manage your packages
  1. install openai-baselines
# clone the openai baselines
git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .

Instructions

  1. select the suitable algorithms
cd <the-rl-algorithm>
  1. all of the parameters are defined in the arguments.py, you can train your model with suitable hyper-parameters.
  2. train the networks
python train_network.py --env-name=<env-name> --cuda (only TRPO not support GPU) --<other-flags>
  1. test the networks
python demo.py --env-name=<env-name>
  1. download the pre-trained models
    Please download them from the Google Driver, then put the saved_models under the corresponding algorithm's folder.

Acknowledgement:

Papers Related to the Deep Reinforcement Learning

[1] A Brief Survey of Deep Reinforcement Learning
[2] The Beta Policy for Continuous Control Reinforcement Learning
[3] Playing Atari with Deep Reinforcement Learning
[4] Deep Reinforcement Learning with Double Q-learning
[5] Dueling Network Architectures for Deep Reinforcement Learning
[6] Continuous control with deep reinforcement learning
[7] Continuous Deep Q-Learning with Model-based Acceleration
[8] Asynchronous Methods for Deep Reinforcement Learning
[9] Trust Region Policy Optimization
[10] Proximal Policy Optimization Algorithms
[11] Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK