GitHub - dennybritz/reinforcement-learning: Implementation of Reinforcement Lear...

README.md

Overview

This repository provides code, exercises, and solutions for popular Reinforcement Learning algorithms. They are meant to serve as a learning tool that complements the theoretical materials from the textbook and course listed under Resources.

Each folder corresponds to one or more chapters of the textbook and/or course. In addition to exercises and solutions, each folder also contains a list of learning goals, a brief concept summary, and links to the relevant readings.

All code is written in Python 3 and uses RL environments from OpenAI Gym. Advanced techniques use TensorFlow for neural network implementations.
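To give a feel for how code written against a Gym-style environment is typically structured, here is a minimal sketch of the agent-environment loop. It does not import Gym itself: `CorridorEnv`, `run_episode`, and the two policies are hypothetical names introduced only for illustration, and the `reset`/`step` methods merely mimic the classic Gym convention in which `step` returns an observation, a reward, a done flag, and an info dictionary.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment with a Gym-style reset/step interface."""

    def __init__(self, length=5):
        self.length = length
        self.position = 0

    def reset(self):
        # Start every episode at the left end of the corridor.
        self.position = 0
        return self.position

    def step(self, action):
        # Action 1 moves right, any other action moves left; walls clamp the position.
        move = 1 if action == 1 else -1
        self.position = min(max(self.position + move, 0), self.length - 1)
        done = self.position == self.length - 1
        reward = 1.0 if done else 0.0
        return self.position, reward, done, {}


def run_episode(env, policy, max_steps=100):
    """Run one episode with the given policy and return the total reward."""
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state)
        state, reward, done, _info = env.step(action)
        total_reward += reward
        if done:
            break
    return total_reward


def random_policy(state):
    # Ignores the state and picks an action uniformly at random.
    return random.choice([0, 1])


def always_right(state):
    # Always moves toward the rewarding end of the corridor.
    return 1
```

Calling `run_episode(CorridorEnv(), always_right)` walks straight to the goal. Swapping in `random_policy` exercises the same loop with exploration, which is the pattern the learning algorithms build on.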

Table of Contents

Introduction to RL problems & OpenAI Gym
MDPs and Bellman Equations
Dynamic Programming: Model-Based RL, Policy Iteration and Value Iteration
Monte Carlo Model-Free Prediction & Control
Temporal Difference Model-Free Prediction & Control
Function Approximation
Deep Q Learning (WIP)
Policy Gradient Methods (WIP)
Learning and Planning (WIP)
Exploration and Exploitation (WIP)

List of Implemented Algorithms

Dynamic Programming Policy Evaluation
Dynamic Programming Policy Iteration
Dynamic Programming Value Iteration
Monte Carlo Prediction
Monte Carlo Control with Epsilon-Greedy Policies
Monte Carlo Off-Policy Control with Importance Sampling
SARSA (On Policy TD Learning)
Q-Learning (Off Policy TD Learning)
Q-Learning with Linear Function Approximation
Deep Q-Learning for Atari Games
Double Deep-Q Learning for Atari Games
Deep Q-Learning with Prioritized Experience Replay (WIP)
Policy Gradient: REINFORCE with Baseline
Policy Gradient: Actor Critic with Baseline
Policy Gradient: Actor Critic with Baseline for Continuous Action Spaces
Deterministic Policy Gradients for Continuous Action Spaces (WIP)
Deep Deterministic Policy Gradients (DDPG) (WIP)
Asynchronous Advantage Actor Critic (A3C)
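
As an illustration of one of the simpler entries in the list above, the following is a minimal sketch of tabular Q-Learning (off-policy TD learning) on a tiny deterministic chain. It is not the repository's implementation: the chain environment, the `step`, `epsilon_greedy`, and `q_learning` helpers, and the hyperparameter values are all assumptions chosen to keep the example self-contained.

```python
import random
from collections import defaultdict

N_STATES = 6      # states 0..5; state 5 is terminal (hypothetical toy chain)
ACTIONS = [0, 1]  # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic chain dynamics: reward 1.0 only on reaching the last state."""
    move = 1 if action == 1 else -1
    next_state = min(max(state + move, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def epsilon_greedy(Q, state, epsilon, rng):
    """Behaviour policy: explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(Q[state])
    # Break ties randomly so the untrained agent does not get stuck.
    return rng.choice([a for a in ACTIONS if Q[state][a] == best])


def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    Q = defaultdict(lambda: [0.0, 0.0])  # Q[state][action], initialised to zero
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = epsilon_greedy(Q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Off-policy TD target: bootstrap from the greedy value of the next state.
            target = reward + (0.0 if done else gamma * max(Q[next_state]))
            Q[state][action] += alpha * (target - Q[state][action])
            state = next_state
    return Q


Q = q_learning()
# Greedy action in every non-terminal state after training.
greedy_policy = [max(ACTIONS, key=lambda a: Q[s][a]) for s in range(N_STATES - 1)]
```

On this toy chain the learned greedy policy should move right in every state. The only change needed to turn the update into SARSA (on-policy TD learning) would be to bootstrap from the action the behaviour policy actually takes next instead of from `max(Q[next_state])`.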

Resources

Textbooks:

Reinforcement Learning: An Introduction (2nd Edition)

Classes:

Talks/Tutorials:

Other Projects:

Selected Papers:
