Rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch

4 years ago

source link: https://rlpyt.readthedocs.io/en/latest/
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

rlpyt includes modular, optimized implementations of common deep RL algorithms in PyTorch, with unified infrastructure supporting all three major families of model-free algorithms: policy gradient, deep-q learning, and q-function policy gradient. It is intended to be a high-throughput code-base for small- to medium-scale research (large-scale meaning like OpenAI Dota with 100’s GPUs). A conceptual overview is provided in the white paper , and the code (with examples) in the github repository .

This documentation aims to explain the intent of the code structure, to make it easier to use and modify (it might not detail every keyword argument as in a fixed library). See the github README for installation instructions and other introductory notes. Please share any questions or comments to do with documenantation on the github issues.

The sections are organized as follows. First, several of the base classes are introduced. Then, each algorithm family and associated agents and models are grouped together. Infrastructure code such as the runner classes and sampler classes are covered next. All the remaining components are covered thereafter, in no particular order.

Recommend

Rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch

Recommend

面向 Kaggle 和离线比赛实用工具库 nyaggle，解决特征工程与验证两大难题（附代码）

使用PyTorch建立你的第一个文本分类模型

f-GAN简介：GAN模型的生产车间

推荐一个很棒的开源工作流elsa-core

为了不复制粘贴，我被逼着学会了JAVA爬虫

科技部要求加强对实验室特别是对病毒的管理

火神山医院被风吹走了？还严重漏水？别信。

继湖北后滴滴发出全国网约车租金顺延1个月倡议

董明珠致敬武汉员工格力陆续复工加紧生产抗疫产品

新冠药物和疫苗进展如何？科学家来答疑了

About Joyk