README.md

Reinforcement Learning with Model-Agnostic Meta-Learning (MAML)

Implementation of Model-Agnostic Meta-Learning (MAML) applied on Reinforcement Learning problems in Pytorch. This repository includes environments introduced in (Duan et al., 2016, Finn et al., 2017): multi-armed bandits, tabular MDPs, continuous control with MuJoCo, and 2D navigation task.

Getting started

To avoid any conflict with your existing Python setup, and to keep this project self-contained, it is suggested to work in a virtual environment with virtualenv. To install virtualenv:

pip install --upgrade virtualenv

Create a virtual environment, activate it and install the requirements in requirements.txt.

virtualenv venv
source venv/bin/activate
pip install -r requirements.txt

Usage

You can use the main.py script in order to run reinforcement learning experiments with MAML. This script was tested with Python 3.5. Note that some environments may also work with Python 2.7 (all experiments besides MuJoCo-based environments).

python main.py --env-name HalfCheetahDir-v1 --num-workers 8 --fast-lr 0.1 --max-kl 0.01 --fast-batch-size 20 --meta-batch-size 40 --num-layers 2 --hidden-size 100 --num-batches 1000 --gamma 0.99 --tau 1.0 --cg-damping 1e-5 --ls-max-steps 15 --output-folder maml-halfcheetah-dir --device cuda

References

This project is, for the most part, a reproduction of the original implementation cbfinn/maml_rl in Pytorch. These experiments are based on the paper

Chelsea Finn, Pieter Abbeel, and Sergey Levine. Model-agnostic meta-learning for fast adaptation of deep networks. International Conference on Machine Learning (ICML), 2017 [ArXiv]

If you want to cite this paper

@article{DBLP:journals/corr/FinnAL17,
  author    = {Chelsea Finn and Pieter Abbeel and Sergey Levine},
  title     = {Model-{A}gnostic {M}eta-{L}earning for {F}ast {A}daptation of {D}eep {N}etworks},
  journal   = {International Conference on Machine Learning (ICML)},
  year      = {2017},
  url       = {http://arxiv.org/abs/1703.03400}
}

GitHub - tristandeleu/pytorch-maml-rl: Reinforcement Learning with Model-Agnosti...

README.md

Reinforcement Learning with Model-Agnostic Meta-Learning (MAML)

Getting started

Usage

References

Recommend

人工智能的今天

Debian 9发布第五次更新

微软即将推出自有品牌安卓手机？

GuixSD 支持事务和回滚包管理方式的发行版

《Rapid GUI Programming with Python and Qt》pdf电子书免费下载

《Linux Bible (Ninth Edition)》pdf电子书免费下载

资源公社 - 总结了各类学习资源，包含网络 IT，办公类，设计制作等各行业的精品学习资...

从"宠儿"到"弃儿"，IBM Watson怎么了？

关键系统的 JVM 参数推荐（2018 仲夏版）

iOS图形处理概论：OpenGL ES，Metal，Core Graphics，Core Im... - 简书

About Joyk