加入 Gitee

与超过 1200万开发者一起发现、参与优秀开源项目，私有仓库也完全免费：）

免费加入

该仓库未声明开源许可证文件（LICENSE），使用请关注具体项目描述及其代码上游依赖。

克隆/下载

README.md 876 Bytes

一键复制编辑原始数据按行查看历史

提交于 2019-10-31 17:37 . update readme

ac-ppo

pytorch implementation Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment

introduction

implement A2C and PPO in pytorch

requirement

tensorflow (for tensorboard logging)
pytorch (>=1.0, 1.0.1 used in my experiment)
gym

a2c

a2c in cartpole and pendulum, the training result shows below

a2c-cartpole a2c.py result of a2c in cartpole-v0

a2c-pendulum a2c_pen.py result of a2c in pendulum-v0, it's quite hard for a2c converge in pendulum..

ppo

ppo-pendulum PPO.py result of ppo in pendulum-v0, somehow still hard to converge..don't know why, any one helps?

ppo improved

ppo-modified PPO_advantage.py more efficient update with generalized advantage estimator (GAE)

Python

https://gitee.com/ChenGouXiang/ac-ppo.git

git@gitee.com:ChenGouXiang/ac-ppo.git

ChenGouXiang

ac-ppo

master

陈狗翔 / ac-ppo

ac-ppo

introduction

requirement

a2c

ppo

ppo improved

简介

发行版

贡献者

近期动态

陈狗翔 / ac-ppo .gitee-modal { width: 500px !important; }

ac-ppo

introduction

requirement

a2c

ppo

ppo improved

简介

发行版

贡献者

近期动态

搜索帮助

陈狗翔 / ac-ppo