陈狗翔 / ac-ppo

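This repository does not declare an open-source license file (LICENSE); before using it, check the project description and the upstream dependencies of its code.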

ac-ppo

PyTorch implementation of Actor-Critic (A2C) and OpenAI's clipped PPO in the Gym CartPole-v0 and Pendulum-v0 environments.

introduction

This repository implements A2C and PPO in PyTorch.

requirements

  • tensorflow (for tensorboard logging)
  • pytorch (>=1.0; 1.0.1 was used in my experiments)
  • gym

a2c

A2C in CartPole and Pendulum; the training results are shown below.

a2c.py: result of A2C in CartPole-v0 (figure: a2c-cartpole)

a2c_pen.py: result of A2C in Pendulum-v0 (figure: a2c-pendulum). It is quite hard for A2C to converge in Pendulum.
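
For readers unfamiliar with the method, the following is a minimal sketch of the per-transition losses commonly used in one-step advantage actor-critic. It is not taken from a2c.py or a2c_pen.py: the function name `a2c_losses`, its parameters, and the use of plain Python floats instead of PyTorch tensors are all assumptions made for illustration.

```python
import math


def a2c_losses(log_prob, reward, value, next_value, done, gamma=0.99):
    """Return (actor_loss, critic_loss) for one transition (hypothetical helper).

    log_prob   : log pi(a|s) of the action that was taken
    value      : critic estimate V(s)
    next_value : critic estimate V(s')
    done       : True if s' is terminal
    """
    # One-step TD target; do not bootstrap past a terminal state.
    not_done = 0.0 if done else 1.0
    target = reward + gamma * next_value * not_done
    # Advantage estimate: how much better the outcome was than the critic expected.
    advantage = target - value
    # Actor: raise the log-probability of actions with positive advantage.
    # In an autograd framework the advantage would be detached here.
    actor_loss = -log_prob * advantage
    # Critic: squared TD error.
    critic_loss = advantage ** 2
    return actor_loss, critic_loss


actor_loss, critic_loss = a2c_losses(
    log_prob=math.log(0.5), reward=1.0, value=0.5, next_value=0.6,
    done=False, gamma=0.9,
)
```

In a PyTorch version the same arithmetic would typically be applied to batched tensors, with the two losses summed (often with a weighting coefficient) before calling the optimizer.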

ppo

PPO.py: result of PPO in Pendulum-v0 (figure: ppo-pendulum). Training still struggles to converge and I do not know why; any help is welcome.
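
As a reference point for the clipped objective named above, here is a minimal sketch of the PPO clipped surrogate loss in plain Python. It does not reproduce PPO.py: the function names, the `clip_eps=0.2` default, and the list-based inputs are assumptions chosen for illustration.

```python
import math


def clipped_surrogate(new_log_prob, old_log_prob, advantage, clip_eps=0.2):
    """Clipped surrogate objective for one sample (hypothetical helper)."""
    # Probability ratio pi_new(a|s) / pi_old(a|s), computed from log-probabilities.
    ratio = math.exp(new_log_prob - old_log_prob)
    # Restrict the ratio to [1 - eps, 1 + eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Take the pessimistic (smaller) of the unclipped and clipped objectives.
    return min(ratio * advantage, clipped_ratio * advantage)


def ppo_policy_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Negative mean clipped surrogate over a batch, suitable for minimization."""
    terms = [
        clipped_surrogate(n, o, a, clip_eps)
        for n, o, a in zip(new_log_probs, old_log_probs, advantages)
    ]
    return -sum(terms) / len(terms)


# Ratio 1.5 with a positive advantage is clipped at 1.2;
# ratio 0.5 with a negative advantage is clipped at 0.8.
loss = ppo_policy_loss(
    new_log_probs=[math.log(1.5), math.log(0.5)],
    old_log_probs=[0.0, 0.0],
    advantages=[1.0, -1.0],
)
```

The `min` makes the objective a lower bound on the unclipped one: once the ratio moves past the clip range in the direction the advantage favors, the sample stops contributing gradient.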

ppo improved

PPO_advantage.py: a more efficient update using the generalized advantage estimator (GAE) (figure: ppo-modified)
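
To make the GAE step concrete, the sketch below computes generalized advantage estimates over one rollout with a backward pass. It is an illustration rather than the code in PPO_advantage.py: the name `compute_gae`, the `gamma` and `lam` defaults, and the list-based inputs are assumptions.

```python
def compute_gae(rewards, values, last_value, dones, gamma=0.99, lam=0.95):
    """Return (advantages, returns) for one rollout (hypothetical helper).

    rewards[t], values[t], dones[t] describe step t; values[t] is V(s_t),
    dones[t] is True if the episode ended after step t, and last_value is
    the critic's estimate for the state following the final step.
    """
    advantages = [0.0] * len(rewards)
    gae = 0.0
    next_value = last_value
    # Walk backwards so each step can reuse the accumulated future advantage.
    for t in reversed(range(len(rewards))):
        not_done = 0.0 if dones[t] else 1.0
        # TD residual: delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
        delta = rewards[t] + gamma * next_value * not_done - values[t]
        # Exponentially weighted sum of residuals, reset at episode boundaries.
        gae = delta + gamma * lam * not_done * gae
        advantages[t] = gae
        next_value = values[t]
    # Value targets for the critic: advantage plus the baseline.
    returns = [a + v for a, v in zip(advantages, values)]
    return advantages, returns


advantages, returns = compute_gae(
    rewards=[1.0, 1.0], values=[0.5, 0.5], last_value=0.5,
    dones=[False, False], gamma=0.5, lam=0.5,
)
```

With `lam=0` this reduces to the one-step TD advantage, and with `lam=1` it becomes the discounted return minus the value baseline; intermediate values trade bias against variance.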
