1 Star 0 Fork 0

陈狗翔 / REINFORCE-DDPG

加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
该仓库未声明开源许可证文件(LICENSE),使用请关注具体项目描述及其代码上游依赖。
克隆/下载
贡献代码
同步代码
取消
提示: 由于 Git 不支持空文件夾,创建文件夹后会生成空的 .keep 文件
Loading...
README

REINFORCE-DDPG

the implement of REINFORCE algorithm and DDPG algorithm in pytorch

all code is in one file and easily to follow

requirment

  • tensorboardX (for logging, you can delete the logging code if you don't need)
  • pytorch (>= 1.0, 1.0.1 used in my experiment)
  • gym

REINFORCE

only in CartPole-v0 environment, can not learn well in Pendulum-v0

DDPG

only in Pendulum-v0 for ddpg only suit for continuous task

Compare soft-update and target network update

in pendulum-v0

TD-3 version with 2 critic networks and soft update, soft update version is the one in ddpg original paper, hard update version is the one with the same target network update with DQN which is every C time hard update.

空文件

简介

the implement of REINFORCE algorithm and DDPG algorithm 展开 收起
Python
取消

发行版

暂无发行版

贡献者

全部

近期动态

加载更多
不能加载更多了
Python
1
https://gitee.com/ChenGouXiang/REINFORCE-DDPG.git
git@gitee.com:ChenGouXiang/REINFORCE-DDPG.git
ChenGouXiang
REINFORCE-DDPG
REINFORCE-DDPG
master

搜索帮助