Actor-Critic and OpenAI clipped PPO in the Gym CartPole-v0 and Pendulum-v0 environments
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Solved lab problems, slides, and notes from the Deep Reinforcement Learning Bootcamp 2017 held at UC Berkeley
A summary of home-buying knowledge drawn from my experience buying a house in Hangzhou in 2017, shared in the hope that it helps others. Buying a home is not easy, so cherish the one you buy.
Notes and lecture slides from UC Berkeley's ongoing Deep RL course
A simple PyTorch implementation of discrete soft Q-learning (SQL) and soft Q imitation learning (SQIL)