Cartpole Environment using REINFORCE

Date

May 12, 2019

Contributor

Ashutosh Tiwari

Project Link

https://github.com/ashutoshtiwari13/Hands-on-DeepRL-and-DL

Project Heads-up

REINFORCE algorithm is based on finding the local maximum of a function using a procedure known as gradient ascent.This class implements the simple Convolution Neuron Network (CNN) model containing only 2 fully-connected levels. In this CNN model, the function reinforce() approximizes the return value (= sum of all rewards with discounts). The environment is solved in 791 episodes!

Cartpole Environment using REINFORCE

Date

Contributor

Categories

Project Link

Project Heads-up

Click 👉 for Project Details

Want to get in touch 🤝 ?

Drop a Hi 😃