Quickly learn policies for continuous control in sparse reward environments
-
Updated
Feb 17, 2021 - Python
Quickly learn policies for continuous control in sparse reward environments
Code for some fun exercises in the textbook 'Reinforcement Learning - An Introduction'
Deep Q-Network, Actor-critic , Policy gradient implementation in python
Example A2C implementation with ReLAx
My reports for the reinforcement learning class given at the ENS
Implementing Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". using TensorFlow
policy gradient for pong
[Reinforcement Learning, forked from Stable-baselines3] Étude des performances des algorithmes de Reinforcement Learning sur Pendulum
Programming Assignments for Reinforcement Learning Specialization
Scheduling TRPO's KL Divergence Constraint
This repository provides an implementation of Othello game playing agents trained using reinforcement learning techniques.
Deep Q-Learning Networks vs. Policy Gradient Learning in OpenAI Gym's Pong Environment
Example TRPO implementation with ReLAx
Example PPO implementation with ReLAx
Deep Q network and Policy gradient reinforcement learning alogrithms to play pacman
Ben Gurion University "Deep Reinforcement Learning (372.2.5910)" course assignments & solutions
Pytorch implementation of Deep Deterministic Policy Gradients (DDPG)
A collection of RL algorithms in PyTorch
The homework for Cutting-Edge of Deep Learning, aka CEDL, from NTHU
Add a description, image, and links to the policy-gradient topic page so that developers can more easily learn about it.
To associate your repository with the policy-gradient topic, visit your repo's landing page and select "manage topics."