temporal-difference-algorithms

Star

Here are 11 public repositories matching this topic...

JayLohokare / temporal-difference-implementation

Star

Implementation of various temporal difference algorithms for OpenAI Cliff walking (Gridworld Cliff)

qlearning sarsa-learning temporal-difference-algorithms gridworld-cliff

Updated Oct 29, 2018
Jupyter Notebook

katnoria / td-methods

Star

Notebooks covering temporal difference methods using OpenAI Gym

reinforcement-learning gym temporal-difference-algorithms reinforcem td-methods

Updated Apr 17, 2019
Jupyter Notebook

batra98 / Monte_Carlo-TD-Function_Approximation

Star

Use Monte-Carlo (MC) Methods and Temporal Difference (TD) Learning on couple of games and toy problems.

reinforcement-learning-algorithms monte-carlo-methods temporal-difference-algorithms

Updated Jul 15, 2020
Jupyter Notebook

ashutoshtiwari13 / Demystifying-Deep-Reinforcement-Learning

Star

⚡️ Code and Notes 📝 for Grokking Deep RL and RL: An Introduction by Sutton & Barto(2nd edition, 2018) 🤘

reinforcement-learning deep-reinforcement-learning reinforcement-learning-algorithms grokking-algorithms monte-carlo-methods deep-rl sutton-book temporal-difference-algorithms sutton-barto-book

Updated Jul 25, 2020
Jupyter Notebook

hmohebbi / MDP_TabularMethods

Star

The implementation of tabular solution methods in Reinformcement Learning, Sutton's book: Part I

reinforcement-learning monte-carlo-methods temporal-difference-algorithms tabular-methods

Updated Feb 7, 2021
Jupyter Notebook

PieroMacaluso / reinforcement-learning-stuff

Star

Just a bunch of exercises created during my thesis work working on Reinforcement Learning.

reinforcement-learning openai-gym temporal-difference reinforcement-learning-environments temporal-difference-algorithms reinforcement-learning-exercises

Updated Dec 8, 2022
Python

arya-ebrahimi / reinforcement-learning-spring2023

Star

This repository contains my solutions for homeworks and exercises of Reinforcement Learning course at the Ferdowsi University of Mashhad, Spring 2023

reinforcement-learning q-learning policy-gradient sarsa monte-carlo-tree-search multi-armed-bandit q-learning-vs-sarsa temporal-difference-algorithms monte-carlo-control

Updated Jul 12, 2023
Jupyter Notebook

arya-ebrahimi / rl-playground

Star

tabular and deep rl algorithms

reinforcement-learning pytorch dynamic-programming temporal-differencing-learning temporal-difference-algorithms dqn-pytorch ddpg-pytorch ppo-pytorch sac-pytorch td3-pytorch

Updated Oct 12, 2023
Jupyter Notebook

VEXLife / Accelerated-TD

Star

My Implementation of the Accelerated Gradient Temporal Difference Learning algorithm in Python

reinforcement-learning reinforcement-learning-algorithms td atd random-walk temporal-differencing-learning temporal-difference temporal-difference-algorithms temporal-difference-learning accelerated-td

Updated Jan 25, 2024
Python

kkm24132 / ReinforcementLearning

Star

Focuses on Reinforcement Learning related concepts, use cases, and learning approaches

reinforcement-learning q-learning policy-gradient sarsa multi-armed-bandits montecarlo linear-function-approximation exploration-exploitation temporal-difference-algorithms

Updated Jun 5, 2024
Jupyter Notebook

moporgic / TDL2048

Star

The Most Efficient Temporal Difference Learning Framework for 2048

machine-learning framework machine-learning-algorithms 2048 2048-game 2048-ai temporal-difference-algorithms 2048-solver temporal-difference-learning n-tuple-networks

Updated Sep 4, 2024
C++

Improve this page

Add a description, image, and links to the temporal-difference-algorithms topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the temporal-difference-algorithms topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

temporal-difference-algorithms

Here are 11 public repositories matching this topic...

JayLohokare / temporal-difference-implementation

katnoria / td-methods

batra98 / Monte_Carlo-TD-Function_Approximation

ashutoshtiwari13 / Demystifying-Deep-Reinforcement-Learning

hmohebbi / MDP_TabularMethods

PieroMacaluso / reinforcement-learning-stuff

arya-ebrahimi / reinforcement-learning-spring2023

arya-ebrahimi / rl-playground

VEXLife / Accelerated-TD

kkm24132 / ReinforcementLearning

moporgic / TDL2048

Improve this page

Add this topic to your repo