Quantum error correction code AI-discovery with Jax
Updated Jul 1, 2024 - Jupyter Notebook
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Official PyTorch implementation of ExpGen (NeurIPS'23).
Engineer-To-Order (ETO) Graph Neural Scheduling (GNS) Project
Deep Reinforcement Learning in C#
Numerical Evidence for Sample Efficiency of Model-Based over Model-Free Reinforcement Learning Control of Partial Differential Equations [ECC'24]
This repository contains the implementation of a transformer-based model combined with a Proximal Policy Optimization (PPO) model to generate trade recommendations. The project leverages the predictive capabilities of transformers for price forecasting and the strategic decision-making of reinforcement learning.
Nokia's classic 'snake' game, written in NumPy and converted into a Gymnasium Environment() for use with gradient-based reinforcement learning algorithms
A Torch Based RL Framework for Rapid Prototyping of Research Papers
Exploring Generalization in Deep Reinforcement Learning algorithms for different tasks using Gymnasium, Gymnasium-Robotics and MuJoCo
Hybrid Transformer-based Multi-agent Reinforcement Learning (HTransRL) targets drone coordination in air corridors, addressing the challenge of state inputs with dynamic dimensions and types, which traditional MARL cannot handle.
stable-baselines with JAX & Haiku
Clean baseline implementation of PPO using an episodic TransformerXL memory
Code for "Optimizing ZX-Diagrams with Deep Reinforcement Learning"
JAX Implementation of Proximal Policy Optimisation Algorithm
Snake game environment integrated with OpenAI Gym. Proximal Policy Optimization (PPO) implementation for training. Visualization of training progress and agent performance. Easy-to-understand code.
✨ Solve the multi-dimensional multiple knapsack problem using state-of-the-art Reinforcement Learning algorithms and transformers
Baseline implementation of recurrent PPO using truncated BPTT
Evaluating the impact of curriculum learning on the training process for an intelligent agent in a video game
Basic 2D car environment trained using reinforcement learning within the Stable Baselines 3 framework