Popular repositories Loading
-
ddvi
ddvi PublicForked from arakhsha/ddvi
Official Code for the paper: Deflated Dynamics Value Iteratioon
Python 1
-
reppo
reppo PublicForked from cvoelcker/reppo
[Adage Lab version] Official Code for "Relative Entropy Pathwise Policy Optimization"
Python 1
-
vagram
vagram PublicForked from pairlab/vagram
[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
Python
Repositories
- reppo Public Forked from cvoelcker/reppo
[Adage Lab version] Official Code for "Relative Entropy Pathwise Policy Optimization"
adaptive-agents-lab/reppo’s past year of commit activity - CVAML Public
Official code for the paper "Calibrated Value-Aware Model Learning with Stochastic Environment Models
adaptive-agents-lab/CVAML’s past year of commit activity - pandas Public Forked from averyma/pandas
[ICML'25] PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
adaptive-agents-lab/pandas’s past year of commit activity - MAD-TD Public
Code for "MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL", ICLR 2024, Voelcker et a.l/
adaptive-agents-lab/MAD-TD’s past year of commit activity - ddvi Public Forked from arakhsha/ddvi
Official Code for the paper: Deflated Dynamics Value Iteratioon
adaptive-agents-lab/ddvi’s past year of commit activity - mdot_tnt Public Forked from metekemertas/mdot_tnt
Official code repository for the MDOT-TNT algorithm for discrete optimal transport.
adaptive-agents-lab/mdot_tnt’s past year of commit activity - understanding_auxiliary_tasks Public
The official code for the paper "When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning" published at RLC2024 [https://rlj.cs.umass.edu/2024/papers/Paper197.html]
adaptive-agents-lab/understanding_auxiliary_tasks’s past year of commit activity - opt-robust Public Forked from averyma/opt-robust
[TMLR] Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods.
adaptive-agents-lab/opt-robust’s past year of commit activity - model-alignment Public Forked from averyma/model-alignment
[ECCV'24] Improving Adversarial Transferability via Model Alignment
adaptive-agents-lab/model-alignment’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…