araffin added and removed the "Maintainers on vacation" label on Jan 4, 2024
🚀 Feature
Implement the STAC (Self-Tuning Actor-Critic) algorithm as a callable algorithm: https://arxiv.org/pdf/2002.12928.pdf
Motivation
Hyperparameter tuning is one of the most time-consuming and costly parts of training RL agents. This implementation might save some people time and money, and it could be the first actor-critic (AC) algorithm here that uses meta-gradients, giving a starting point for further improvements.
Pitch
I would like someone to guide me on where to start, or to give me some key insights into the possible ways of coding this.
Alternatives
The alternative is that someone codes it on their own.
Additional context
No response
Checklist
I have checked that there is no similar issue in the repo
If I'm requesting a new feature, I have proposed alternatives