Integrate schedule free AdamW #18

Open
JulienMaille opened this issue Apr 5, 2024 · 1 comment
Comments

@JulienMaille

Have you seen this optimizer? Has anyone given it a try? It seems a bit less straightforward to integrate into torchseg, since most of our decoders use BatchNorm.
https://github.com/facebookresearch/schedule_free?tab=readme-ov-file
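For context on why BatchNorm complicates things: as far as I understand the schedule_free README, the optimizer has to be switched between `optimizer.train()` and `optimizer.eval()`, and BatchNorm running statistics should be refreshed at the eval-time weights right before validation. A minimal sketch of that step follows; the helper name `recalibrate_batchnorm` and the default of 50 batches are illustrative assumptions, not part of torchseg or schedule_free.

```python
import itertools

import torch
from torch import nn


def recalibrate_batchnorm(model, optimizer, batches, num_batches=50):
    """Refresh BatchNorm running stats at the optimizer's eval-time weights.

    Hypothetical helper. `optimizer` is assumed to expose train()/eval(),
    as the schedule-free optimizers do; `batches` yields input tensors.
    """
    model.train()      # BatchNorm only updates running stats in train mode
    optimizer.eval()   # switch the parameters to their evaluation values
    with torch.no_grad():  # forward passes only, no gradients needed
        for x in itertools.islice(batches, num_batches):
            model(x)
    model.eval()       # ready for validation
```

This would be called right before each validation pass. Afterwards, both `model.train()` and `optimizer.train()` need to be called again before training resumes.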

JulienMaille closed this as not planned Apr 6, 2024
JulienMaille reopened this Apr 6, 2024
@notprime

notprime commented Apr 6, 2024

Hi @JulienMaille ,

I'll look into this as soon as I get back home, or maybe @isaaccorley can take a look in the meantime.
Btw, you're right. With timm as the backbone for the encoders you can now select a specific norm layer, but the decoders still support only BatchNorm.

We already had the idea of supporting different norm layers for the decoders too; we just have to think about the best way to do it, because different norm layers take different parameters (as briefly outlined here).
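One possible shape for this, purely as a sketch: a small registry that maps a norm name to a builder, with per-norm keyword arguments passed through. None of these names (`NORM_BUILDERS`, `make_norm`, `ConvNormReLU`) exist in torchseg; they are placeholders, and the default of 8 groups for GroupNorm is an arbitrary choice.

```python
from torch import nn

# Hypothetical registry: norm name -> builder taking the channel count
# plus whatever keyword arguments that particular norm layer needs.
NORM_BUILDERS = {
    "batchnorm": lambda ch, **kw: nn.BatchNorm2d(ch, **kw),
    "groupnorm": lambda ch, num_groups=8, **kw: nn.GroupNorm(num_groups, ch, **kw),
    "instancenorm": lambda ch, **kw: nn.InstanceNorm2d(ch, **kw),
    "identity": lambda ch, **kw: nn.Identity(),
}


def make_norm(name, channels, **kwargs):
    """Build a 2D norm layer by name, forwarding norm-specific kwargs."""
    try:
        builder = NORM_BUILDERS[name]
    except KeyError:
        raise ValueError(f"unknown norm layer: {name!r}") from None
    return builder(channels, **kwargs)


class ConvNormReLU(nn.Sequential):
    """Conv -> norm -> ReLU block with a selectable norm layer."""

    def __init__(self, in_ch, out_ch, norm="batchnorm", norm_kwargs=None):
        super().__init__(
            # Only use a conv bias when no norm layer follows it
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1,
                      bias=(norm == "identity")),
            make_norm(norm, out_ch, **(norm_kwargs or {})),
            nn.ReLU(inplace=True),
        )
```

For example, `ConvNormReLU(64, 32, norm="groupnorm", norm_kwargs={"num_groups": 4})` would give a decoder block without BatchNorm running statistics, which should sidestep the recalibration step a schedule-free optimizer otherwise needs.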

Once we implement this functionality, integrating schedule-free AdamW should be easy.
