Skip to content

Non-record: Cosine LR Schedule — -0.070 BPB improvement + Focal Loss Investigation (corrected)#1380

Open
ranausmanai wants to merge 5 commits intoopenai:mainfrom
ranausmanai:focal-loss-lm-pretraining
Open

Non-record: Cosine LR Schedule — -0.070 BPB improvement + Focal Loss Investigation (corrected)#1380
ranausmanai wants to merge 5 commits intoopenai:mainfrom
ranausmanai:focal-loss-lm-pretraining

Commits