[RL] Fix loss: use global token normalization instead of per-example#2376
Merged
AlienKevin merged 3 commits intomainfrom Jan 19, 2026
Merged
[RL] Fix loss: use global token normalization instead of per-example#2376AlienKevin merged 3 commits intomainfrom
AlienKevin merged 3 commits intomainfrom