Skip to content

Added example training scripts for localsgd, DiLoCo, Live Checkpoint Recovery, and proactive failure detection with DDP (#198)#200

Open
WarrenZhu050413 wants to merge 2 commits intometa-pytorch:mainfrom
WarrenZhu050413:torchft_examples