Tobi Lütke 189916d6fb Move GRPO training out of default finetune pipeline 3 months ago
..
accelerate_multi_gpu.yaml bf1b8fc90a lots of training stuff 3 months ago
sft.yaml cbeeb1f89b Add wall-clock checkpoints and full eval defaults 3 months ago
sft_local.yaml cbeeb1f89b Add wall-clock checkpoints and full eval defaults 3 months ago