Tobi Lütke 189916d6fb Move GRPO training out of default finetune pipeline 3 months ago
accelerate_multi_gpu.yaml bf1b8fc90a lots of training stuff 3 months ago
sft.yaml cbeeb1f89b Add wall-clock checkpoints and full eval defaults 3 months ago
sft_local.yaml cbeeb1f89b Add wall-clock checkpoints and full eval defaults 3 months ago