Tobi Lütke 189916d6fb Move GRPO training out of default finetune pipeline 3 ماه پیش
..
gepa 1d7d167b29 finetune: strict Pydantic schema, one canonical data format 3 ماه پیش
grpo 189916d6fb Move GRPO training out of default finetune pipeline 3 ماه پیش
lfm2 1d7d167b29 finetune: strict Pydantic schema, one canonical data format 3 ماه پیش