Tobi Lutke 354744af53 Finetune 2.0: consolidate and simplify the entire training pipeline 3 달 전
..
grpo.yaml 354744af53 Finetune 2.0: consolidate and simplify the entire training pipeline 3 달 전
sft.yaml 354744af53 Finetune 2.0: consolidate and simplify the entire training pipeline 3 달 전