Tobi Lutke 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion 3 months ago
..
grpo.yaml 354744af53 Finetune 2.0: consolidate and simplify the entire training pipeline 3 months ago
sft.yaml 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion 3 months ago