Tobi Lütke 189916d6fb Move GRPO training out of default finetune pipeline hai 3 meses
..
eval.py 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion hai 3 meses
eval_common.py 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion hai 3 meses
sft.py 739038e1a7 docs: add explicit HuggingFace repo destinations hai 3 meses