Tobi Lütke 189916d6fb Move GRPO training out of default finetune pipeline 3 months ago
..
eval.py 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion 3 months ago
eval_common.py 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion 3 months ago
sft.py 739038e1a7 docs: add explicit HuggingFace repo destinations 3 months ago