Tobi Lutke 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion 3 月之前
..
train 32706a720f Refactor finetune folder: train/rl scripts with YAML configs 4 月之前
train_v2 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion 3 月之前
qmd_expansion_v2.jsonl 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion 3 月之前