Commit History

作者 SHA1 備註 提交日期
  Tobi Lutke 7de18ee066 Merge main into finetune 3 月之前
  Tobi Lutke 785620467a refactor: reorder output format to put hyde line first 3 月之前
  Tobi Lütke 46ff098361 Change only: format to only:lex (no space after colon) 3 月之前
  Tobias Lütke eb1b77c8cb Deploy fine-tuned GRPO model as default query expansion (#67) 3 月之前
  Tobi Lutke 32706a720f Refactor finetune folder: train/rl scripts with YAML configs 4 月之前