Histórico de Commits

Autor SHA1 Mensagem Data
  Tobi Lütke d6f3688d91 Remove grpo command from default train entrypoint há 3 meses atrás
  Tobi Lütke 189916d6fb Move GRPO training out of default finetune pipeline há 3 meses atrás
  Tobi Lütke cbeeb1f89b Add wall-clock checkpoints and full eval defaults há 3 meses atrás
  Tobi Lütke bf1b8fc90a lots of training stuff há 3 meses atrás
  Tobi Lutke 38073799c0 chore: clean up finetune folder and fix training workflow há 3 meses atrás
  Tobi Lütke 46ff098361 Change only: format to only:lex (no space after colon) há 3 meses atrás
  Tobias Lütke eb1b77c8cb Deploy fine-tuned GRPO model as default query expansion (#67) há 3 meses atrás