Historique des commits

Auteur SHA1 Message Date
  Tobi Lütke bf1b8fc90a lots of training stuff il y a 3 mois
  Tobi Lutke 7de18ee066 Merge main into finetune il y a 3 mois
  Tobi Lutke 785620467a refactor: reorder output format to put hyde line first il y a 3 mois
  Tobi Lütke 46ff098361 Change only: format to only:lex (no space after colon) il y a 3 mois
  Tobias Lütke eb1b77c8cb Deploy fine-tuned GRPO model as default query expansion (#67) il y a 3 mois
  Tobi Lutke 32706a720f Refactor finetune folder: train/rl scripts with YAML configs il y a 4 mois