Historial de Commits

Autor SHA1 Mensaje Fecha
  Tobi Lütke bf1b8fc90a lots of training stuff hace 3 meses
  Tobi Lutke 7de18ee066 Merge main into finetune hace 3 meses
  Tobi Lutke 785620467a refactor: reorder output format to put hyde line first hace 3 meses
  Tobi Lütke 46ff098361 Change only: format to only:lex (no space after colon) hace 3 meses
  Tobias Lütke eb1b77c8cb Deploy fine-tuned GRPO model as default query expansion (#67) hace 3 meses
  Tobi Lutke 32706a720f Refactor finetune folder: train/rl scripts with YAML configs hace 4 meses