Historique des commits

Auteur SHA1 Message Date
  Tobi Lutke b9b1b39a76 Update README with separate model repos il y a 4 mois
  Tobi Lutke 312c281109 Update README for unified model repository structure il y a 4 mois
  Tobi Lutke f96766cce8 Fix GRPO model loading to use SFT base first il y a 4 mois
  Tobi Lutke f6a6716c44 Refactor evals into separate run and score scripts il y a 4 mois
  Tobi Lutke 6062dc769f Add named entity extraction to GRPO reward function il y a 4 mois
  Tobi Lutke 994a094546 Update README with final evaluation results il y a 4 mois
  Tobi Lutke 7cca164dd9 Add query expansion model finetuning infrastructure il y a 4 mois