Historique des commits

Auteur SHA1 Message Date
  Tobi Lutke 785620467a refactor: reorder output format to put hyde line first il y a 3 mois
  Tobi Lutke 6062dc769f Add named entity extraction to GRPO reward function il y a 4 mois
  Tobi Lutke 32706a720f Refactor finetune folder: train/rl scripts with YAML configs il y a 4 mois
  Tobi Lutke c35dbd6cbd Add comprehensive scoring system for query expansion il y a 4 mois