Histórico de Commits

Autor SHA1 Mensagem Data
  Tobi Lütke d6f3688d91 Remove grpo command from default train entrypoint há 3 meses atrás
  Tobi Lütke 189916d6fb Move GRPO training out of default finetune pipeline há 3 meses atrás
  Tobi Lutke 1d7d167b29 finetune: strict Pydantic schema, one canonical data format há 3 meses atrás
  Tobi Lütke 57f7caa93b feat: add LiquidAI LFM2 support for query expansion há 3 meses atrás
  Tobi Lutke 785620467a refactor: reorder output format to put hyde line first há 3 meses atrás
  Tobi Lutke 5cf4958bfa Add HuggingFace model card YAML metadata to finetune README há 3 meses atrás
  Tobi Lutke 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion há 3 meses atrás
  Tobi Lutke 5ab78d00a2 Add HF Jobs scripts, temporal query examples, and training results há 3 meses atrás
  Tobi Lutke 354744af53 Finetune 2.0: consolidate and simplify the entire training pipeline há 3 meses atrás
  Tobi Lutke b9b1b39a76 Update README with separate model repos há 4 meses atrás
  Tobi Lutke 312c281109 Update README for unified model repository structure há 4 meses atrás
  Tobi Lutke f96766cce8 Fix GRPO model loading to use SFT base first há 4 meses atrás
  Tobi Lutke f6a6716c44 Refactor evals into separate run and score scripts há 4 meses atrás
  Tobi Lutke 6062dc769f Add named entity extraction to GRPO reward function há 4 meses atrás
  Tobi Lutke 994a094546 Update README with final evaluation results há 4 meses atrás
  Tobi Lutke 7cca164dd9 Add query expansion model finetuning infrastructure há 4 meses atrás