Commit History

Author SHA1 Message Date
  Tobi Lütke d6f3688d91 Remove grpo command from default train entrypoint 3 months ago
  Tobi Lütke 189916d6fb Move GRPO training out of default finetune pipeline 3 months ago
  Tobi Lutke 1d7d167b29 finetune: strict Pydantic schema, one canonical data format 3 months ago
  Tobi Lutke 739038e1a7 docs: add explicit HuggingFace repo destinations 3 months ago
  Tobi Lutke 38073799c0 chore: clean up finetune folder and fix training workflow 3 months ago
  Tobi Lutke 533f0eed37 docs: add finetune CLAUDE.md and update training workflow 3 months ago