Commit Verlauf

Autor SHA1 Nachricht Datum
  Tobi Lutke 2648512b7c Fix TUI to load GRPO models with SFT base first vor 4 Monaten
  Tobi Lutke 32706a720f Refactor finetune folder: train/rl scripts with YAML configs vor 4 Monaten