This website works better with JavaScript
Home
Verkennen
Help
Registreren
Inloggen
suby
/
qmd
Volgen
1
Ster
0
Vork
0
Bestanden
Issues
0
Pull-aanvragen
0
Wiki
Boom:
v1.1.5
Aftakkingen
Labels
main
oivo
v2.1.0-upstream
v2.1.0
v2.0.1
v2.0.0
v1.1.6
v1.1.5
v1.1.2
v1.1.1
v1.0.7
v1.0.6
v1.0.5
v1.0.0
v0.9.0
Commit History
zoek
Auteur
SHA1
Bericht
Datum
Tobi Lütke
189916d6fb
Move GRPO training out of default finetune pipeline
3 maanden geleden
Tobi Lutke
599935754b
finetune: remove orphaned files and abandoned experiments
3 maanden geleden
Tobi Lütke
102ff861d3
fix: use Qwen3 recommended sampling params to prevent repetition loops
3 maanden geleden
Tobi Lütke
bf1b8fc90a
lots of training stuff
3 maanden geleden