This website works better with JavaScript
Home
Verkennen
Help
Registreren
Inloggen
suby
/
qmd
Volgen
1
Ster
0
Vork
0
Bestanden
Issues
0
Pull-aanvragen
0
Wiki
Boom:
3ea85eff50
Aftakkingen
Labels
main
oivo
v2.1.0-upstream
v2.1.0
v2.0.1
v2.0.0
v1.1.6
v1.1.5
v1.1.2
v1.1.1
v1.0.7
v1.0.6
v1.0.5
v1.0.0
v0.9.0
Commit History
zoek
Auteur
SHA1
Bericht
Datum
Tobi Lutke
891f3262cf
Fix GRPO reward function to handle think blocks and end tokens
4 maanden geleden
Tobi Lutke
8a1c4cdab0
Add 1.7B and 4B GRPO training and GGUF conversion scripts
4 maanden geleden