This website works better with JavaScript
Inicio
Explorar
Axuda
Rexistro
Iniciar sesión
suby
/
qmd
Seguir
1
Destacar
0
Fork
0
Ficheiros
Incidencias
0
Pull Requests
0
Wiki
Árbore:
2ae1baba2f
Ramas
Etiquetas
main
oivo
v2.1.0-upstream
v2.1.0
v2.0.1
v2.0.0
v1.1.6
v1.1.5
v1.1.2
v1.1.1
v1.0.7
v1.0.6
v1.0.5
v1.0.0
v0.9.0
Commit History
Buscar
Autor
SHA1
Mensaxe
Data
Tobi Lütke
189916d6fb
Move GRPO training out of default finetune pipeline
hai 3 meses
Tobi Lutke
599935754b
finetune: remove orphaned files and abandoned experiments
hai 3 meses
Tobi Lütke
102ff861d3
fix: use Qwen3 recommended sampling params to prevent repetition loops
hai 3 meses
Tobi Lütke
bf1b8fc90a
lots of training stuff
hai 3 meses