suby/qmd
Tree: 3ea85eff50
Commit history
Author | SHA1 | Message | Date
Tobi Lutke | 891f3262cf | Fix GRPO reward function to handle think blocks and end tokens | 4 months ago
Tobi Lutke | 8a1c4cdab0 | Add 1.7B and 4B GRPO training and GGUF conversion scripts | 4 months ago