This website works better with JavaScript
Home
Esplora
Aiuto
Registrati
Accedi
suby
/
qmd
Segui
1
Vota
0
Forka
0
File
Problemi
0
Pull Requests
0
Wiki
Albero (Tree):
2ad507a86e
Rami (Branch)
Tag
main
oivo
v2.1.0-upstream
v2.1.0
v2.0.1
v2.0.0
v1.1.6
v1.1.5
v1.1.2
v1.1.1
v1.0.7
v1.0.6
v1.0.5
v1.0.0
v0.9.0
Cronologia Commit
Cerca
Autore
SHA1
Messaggio
Data
Tobi Lutke
2ad507a86e
Add chat template leakage detection to reward function
4 mesi fa
Tobi Lutke
6062dc769f
Add named entity extraction to GRPO reward function
4 mesi fa
Tobi Lutke
32706a720f
Refactor finetune folder: train/rl scripts with YAML configs
4 mesi fa