This website works better with JavaScript
Home
Verkennen
Help
Registreren
Inloggen
suby
/
qmd
Volgen
1
Ster
0
Vork
0
Bestanden
Issues
0
Pull-aanvragen
0
Wiki
Boom:
2ad507a86e
Aftakkingen
Labels
main
oivo
v2.1.0-upstream
v2.1.0
v2.0.1
v2.0.0
v1.1.6
v1.1.5
v1.1.2
v1.1.1
v1.0.7
v1.0.6
v1.0.5
v1.0.0
v0.9.0
Commit History
zoek
Auteur
SHA1
Bericht
Datum
Tobi Lutke
2ad507a86e
Add chat template leakage detection to reward function
4 maanden geleden
Tobi Lutke
6062dc769f
Add named entity extraction to GRPO reward function
4 maanden geleden
Tobi Lutke
32706a720f
Refactor finetune folder: train/rl scripts with YAML configs
4 maanden geleden