This website works better with JavaScript
Home
Explore
Help
Register
Sign In
suby
/
qmd
Watch
1
Star
0
Fork
0
Files
Issues
0
Pull Requests
0
Wiki
Tree:
2ad507a86e
Branches
Tags
main
oivo
v2.1.0-upstream
v2.1.0
v2.0.1
v2.0.0
v1.1.6
v1.1.5
v1.1.2
v1.1.1
v1.0.7
v1.0.6
v1.0.5
v1.0.0
v0.9.0
Commit History
Find
Author
SHA1
Message
Date
Tobi Lutke
2ad507a86e
Add chat template leakage detection to reward function
4 months ago
Tobi Lutke
6062dc769f
Add named entity extraction to GRPO reward function
4 months ago
Tobi Lutke
32706a720f
Refactor finetune folder: train/rl scripts with YAML configs
4 months ago