This website works better with JavaScript
Home
Explore
Help
Register
Sign In
suby
/
qmd
Watch
1
Star
0
Fork
0
Files
Issues
0
Pull Requests
0
Wiki
Tree:
3ea85eff50
Branches
Tags
main
oivo
v2.1.0-upstream
v2.1.0
v2.0.1
v2.0.0
v1.1.6
v1.1.5
v1.1.2
v1.1.1
v1.0.7
v1.0.6
v1.0.5
v1.0.0
v0.9.0
Commit History
Find
Author
SHA1
Message
Date
Tobi Lutke
891f3262cf
Fix GRPO reward function to handle think blocks and end tokens
4 months ago
Tobi Lutke
8a1c4cdab0
Add 1.7B and 4B GRPO training and GGUF conversion scripts
4 months ago