Tobi Lütke
|
d6f3688d91
Remove grpo command from default train entrypoint
|
3 месяцев назад |
Tobi Lütke
|
189916d6fb
Move GRPO training out of default finetune pipeline
|
3 месяцев назад |
Tobi Lütke
|
cbeeb1f89b
Add wall-clock checkpoints and full eval defaults
|
3 месяцев назад |
Tobi Lütke
|
bf1b8fc90a
lots of training stuff
|
3 месяцев назад |
Tobi Lutke
|
38073799c0
chore: clean up finetune folder and fix training workflow
|
3 месяцев назад |
Tobi Lütke
|
46ff098361
Change only: format to only:lex (no space after colon)
|
3 месяцев назад |
Tobias Lütke
|
eb1b77c8cb
Deploy fine-tuned GRPO model as default query expansion (#67)
|
3 месяцев назад |