Tobi Lütke
|
bf1b8fc90a
lots of training stuff
|
3 mēneši atpakaļ |
Tobi Lutke
|
7de18ee066
Merge main into finetune
|
3 mēneši atpakaļ |
Tobi Lutke
|
785620467a
refactor: reorder output format to put hyde line first
|
3 mēneši atpakaļ |
Tobi Lütke
|
46ff098361
Change only: format to only:lex (no space after colon)
|
3 mēneši atpakaļ |
Tobias Lütke
|
eb1b77c8cb
Deploy fine-tuned GRPO model as default query expansion (#67)
|
3 mēneši atpakaļ |
Tobi Lutke
|
32706a720f
Refactor finetune folder: train/rl scripts with YAML configs
|
4 mēneši atpakaļ |