Tobi Lutke
|
785620467a
refactor: reorder output format to put hyde line first
|
3 месяцев назад |
Tobi Lutke
|
5cf4958bfa
Add HuggingFace model card YAML metadata to finetune README
|
3 месяцев назад |
Tobi Lutke
|
8572c2fd94
Deploy fine-tuned GRPO model as default for query expansion
|
3 месяцев назад |
Tobi Lutke
|
5ab78d00a2
Add HF Jobs scripts, temporal query examples, and training results
|
3 месяцев назад |
Tobi Lutke
|
354744af53
Finetune 2.0: consolidate and simplify the entire training pipeline
|
3 месяцев назад |
Tobi Lutke
|
b9b1b39a76
Update README with separate model repos
|
4 месяцев назад |
Tobi Lutke
|
312c281109
Update README for unified model repository structure
|
4 месяцев назад |
Tobi Lutke
|
f96766cce8
Fix GRPO model loading to use SFT base first
|
4 месяцев назад |
Tobi Lutke
|
f6a6716c44
Refactor evals into separate run and score scripts
|
4 месяцев назад |
Tobi Lutke
|
6062dc769f
Add named entity extraction to GRPO reward function
|
4 месяцев назад |
Tobi Lutke
|
994a094546
Update README with final evaluation results
|
4 месяцев назад |
Tobi Lutke
|
7cca164dd9
Add query expansion model finetuning infrastructure
|
4 месяцев назад |