Tobi Lutke
|
7b98d4d308
Link fine-tuned model to HuggingFace in README
|
3 ヶ月 前 |
Tobi Lutke
|
5cf4958bfa
Add HuggingFace model card YAML metadata to finetune README
|
3 ヶ月 前 |
Tobi Lutke
|
8572c2fd94
Deploy fine-tuned GRPO model as default for query expansion
|
3 ヶ月 前 |
Tobi Lutke
|
5ab78d00a2
Add HF Jobs scripts, temporal query examples, and training results
|
3 ヶ月 前 |
Tobi Lutke
|
354744af53
Finetune 2.0: consolidate and simplify the entire training pipeline
|
3 ヶ月 前 |
Tobi Lutke
|
9b3a209a97
Fix GRPO training: apply chat template to prompts
|
4 ヶ月 前 |
Tobi Lutke
|
3ea85eff50
Make TUI model list dynamic from HuggingFace Hub
|
4 ヶ月 前 |
Tobi Lutke
|
891f3262cf
Fix GRPO reward function to handle think blocks and end tokens
|
4 ヶ月 前 |
Tobi Lutke
|
66bb8ed963
Remove beads reference from CLAUDE.md
|
4 ヶ月 前 |
Tobi Lutke
|
2267986302
Remove beads issue tracking
|
4 ヶ月 前 |
Tobi Lutke
|
8a1c4cdab0
Add 1.7B and 4B GRPO training and GGUF conversion scripts
|
4 ヶ月 前 |
Tobi Lutke
|
b9b1b39a76
Update README with separate model repos
|
4 ヶ月 前 |
Tobi Lutke
|
312c281109
Update README for unified model repository structure
|
4 ヶ月 前 |
Tobi Lutke
|
2648512b7c
Fix TUI to load GRPO models with SFT base first
|
4 ヶ月 前 |
Tobi Lutke
|
f96766cce8
Fix GRPO model loading to use SFT base first
|
4 ヶ月 前 |
Tobi Lutke
|
f6a6716c44
Refactor evals into separate run and score scripts
|
4 ヶ月 前 |
Tobi Lutke
|
857a85ab58
Clean up evaluation files
|
4 ヶ月 前 |
Tobi Lutke
|
dc8f5a2335
Strict format validation: every line must be lex:/vec:/hyde:
|
4 ヶ月 前 |
Tobi Lutke
|
2ad507a86e
Add chat template leakage detection to reward function
|
4 ヶ月 前 |
Tobi Lutke
|
6062dc769f
Add named entity extraction to GRPO reward function
|
4 ヶ月 前 |
Tobi Lutke
|
32706a720f
Refactor finetune folder: train/rl scripts with YAML configs
|
4 ヶ月 前 |
Tobi Lutke
|
d32e13c172
Add HuggingFace login and comprehensive scoring to GRPO v2 training
|
4 ヶ月 前 |
Tobi Lutke
|
c35dbd6cbd
Add comprehensive scoring system for query expansion
|
4 ヶ月 前 |
Tobi Lutke
|
994a094546
Update README with final evaluation results
|
4 ヶ月 前 |
Tobi Lutke
|
0353994e7d
Fix GRPO training script for TRL API compatibility
|
4 ヶ月 前 |
Tobi Lutke
|
7cca164dd9
Add query expansion model finetuning infrastructure
|
4 ヶ月 前 |
komsit37
|
88f78314bb
Fix sqlite-vec loading with BREW_PREFIX (#42)
|
4 ヶ月 前 |
Tobias Lütke
|
3c7dfad1b6
Make docid lookup more lenient with quotes support (#39)
|
4 ヶ月 前 |
Joshua Lelon Mitchell
|
fbd7fe8c8e
Fix docid lookup in qmd get command (#36)
|
4 ヶ月 前 |
Tobias Lütke
|
5b1671d2f6
Merge pull request #38 from odysseus0/fix/readme-model-sizes
|
4 ヶ月 前 |