Tobi Lutke
|
5ab78d00a2
Add HF Jobs scripts, temporal query examples, and training results
|
3 months ago |
Tobi Lutke
|
354744af53
Finetune 2.0: consolidate and simplify the entire training pipeline
|
3 months ago |
Tobi Lutke
|
9b3a209a97
Fix GRPO training: apply chat template to prompts
|
3 months ago |
Tobi Lutke
|
3ea85eff50
Make TUI model list dynamic from HuggingFace Hub
|
3 months ago |
Tobi Lutke
|
891f3262cf
Fix GRPO reward function to handle think blocks and end tokens
|
3 months ago |
Tobi Lutke
|
66bb8ed963
Remove beads reference from CLAUDE.md
|
3 months ago |
Tobi Lutke
|
2267986302
Remove beads issue tracking
|
3 months ago |
Tobi Lutke
|
8a1c4cdab0
Add 1.7B and 4B GRPO training and GGUF conversion scripts
|
4 months ago |
Tobi Lutke
|
b9b1b39a76
Update README with separate model repos
|
4 months ago |
Tobi Lutke
|
312c281109
Update README for unified model repository structure
|
4 months ago |
Tobi Lutke
|
2648512b7c
Fix TUI to load GRPO models with SFT base first
|
4 months ago |
Tobi Lutke
|
f96766cce8
Fix GRPO model loading to use SFT base first
|
4 months ago |
Tobi Lutke
|
f6a6716c44
Refactor evals into separate run and score scripts
|
4 months ago |
Tobi Lutke
|
857a85ab58
Clean up evaluation files
|
4 months ago |
Tobi Lutke
|
dc8f5a2335
Strict format validation: every line must be lex:/vec:/hyde:
|
4 months ago |
Tobi Lutke
|
2ad507a86e
Add chat template leakage detection to reward function
|
4 months ago |
Tobi Lutke
|
6062dc769f
Add named entity extraction to GRPO reward function
|
4 months ago |
Tobi Lutke
|
32706a720f
Refactor finetune folder: train/rl scripts with YAML configs
|
4 months ago |
Tobi Lutke
|
d32e13c172
Add HuggingFace login and comprehensive scoring to GRPO v2 training
|
4 months ago |
Tobi Lutke
|
c35dbd6cbd
Add comprehensive scoring system for query expansion
|
4 months ago |
Tobi Lutke
|
994a094546
Update README with final evaluation results
|
4 months ago |
Tobi Lutke
|
0353994e7d
Fix GRPO training script for TRL API compatibility
|
4 months ago |
Tobi Lutke
|
7cca164dd9
Add query expansion model finetuning infrastructure
|
4 months ago |
komsit37
|
88f78314bb
Fix sqlite-vec loading with BREW_PREFIX (#42)
|
4 months ago |
Tobias Lütke
|
3c7dfad1b6
Make docid lookup more lenient with quotes support (#39)
|
4 months ago |
Joshua Lelon Mitchell
|
fbd7fe8c8e
Fix docid lookup in qmd get command (#36)
|
4 months ago |
Tobias Lütke
|
5b1671d2f6
Merge pull request #38 from odysseus0/fix/readme-model-sizes
|
4 months ago |
George Zhang
|
c8f72de12e
docs: fix query expansion model size (Qwen3-1.7B, not 0.6B)
|
4 months ago |
Tobi Lutke
|
7817dc11a4
Show embedding notice only once at end of qmd update
|
4 months ago |
Tobias Lütke
|
6fbad4e9a6
Merge pull request #15 from gavrix/main
|
4 months ago |