suby/qmd

Autor	SHA1 Mensaxe	Data
Tobi Lutke	5ab78d00a2 Add HF Jobs scripts, temporal query examples, and training results	hai 3 meses
Tobi Lutke	354744af53 Finetune 2.0: consolidate and simplify the entire training pipeline	hai 3 meses
Tobi Lutke	b9b1b39a76 Update README with separate model repos	hai 4 meses
Tobi Lutke	312c281109 Update README for unified model repository structure	hai 4 meses
Tobi Lutke	f96766cce8 Fix GRPO model loading to use SFT base first	hai 4 meses
Tobi Lutke	f6a6716c44 Refactor evals into separate run and score scripts	hai 4 meses
Tobi Lutke	6062dc769f Add named entity extraction to GRPO reward function	hai 4 meses
Tobi Lutke	994a094546 Update README with final evaluation results	hai 4 meses
Tobi Lutke	7cca164dd9 Add query expansion model finetuning infrastructure	hai 4 meses