Historique des commits

Auteur SHA1 Message Date
  Tobi Lutke 5cf4958bfa Add HuggingFace model card YAML metadata to finetune README il y a 3 mois
  Tobi Lutke 8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion il y a 3 mois
  Tobi Lutke 5ab78d00a2 Add HF Jobs scripts, temporal query examples, and training results il y a 3 mois
  Tobi Lutke 354744af53 Finetune 2.0: consolidate and simplify the entire training pipeline il y a 3 mois
  Tobi Lutke 9b3a209a97 Fix GRPO training: apply chat template to prompts il y a 4 mois
  Tobi Lutke 3ea85eff50 Make TUI model list dynamic from HuggingFace Hub il y a 4 mois
  Tobi Lutke 891f3262cf Fix GRPO reward function to handle think blocks and end tokens il y a 4 mois
  Tobi Lutke 66bb8ed963 Remove beads reference from CLAUDE.md il y a 4 mois
  Tobi Lutke 2267986302 Remove beads issue tracking il y a 4 mois
  Tobi Lutke 8a1c4cdab0 Add 1.7B and 4B GRPO training and GGUF conversion scripts il y a 4 mois
  Tobi Lutke b9b1b39a76 Update README with separate model repos il y a 4 mois
  Tobi Lutke 312c281109 Update README for unified model repository structure il y a 4 mois
  Tobi Lutke 2648512b7c Fix TUI to load GRPO models with SFT base first il y a 4 mois
  Tobi Lutke f96766cce8 Fix GRPO model loading to use SFT base first il y a 4 mois
  Tobi Lutke f6a6716c44 Refactor evals into separate run and score scripts il y a 4 mois
  Tobi Lutke 857a85ab58 Clean up evaluation files il y a 4 mois
  Tobi Lutke dc8f5a2335 Strict format validation: every line must be lex:/vec:/hyde: il y a 4 mois
  Tobi Lutke 2ad507a86e Add chat template leakage detection to reward function il y a 4 mois
  Tobi Lutke 6062dc769f Add named entity extraction to GRPO reward function il y a 4 mois
  Tobi Lutke 32706a720f Refactor finetune folder: train/rl scripts with YAML configs il y a 4 mois
  Tobi Lutke d32e13c172 Add HuggingFace login and comprehensive scoring to GRPO v2 training il y a 4 mois
  Tobi Lutke c35dbd6cbd Add comprehensive scoring system for query expansion il y a 4 mois
  Tobi Lutke 994a094546 Update README with final evaluation results il y a 4 mois
  Tobi Lutke 0353994e7d Fix GRPO training script for TRL API compatibility il y a 4 mois
  Tobi Lutke 7cca164dd9 Add query expansion model finetuning infrastructure il y a 4 mois
  komsit37 88f78314bb Fix sqlite-vec loading with BREW_PREFIX (#42) il y a 4 mois
  Tobias Lütke 3c7dfad1b6 Make docid lookup more lenient with quotes support (#39) il y a 4 mois
  Joshua Lelon Mitchell fbd7fe8c8e Fix docid lookup in qmd get command (#36) il y a 4 mois
  Tobias Lütke 5b1671d2f6 Merge pull request #38 from odysseus0/fix/readme-model-sizes il y a 4 mois
  George Zhang c8f72de12e docs: fix query expansion model size (Qwen3-1.7B, not 0.6B) il y a 4 mois