suby/qmd

Autors	SHA1 Ziņojums	Datums
Tobias Lütke	eb1b77c8cb Deploy fine-tuned GRPO model as default query expansion (#67)	5 mēneši atpakaļ
Tobi Lutke	8572c2fd94 Deploy fine-tuned GRPO model as default for query expansion	5 mēneši atpakaļ
Tobi Lutke	5ab78d00a2 Add HF Jobs scripts, temporal query examples, and training results	5 mēneši atpakaļ
Tobi Lutke	354744af53 Finetune 2.0: consolidate and simplify the entire training pipeline	5 mēneši atpakaļ
jdvmi00	64c6e6c2e3 fix: rename collectionId to collectionName in searchVec for proper filtering (#61)	5 mēneši atpakaļ
Freeman Jiang	bfb0eebc3e fix: use sequential embedding on CPU-only systems to avoid race condition (#54)	5 mēneši atpakaļ
Copilot	053252ca24 Add Windows path utilities with cross-platform test coverage (#51)	5 mēneši atpakaļ
sh54	ba7391832d Add org-mode title extraction support (#50)	5 mēneši atpakaļ
sh54	65c0f89560 Enable SQLite extension loading in devshell (#48)	5 mēneši atpakaļ
Tobi Lutke	9b3a209a97 Fix GRPO training: apply chat template to prompts	5 mēneši atpakaļ
Tobi Lutke	3ea85eff50 Make TUI model list dynamic from HuggingFace Hub	5 mēneši atpakaļ
Tobi Lutke	891f3262cf Fix GRPO reward function to handle think blocks and end tokens	5 mēneši atpakaļ
Tobi Lutke	66bb8ed963 Remove beads reference from CLAUDE.md	5 mēneši atpakaļ
Tobi Lutke	2267986302 Remove beads issue tracking	5 mēneši atpakaļ
Tobi Lutke	8a1c4cdab0 Add 1.7B and 4B GRPO training and GGUF conversion scripts	5 mēneši atpakaļ
Tobi Lutke	b9b1b39a76 Update README with separate model repos	5 mēneši atpakaļ
Tobi Lutke	312c281109 Update README for unified model repository structure	5 mēneši atpakaļ
Tobi Lutke	2648512b7c Fix TUI to load GRPO models with SFT base first	5 mēneši atpakaļ
Tobi Lutke	f96766cce8 Fix GRPO model loading to use SFT base first	5 mēneši atpakaļ
Tobi Lutke	f6a6716c44 Refactor evals into separate run and score scripts	5 mēneši atpakaļ
Tobi Lutke	857a85ab58 Clean up evaluation files	5 mēneši atpakaļ
Tobi Lutke	dc8f5a2335 Strict format validation: every line must be lex:/vec:/hyde:	5 mēneši atpakaļ
Tobi Lutke	2ad507a86e Add chat template leakage detection to reward function	5 mēneši atpakaļ
Tobi Lutke	6062dc769f Add named entity extraction to GRPO reward function	5 mēneši atpakaļ
Tobi Lutke	32706a720f Refactor finetune folder: train/rl scripts with YAML configs	5 mēneši atpakaļ
Tobi Lutke	d32e13c172 Add HuggingFace login and comprehensive scoring to GRPO v2 training	5 mēneši atpakaļ
Tobi Lutke	c35dbd6cbd Add comprehensive scoring system for query expansion	5 mēneši atpakaļ
Tobi Lutke	994a094546 Update README with final evaluation results	5 mēneši atpakaļ
Tobi Lutke	0353994e7d Fix GRPO training script for TRL API compatibility	5 mēneši atpakaļ
Tobi Lutke	7cca164dd9 Add query expansion model finetuning infrastructure	5 mēneši atpakaļ

Jaunāki Vecāki

Revīziju vēsture Meklēt

Revīziju vēsture