Tobi Lutke 6062dc769f Add named entity extraction to GRPO reward function 4 months ago
..
grpo_v4.yaml 6062dc769f Add named entity extraction to GRPO reward function 4 months ago
sft_v4.yaml 32706a720f Refactor finetune folder: train/rl scripts with YAML configs 4 months ago