This folder contains the experimental GRPO training path for query expansion. It is not part of the default production pipeline.
- `grpo.yaml` – experimental GRPO hyperparameters
- `grpo.py` – standalone GRPO training script

# Recommended default: run from repo root
cd /home/tobi/qmd
uv run finetune/experiments/grpo/grpo.py
# Or use the unified entrypoint (deprecated in the main pipeline):
uv run train.py grpo --config finetune/experiments/grpo/grpo.yaml
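
The `cd /home/tobi/qmd` path above is specific to one machine. As a minimal sketch, assuming the repo is a git checkout and `uv` is on `PATH`, the standalone run could be wrapped so it resolves the repo root itself. The names `repo_root` and `run_grpo` are hypothetical helpers for illustration and are not part of the repository.

```shell
# Hypothetical helpers (not part of the repo): run the standalone GRPO script
# from the repo root without hardcoding an absolute path.

# Script path as given in this README, relative to the repo root.
GRPO_SCRIPT="finetune/experiments/grpo/grpo.py"

# Print the git top-level directory, falling back to the current directory
# when not inside a git checkout (assumption: the repo is normally a git clone).
repo_root() {
  git rev-parse --show-toplevel 2>/dev/null || pwd
}

# Run the script from the repo root; extra arguments are passed through unchanged.
# Returns 1 without running anything if the script is not found.
run_grpo() {
  local root
  root="$(repo_root)"
  if [ ! -f "$root/$GRPO_SCRIPT" ]; then
    echo "missing $GRPO_SCRIPT under $root" >&2
    return 1
  fi
  (cd "$root" && uv run "$GRPO_SCRIPT" "$@")
}
```

After sourcing these functions, calling `run_grpo` from any subdirectory of the checkout should behave like the recommended default above. The subshell around `cd` keeps the caller's working directory unchanged.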