| .. |
|
gepa
|
1d7d167b29
finetune: strict Pydantic schema, one canonical data format
|
il y a 3 mois |
|
grpo
|
189916d6fb
Move GRPO training out of default finetune pipeline
|
il y a 3 mois |
|
lfm2
|
1d7d167b29
finetune: strict Pydantic schema, one canonical data format
|
il y a 3 mois |