Tobi Lutke
|
d32e13c172
Add HuggingFace login and comprehensive scoring to GRPO v2 training
|
4 ヶ月 前 |
Tobi Lutke
|
0353994e7d
Fix GRPO training script for TRL API compatibility
|
4 ヶ月 前 |
Tobi Lutke
|
7cca164dd9
Add query expansion model finetuning infrastructure
|
4 ヶ月 前 |