Tobias Lütke
|
171e9e3e65
Merge pull request #530 from kuishou68/fix-status-no-build-probe
|
hace 1 mes |
Jeff Gardner
|
1ecb5c9f96
Fix QMD_LLAMA_GPU backend override handling
|
hace 1 mes |
cocoon
|
26e3d0c077
fix(status): avoid build attempts during device probe
|
hace 1 mes |
JohnRichardEnders
|
50ce17bbfa
feat(llm): resolve models as config > env > default
|
hace 1 mes |
Tobi Lutke
|
55f16460d0
fix(ci): guard LLM calls in CI and increase test timeouts
|
hace 2 meses |
Tobi Lutke
|
e3549dab1a
perf(rerank): cap parallelism, deduplicate chunks, cache by content
|
hace 2 meses |
Brian Le
|
0dec1df047
fix(llm): make expansion context size configurable
|
hace 2 meses |
Tobi Lütke
|
5233e676d9
fix(rerank): truncate documents exceeding 2048-token context size
|
hace 3 meses |
Tobi Lutke
|
870d3aed3b
test: move all tests to flat test/ directory
|
hace 3 meses |