Bek
|
e4990e470e
Harden embedding overflow handling
|
1 month ago |
Tobias Lütke
|
171e9e3e65
Merge pull request #530 from kuishou68/fix-status-no-build-probe
|
1 month ago |
Jeff Gardner
|
1ecb5c9f96
Fix QMD_LLAMA_GPU backend override handling
|
1 month ago |
cocoon
|
26e3d0c077
fix(status): avoid build attempts during device probe
|
1 month ago |
JohnRichardEnders
|
50ce17bbfa
feat(llm): resolve models as config > env > default
|
1 month ago |
Tobi Lutke
|
55f16460d0
fix(ci): guard LLM calls in CI and increase test timeouts
|
2 months ago |
Tobi Lutke
|
e3549dab1a
perf(rerank): cap parallelism, deduplicate chunks, cache by content
|
2 months ago |
Brian Le
|
0dec1df047
fix(llm): make expansion context size configurable
|
2 months ago |
Tobi Lütke
|
5233e676d9
fix(rerank): truncate documents exceeding 2048-token context size
|
3 months ago |
Tobi Lutke
|
870d3aed3b
test: move all tests to flat test/ directory
|
3 months ago |