Tobias Lütke
|
55c951b15e
Merge pull request #349 from byheaven/fix/qwen3-embedding-model-filename-case
|
2 months ago |
Tobi Lutke
|
55f16460d0
fix(ci): guard LLM calls in CI and increase test timeouts
|
2 months ago |
Tobi Lutke
|
c68904fe08
refactor: move CLI and MCP to subdirectories, MCP consumes SDK
|
2 months ago |
YuBai
|
740b17b485
docs: fix Qwen3-Embedding GGUF filename case in README and llm.ts
|
2 months ago |
Tobi Lutke
|
ad38c1f698
feat: add intent parameter for query disambiguation
|
2 months ago |
Tobi Lutke
|
e3549dab1a
perf(rerank): cap parallelism, deduplicate chunks, cache by content
|
2 months ago |
Tobias Lütke
|
7904ab9a9d
Merge pull request #273 from daocoding/feature/configurable-embed-model
|
2 months ago |
Tobias Lütke
|
ee08997f23
Merge pull request #313 from 0xble/fix/expand-context-size-config
|
2 months ago |
Brian Le
|
0dec1df047
fix(llm): make expansion context size configurable
|
2 months ago |
Gilad S.
|
3095041e0f
feat: use `build: "autoAttempt"` on `getLlama`
|
2 months ago |
Big (daocoding)
|
b71649b12d
feat: add QMD_EMBED_MODEL env var for multilingual embedding support
|
2 months ago |
Tobi Lütke
|
5233e676d9
fix(rerank): truncate documents exceeding 2048-token context size
|
3 months ago |
Tobias Lütke
|
67e2aab18c
Merge pull request #206 from tobi/liquidai-query-expansion
|
3 months ago |
Tobi Lütke
|
57f7caa93b
feat: add LiquidAI LFM2 support for query expansion
|
3 months ago |
Tobi Lutke
|
09803a75b7
feat: compile to JS for npm, release system, full changelog
|
3 months ago |
Tobi Lütke
|
392934e78a
perf: CPU parallelism via multi-context thread splitting
|
3 months ago |
Tobi Lütke
|
0a941c442f
perf: flash attention, right-sized contexts, cleaner GPU detection
|
3 months ago |
Tobi Lütke
|
4ac95b5e26
perf: adaptive parallel contexts for embed + rerank, fix VRAM waste
|
3 months ago |
Tobi Lütke
|
0a0e1e6f29
perf: parallel reranking with multiple contexts (2.7x speedup)
|
3 months ago |
Tobi Lütke
|
ee86bba45e
feat: auto-detect GPU acceleration + device info in status
|
3 months ago |
Tobi Lütke
|
102ff861d3
fix: use Qwen3 recommended sampling params to prevent repetition loops
|
3 months ago |
Tobi Lütke
|
479b68bbf1
add qmd model pull and refresh logic
|
3 months ago |
Tobi Lutke
|
7de18ee066
Merge main into finetune
|
3 months ago |
Tobi Lutke
|
785620467a
refactor: reorder output format to put hyde line first
|
3 months ago |
Tobi Lütke
|
32d313ad6b
Add LLM session management for lifecycle safety
|
3 months ago |
Christopher Jones
|
6d9871d2f5
Fix DisposedError during slow batch embedding (#41)
|
3 months ago |
Tobias Lütke
|
eb1b77c8cb
Deploy fine-tuned GRPO model as default query expansion (#67)
|
3 months ago |
Tobi Lutke
|
8572c2fd94
Deploy fine-tuned GRPO model as default for query expansion
|
3 months ago |
Freeman Jiang
|
bfb0eebc3e
fix: use sequential embedding on CPU-only systems to avoid race condition (#54)
|
3 months ago |
Sergey Gavrilyuk
|
bebee61bec
Fix case sensitivity in Qwen3-1.7B model filename
|
4 months ago |