Add llamacpp-cpu-qwen3-embed (CPU embedding) extension by kh0pper · Pull Request #1 · kh0pper/crow-addons

kh0pper · 2026-06-29T01:21:48Z

CPU-only Qwen3-Embedding-0.6B via llama.cpp, OpenAI-compatible /v1/embeddings on port 8007. Runs on macOS/Windows Docker Desktop — no GPU required.

Mirrors the bundle added upstream in kh0pper/crow#111. Adds llamacpp-cpu-qwen3-embed/ (crow-addon.json + docker-compose.yml) and the registry.json entry.
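A minimal client sketch for the endpoint described above, assuming the container's port 8007 is published on localhost and that the server accepts the standard OpenAI embeddings request shape. The model name is a hypothetical placeholder, and `build_request`, `parse_embeddings`, and `embed` are illustrative helpers, not part of the bundle.

```python
import json
import urllib.request

# Assumption: the compose file publishes port 8007 on the local host.
BASE_URL = "http://localhost:8007"
# Hypothetical model name for illustration; use whatever the server expects.
MODEL = "qwen3-embedding-0.6b"


def build_request(texts, base_url=BASE_URL, model=MODEL):
    """Build a POST request in the OpenAI /v1/embeddings format."""
    payload = json.dumps({"model": model, "input": texts}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/v1/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_embeddings(body):
    """Return embedding vectors ordered by the 'index' field of each item."""
    items = sorted(body["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


def embed(texts, timeout=60):
    """Send texts to the server and return one vector per input."""
    with urllib.request.urlopen(build_request(texts), timeout=timeout) as resp:
        return parse_embeddings(json.load(resp))
```

With the container running, `embed(["hello world"])` should return a single vector; per the PR, vectors are 1024-dimensional.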

🤖 Generated with Claude Code

CPU-only Qwen3-Embedding-0.6B via llama.cpp, OpenAI-compatible /v1/embeddings on port 8007. Runs on macOS/Windows Docker Desktop, no GPU. Mirrors the bundle added to kh0pper/crow upstream.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

… 8192

The manifest declared contextLen 32768 while docker-compose serves --ctx-size 8192, so inputs over 8K tokens would be silently rejected despite the advertised capacity. Lower the declared contextLen (crow-addon.json + registry.json entry) to match what the CPU server actually serves. Vector space is unchanged (1024-dim, same model): embeddings stay interchangeable with the GPU bundles; only max input length differs. Mirrors the same fix in crow PR #111.
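The mismatch this commit fixes could be caught by a small consistency check. The sketch below assumes `contextLen` sits at the top level of the manifest and that the compose service passes `--ctx-size N` as two separate arguments; neither layout is confirmed by this PR, and both function names are hypothetical.

```python
import shlex


def served_ctx_size(command):
    """Return the value following --ctx-size in a command, or None if absent.

    Accepts either a single command string or a list of arguments,
    the two forms a compose file's command field can take.
    """
    if isinstance(command, str):
        args = shlex.split(command)
    else:
        args = [str(a) for a in command]
    for i, arg in enumerate(args):
        if arg == "--ctx-size" and i + 1 < len(args):
            return int(args[i + 1])
    return None


def check_context_len(manifest, command):
    """Return an error message if the manifest advertises more than is served."""
    # Assumption: contextLen is a top-level integer field in the manifest.
    declared = manifest["contextLen"]
    served = served_ctx_size(command)
    if served is None:
        return "command does not set --ctx-size"
    if declared > served:
        return f"manifest declares {declared} but server serves {served}"
    return None
```

Run against the values in this commit, a declared 32768 with a served 8192 yields an error message, while 8192 against 8192 returns `None`.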

DAYANE GRISEL PALACIOS TORRES and others added 2 commits June 28, 2026 19:21

kh0pper merged commit 3ed7299 into main Jun 29, 2026

kh0pper deleted the add/llamacpp-cpu-qwen3-embed branch June 29, 2026 01:39

kh0pper merged 2 commits into main from add/llamacpp-cpu-qwen3-embed