$ timeahead_
← mcp scores
86
knowledge-rag

Drop docs, search instantly from Claude Code — 12 MCP tools, 20 format parsers, hybrid search + reranking. Zero servers, zero API keys, 100% local.

overview

What it does

Knowledge-rag is a local document retrieval system that integrates with Claude Code via 12 MCP tools. It uses hybrid search combining BM25 (keyword matching), semantic vector similarity, and cross-encoder reranking to find relevant passages in your documents. The system supports 20+ file formats including PDFs, markdown, code, and Jupyter notebooks. All processing runs locally via ONNX embeddings with optional NVIDIA GPU acceleration—no cloud APIs, no external servers, no data leaving your machine.

Who it's for

Developers building AI-assisted workflows who have local documentation, codebases, or knowledge bases they want Claude to search without uploading. This includes teams with proprietary docs, security-conscious organizations, and engineers who want fast, local-only retrieval without managing a database server.

Common use cases

  • Index internal documentation (API docs, architecture guides, runbooks) and search them natively from Claude Code prompts.
  • Build code-aware workflows by indexing your codebase and having Claude reference relevant files during development.
  • Create AI agents that ground their responses in your local knowledge without sharing data with cloud services.
  • Search meeting notes, research papers, or project notebooks to inform code generation or documentation writing.

Setup pitfalls

  • ONNX model loading: Versions prior to v3.8.0 loaded the embedding model (~200MB) at startup. Upgrade to v3.8.0+ for lazy loading, and ensure v3.8.1+ for a critical hotfix that prevents silent zero-vector corruption from model load failures.
  • Filesystem access: The server reads and writes to your data directory for indexing and caching. Ensure appropriate file permissions and sufficient disk space for embedding caches, which scale with document count.
  • Multi-process conflicts: Without the opt-in KNOWLEDGE_RAG_SINGLE_INSTANCE environment variable, multiple Claude Code windows or IDE extensions can spawn parallel instances sharing the same data_dir, potentially causing index contention.
install
add to your claude desktop / cursor / windsurf mcp config:
{
  "mcpServers": {
    "knowledge-rag": {
      "command": "uvx",
      "args": [
        "knowledge-rag"
      ]
    }
  }
}
per-client install guide (claude desktop · cursor · windsurf · vscode · claude code) →
owner of this server? claim your listing to get a verified badgeclaim →
score breakdown
security (35%)100
freshness (25%)100
adoption (20%)52
quality (10%)100
trust (10%)50
score history (8 updates)
5/10/20265/20/2026
capabilities · what this server can do
tool list unavailable — permissions from static analysis·auth: API key
high risk
● active   ○ not requested  ·  hover each badge for details
fs read fs write network exec eval secrets
why high risk: fs read + fs write + network + exec + secrets active — can execute code, access credentials, and make external network calls.
maintenance health
46/ 100 · is this project alive
contributors (1y)4
top contributor share90%
releases (1y)22
last release11d ago
ci✗ none
raw data
weekly downloads571
github stars82
forks14
open issues4
license✓ present
readme length52178 chars
last publish0d ago
last commit7d ago
last updated1d ago
install verified✓ pass · 3d ago
score drop alerts
get notified by email when this server's score drops 5+ points