Local Read MCP Server

MCP server for local document processing — structured extraction from PDFs, Office documents, HTML, ZIP archives, and optional image analysis. Designed to complement an agent's built-in Read tool, not duplicate it.

Tools

Tool exposure is startup-time conditional:

Always available: process_binary_file
Available only when vision config exists (VISION_API_KEY or OPENAI_API_KEY): analyze_image

Tool	When to use
`process_binary_file`	Convert supported binary/document/archive files to structured output before reading. Saves to `.local_read_mcp/`.
`analyze_image`	Analyze images via Vision API (Doubao, GPT-4o, etc.). Result saved to `.local_read_mcp/analysis/`.

Quick Start

git clone https://github.com/ansatzX/Local_Read_MCP.git
cd Local_Read_MCP

uv venv .venv && source .venv/bin/activate
uv pip install -e .

Configure MCP in ~/.claude/settings.json:

{
  "mcpServers": [{
    "command": "uv",
    "args": ["--directory", "/path/to/Local_Read_MCP", "run", "python", "-m", "local_read_mcp.server"]
  }]
}

Vision API (Optional)

Create .env in the repository root:

VISION_API_KEY=sk-xxx
VISION_BASE_URL=https://ark.cn-beijing.volces.com/api/v3
VISION_MODEL=doubao-seed-1-8-251228

At startup, the server loads the repository-root .env and then reads environment variables. Existing process environment variables take precedence over .env values. analyze_image is registered only if the server sees vision API key env vars at startup. After changing env values, restart the MCP server.

MinerU VLM-HYBRID Backend (Optional)

For high-quality PDF parsing with layout analysis, formula recognition, and table detection:

pip install "local-read-mcp[mineru]"

# Download models into the project directory
mineru-models-download --models-dir ./models

# Configure model paths
cp mineru.json.template mineru.json
# Edit mineru.json if models are elsewhere

Models downloaded (~4.5GB total):

models/pipeline/ — layout detection, OCR, formula recognition, table structure
models/vlm/ — fine-tuned Qwen2-VL for document understanding

How It Works

process_binary_file(file.pdf)
  ├─ format detection → backend selection
  │
  ├─ SIMPLE (local converters, no model/API dependency)
  │   └─ Built-in converters + MarkItDown fallback: PyMuPDF / mammoth / openpyxl / python-pptx
  │
  ├─ VLM-HYBRID (MinerU required, PDF only)
  │   └─ MinerU hybrid-auto-engine: VLM layout + pipeline OCR/formula/table
  │
  └─ chapter_split (auto for large PDFs)
      ├─ Detect sections via TOC / heading scan / fixed chunks
      ├─ Process each chunk independently (with page overlap)
      └─ Merge output.md + structural_toc.json

All results saved to .local_read_mcp/<file>_<timestamp>/:

intermediate.json — structured block representation
output.md — converted markdown
index.json — section/table/figure index
images/ — extracted images (when requested)

Figure extraction policy (current):

PDF image extraction is intentionally recall-first: it extracts embedded rasters and suspicious vector/image regions.
No deduplication is applied.
TODO: page-level truth judgment (whether a page truly contains a figure) and duplicate discrimination quality evaluation.

Development

uv run pytest          # Run tests
uv run ruff format .   # Format code
uv run ruff check .    # Lint

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
ARCHITECTURE.md		ARCHITECTURE.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
mineru.json.template		mineru.json.template
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Local Read MCP Server

Tools

Quick Start

Vision API (Optional)

MinerU VLM-HYBRID Backend (Optional)

How It Works

Development

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Local Read MCP Server

Tools

Quick Start

Vision API (Optional)

MinerU VLM-HYBRID Backend (Optional)

How It Works

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages