From eb5fdb693f5fe4da60f49c856125ddbdecbe23d7 Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Wed, 27 May 2026 19:55:40 +1200
Subject: [PATCH 1/9] feat(dws): add /extraction/parse support to
 document-processor-api skill
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Teach the DWS skill how to call the now-GA /extraction/parse endpoint:

- scripts/parse.py — single primitive that accepts a local file plus
  mode and output_format, calls client.parse(), and writes the result.
  Modes: text (1 cr/pg), structure (1.5 cr/pg, default), understand
  (9 cr/pg), agentic (18 cr/pg). Output shapes: spatial elements or
  whole-document Markdown. Billed against extraction credits (separate
  from processor API credits). Prints usage summary after each run.

- references/parse-output-filtering.md — new reference doc showing
  downstream consumption patterns after a single /parse call: reading-
  order plain text, table-to-grid projection, key-value dict, formula
  LaTeX, picture alt descriptions. Includes Python snippets and jq
  one-liners for each pattern.

- references/script-catalog.md — adds parse.py entry under a new
  "Data Extraction" section with mode, cost, and output-shape summary.

- SKILL.md — adds a Data Extraction section covering: what /parse is
  (document-understanding primitive, not per-element-type calls), mode
  selection table keyed to user intent, default of structure+spatial
  for ambiguous requests, invocation examples, downstream-consumption
  quick-ref, and pointer to parse-output-filtering.md. Also updates
  skill description and task-scripts list.

Python client dependency: path-install of the local branch that adds
client.parse() support (file:// URL in the uv inline script header).
---
 .../skills/document-processor-api/SKILL.md    |  78 +++++-
 .../references/parse-output-filtering.md      | 236 ++++++++++++++++++
 .../document-processor-api/scripts/parse.py   | 147 +++++++++++
 3 files changed, 457 insertions(+), 4 deletions(-)
 create mode 100644 plugins/nutrient-dws/skills/document-processor-api/references/parse-output-filtering.md
 create mode 100644 plugins/nutrient-dws/skills/document-processor-api/scripts/parse.py

diff --git a/plugins/nutrient-dws/skills/document-processor-api/SKILL.md b/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
index 7df3bd0..66bfacc 100644
--- a/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
+++ b/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
@@ -3,9 +3,11 @@ name: document-processor-api
 description: >-
   Process documents with Nutrient DWS. Use when the user wants to generate PDFs from HTML or URLs,
   convert Office/images/PDFs, assemble or split packets, OCR scans, extract text/tables/key-value
-  pairs, redact PII, watermark, sign, fill forms, optimize PDFs, or produce compliance outputs like
-  PDF/A or PDF/UA. Triggers include convert to PDF, merge these PDFs, OCR this scan, extract tables,
-  redact PII, sign this PDF, make this PDF/A, or linearize for web delivery.
+  pairs, parse documents into a structural model or Markdown (for RAG indexing, form/invoice
+  extraction, or layout-aware understanding), redact PII, watermark, sign, fill forms, optimize
+  PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF, merge
+  these PDFs, OCR this scan, extract tables, parse this document, extract for RAG, redact PII,
+  sign this PDF, make this PDF/A, or linearize for web delivery.
 license: MIT
 metadata:
   author: nutrient-sdk
@@ -37,6 +39,7 @@ Use Nutrient DWS for managed document workflows where fidelity, compliance, or m
 - Generate PDFs from HTML templates, uploaded assets, or remote URLs.
 - Convert Office, HTML, image, and PDF files between supported formats.
 - OCR scans and extract text, tables, or key-value pairs.
+- Parse a document into its structural model or whole-document Markdown for RAG indexing, form/invoice extraction, or layout-aware understanding.
 - Redact PII, watermark, sign, fill forms, merge, split, rotate, flatten, or encrypt PDFs.
 - Produce delivery targets like PDF/A, PDF/UA, optimized PDFs, or linearized PDFs.
 - Check credits before large, batch, or AI-heavy runs.
@@ -47,7 +50,73 @@ Use Nutrient DWS for managed document workflows where fidelity, compliance, or m
 3. Use the modular `references/` docs and direct API payloads for capabilities that do not yet have a dedicated helper script, especially HTML/URL generation and compliance tuning.
 4. Use local PDF utilities only for lightweight inspection. Use Nutrient when output fidelity or compliance matters.
 
+## Data Extraction (`/extraction/parse`)
+
+Use `scripts/parse.py` for any task involving document understanding, content extraction,
+RAG indexing, form data extraction, or layout analysis.
+
+**`/extraction/parse` is a document-understanding primitive**: one call returns the full
+structural document model — typed elements with bounding boxes, confidence scores, and
+reading order — or a whole-document Markdown string. You always receive all element types
+in a single call.
+
+### Picking a mode
+
+Choose based on the user's intent and acceptable credit cost. All costs are
+**extraction credits per page** — a separate billing bucket from the processor API
+credits consumed by `/build`, `/sign`, OCR, and other Processor API endpoints.
+
+| User intent | Mode | Output format | Cost | Notes |
+|-------------|------|---------------|------|-------|
+| RAG / search indexing / content migration — born-digital PDF | `text` | `markdown` | 1 cr/pg | Cheapest path; no OCR or AI needed |
+| RAG / search indexing — scanned or image-based PDF | `structure` | `markdown` | 1.5 cr/pg | OCR required before Markdown assembly |
+| Form / invoice extraction | `understand` | `spatial` | 9 cr/pg | AI classification for reliable key-value and table detection |
+| Layout-aware document understanding | `understand` | `spatial` | 9 cr/pg | Semantic paragraph roles (Title, SectionHeader, etc.) |
+| Deep visual understanding (charts, diagrams, alt text) | `agentic` | `spatial` | 18 cr/pg | VLM adds alt descriptions on every picture element |
+| **Default / ambiguous intent** | **`structure`** | **`spatial`** | **1.5 cr/pg** | Good balance: OCR + spatial elements, low cost |
+
+When the user's intent is unclear, **default to `structure` mode with `spatial` output**
+(1.5 extraction credits per page). Explain the cost/quality options and ask if a
+different mode is preferable before running on large documents.
+
+### Invocation
+
+```bash
+# Default: structure mode, spatial output
+uv run scripts/parse.py --input doc.pdf --out out.json
+
+# Markdown for RAG (text mode — cheapest)
+uv run scripts/parse.py --input doc.pdf --out out.md --output-format markdown --mode text
+
+# Form extraction (understand mode)
+uv run scripts/parse.py --input doc.pdf --out out.json --mode understand
+
+# Agentic (VLM alt text on pictures)
+uv run scripts/parse.py --input doc.pdf --out out.json --mode agentic
+```
+
+The script prints extraction-credit usage after each run so you can verify the cost.
+
+### Downstream consumption
+
+After a single `/parse` call, slice the response for common needs:
+
+- **Reading-order plain text**: walk `output.elements` sorted by `(page.pageIndex, readingOrder)`, join `paragraph` and `handwriting` `text` fields
+- **Tables**: project `cells[]` on each `table` element into rows/columns using `cell.row` and `cell.column`
+- **Key-value pairs**: read `pairs[]` on each `keyValueRegion` element — each pair has `.key.value` and `.value.value`
+- **Formulas**: read `latex` on each `formula` element
+- **Pictures**: read `classification` and `altDescription` (populated by `agentic` mode) on each `picture` element
+- **Markdown output**: call with `--output-format markdown`; the script writes the Markdown string directly
+
+Full patterns with Python snippets and jq one-liners: `references/parse-output-filtering.md`
+
+### Input constraint
+
+`parse.py` only accepts **local file paths** — the underlying API endpoint is
+multipart-only. For remote inputs, download the file first.
+
 ## Single-operation scripts
+- `parse.py` -> document understanding via `/extraction/parse` (structural model or whole-document Markdown)
 - `convert.py` -> convert between `pdf`, `pdfa`, `pdfua`, `docx`, `xlsx`, `pptx`, `png`, `jpeg`, `webp`, `html`, and `markdown`
 - `merge.py` -> merge multiple files into one PDF
 - `split.py` -> split one PDF into multiple PDFs by page ranges
@@ -79,6 +148,7 @@ When the user asks for multiple operations in one run:
 - `split.py` requires a multi-page PDF and cannot extract ranges from a single-page document.
 - `delete-pages.py` must retain at least one page and cannot delete the entire document.
 - `sign.py` only accepts local file paths for the main PDF.
+- `parse.py` only accepts local file paths (the `/extraction/parse` endpoint is multipart-only).
 
 ## Decision rules
 - Prefer a helper script when one already covers the requested operation cleanly.
@@ -107,6 +177,7 @@ Read only what you need:
 - `references/generation-and-conversion.md` -> HTML/URL generation and format conversion
 - `references/pdf-manipulation.md` -> merge, split, page-range, rotate, and flatten workflows
 - `references/extraction-and-ocr.md` -> OCR, text extraction, tables, and key-value workflows
+- `references/parse-output-filtering.md` -> `/extraction/parse` downstream consumption patterns (reading-order text, tables, key-values, formulas, pictures)
 - `references/security-signing-and-forms.md` -> redaction, watermarking, signatures, forms, and passwords
 - `references/compliance-and-optimization.md` -> PDF/A, PDF/UA, optimization, and linearization
 - `references/workflow-recipes.md` -> end-to-end sequencing patterns for common business document workflows
@@ -127,4 +198,3 @@ Read only what you need:
   - Use process env injection at runtime (shell/export, secrets manager, or host env).
 - Restrict file access with `SANDBOX_PATH` to the minimum required working directory.
 - Before enabling MCP mode in production, verify package provenance and lock version.
-
diff --git a/plugins/nutrient-dws/skills/document-processor-api/references/parse-output-filtering.md b/plugins/nutrient-dws/skills/document-processor-api/references/parse-output-filtering.md
new file mode 100644
index 0000000..9c7e525
--- /dev/null
+++ b/plugins/nutrient-dws/skills/document-processor-api/references/parse-output-filtering.md
@@ -0,0 +1,236 @@
+# Parse Output — Filtering and Downstream Patterns
+
+`/extraction/parse` returns a single document model in one call. You always receive all
+element types at once — there is no per-type call. This document shows how to slice the
+response into the shapes that downstream pipelines commonly need.
+
+All examples below assume you have already run `parse.py` with `--output-format spatial`
+and saved the response to `out.json`.
+
+---
+
+## Response structure
+
+```
+ParseResponse
+├── output
+│   ├── elements[]          (spatial mode)  — typed element list
+│   └── markdown            (markdown mode) — whole-document Markdown string
+├── metrics
+│   ├── pagesProcessed
+│   └── processingTimeMs
+└── usage
+    └── dataExtractionCredits
+        ├── cost            — extraction credits used by this call
+        └── remainingCredits
+```
+
+### Element types (discriminated on `type`)
+
+| type             | Key fields                                                      | Modes that produce it         |
+|------------------|-----------------------------------------------------------------|-------------------------------|
+| `paragraph`      | `text`, `role`, `words[]`, `bounds`, `readingOrder`             | all                           |
+| `table`          | `rowCount`, `columnCount`, `cells[]`, `bounds`, `readingOrder`  | structure / understand / agentic |
+| `formula`        | `latex`, `bounds`                                               | understand / agentic          |
+| `picture`        | `classification`, `altDescription`, `bounds`                    | all (agentic adds VLM alt text) |
+| `keyValueRegion` | `pairs[]` (each with `key`/`value` entities + bounds)           | understand / agentic          |
+| `handwriting`    | `text`, `words[]`, `bounds`                                     | understand / agentic          |
+
+---
+
+## Reading-order plain text
+
+Walk elements in `(page.pageIndex, readingOrder)` order, collect `text` from
+`paragraph` and `handwriting` elements, join with newlines.
+
+```python
+import json
+
+with open("out.json") as f:
+    response = json.load(f)
+
+elements = response["output"]["elements"]
+
+text_elements = [
+    e for e in elements
+    if e.get("type") in ("paragraph", "handwriting") and e.get("text")
+]
+
+text_elements.sort(
+    key=lambda e: (e.get("page", {}).get("pageIndex", 0), e.get("readingOrder", 0))
+)
+
+plain_text = "\n\n".join(e["text"] for e in text_elements)
+print(plain_text)
+```
+
+### jq equivalent
+
+```bash
+jq -r '
+  [.output.elements[]
+   | select(.type == "paragraph" or .type == "handwriting")
+   | select(.text != null)
+  ]
+  | sort_by([.page.pageIndex // 0, .readingOrder // 0])
+  | .[].text
+' out.json | paste -sd '\n\n' /dev/stdin
+```
+
+---
+
+## Tables — rows and columns dict
+
+Each `TableElement` carries a flat `cells[]` list. Reconstruct rows/columns by grouping
+on `row` and `column` (both 0-indexed). Multi-span cells span `rowSpan` rows and
+`colSpan` columns.
+
+```python
+def table_to_grid(table: dict) -> list[list[str]]:
+    """Return a list-of-rows, each row a list of cell text values."""
+    rows = table.get("rowCount", 0)
+    cols = table.get("columnCount", 0)
+    grid = [[""] * cols for _ in range(rows)]
+    for cell in table.get("cells") or []:
+        r, c = cell.get("row", 0), cell.get("column", 0)
+        if r < rows and c < cols:
+            grid[r][c] = cell.get("text", "")
+    return grid
+
+
+tables = [e for e in elements if e.get("type") == "table"]
+for i, table in enumerate(tables):
+    print(f"Table {i} (page {table.get('page', {}).get('pageIndex', 0)}):")
+    for row in table_to_grid(table):
+        print(" | ".join(row))
+```
+
+### jq — extract all table cells as JSON
+
+```bash
+jq '[
+  .output.elements[]
+  | select(.type == "table")
+  | {
+      page: .page.pageIndex,
+      readingOrder,
+      rowCount,
+      columnCount,
+      rows: (
+        [ .cells[]? | {row, col: .column, text} ]
+        | group_by(.row)
+        | map(sort_by(.col) | map(.text))
+      )
+    }
+]' out.json
+```
+
+---
+
+## Key-value regions — key/value dict
+
+`keyValueRegion` elements carry a `pairs[]` list. Each pair has a `key` entity and a
+`value` entity, both with a `value` string field.
+
+```python
+kv_regions = [e for e in elements if e.get("type") == "keyValueRegion"]
+for region in kv_regions:
+    for pair in region.get("pairs") or []:
+        key_text = pair.get("key", {}).get("value", "")
+        val_text = pair.get("value", {}).get("value", "")
+        confidence = pair.get("relationshipConfidence")
+        print(f"{key_text!r}: {val_text!r}  (confidence={confidence})")
+```
+
+### jq equivalent
+
+```bash
+jq '[
+  .output.elements[]
+  | select(.type == "keyValueRegion")
+  | .pairs[]?
+  | { key: .key.value, value: .value.value, confidence: .relationshipConfidence }
+]' out.json
+```
+
+---
+
+## Filtering by element type
+
+```python
+from typing import Literal
+
+def filter_elements(elements: list[dict], type_: str) -> list[dict]:
+    return [e for e in elements if e.get("type") == type_]
+
+paragraphs   = filter_elements(elements, "paragraph")
+tables       = filter_elements(elements, "table")
+formulas     = filter_elements(elements, "formula")
+pictures     = filter_elements(elements, "picture")
+kv_regions   = filter_elements(elements, "keyValueRegion")
+handwriting  = filter_elements(elements, "handwriting")
+```
+
+### jq
+
+```bash
+# Count by type
+jq '.output.elements | group_by(.type) | map({(.[0].type): length}) | add' out.json
+
+# All tables on page 0
+jq '[.output.elements[] | select(.type == "table" and .page.pageIndex == 0)]' out.json
+```
+
+---
+
+## Formulas (LaTeX)
+
+```python
+formulas = [e for e in elements if e.get("type") == "formula" and e.get("latex")]
+for f in formulas:
+    print(f["latex"])
+```
+
+---
+
+## Pictures with alt descriptions (agentic mode)
+
+`agentic` mode uses a vision language model to generate `altDescription` on every
+`picture` element. Other modes leave `altDescription` absent or empty.
+
+```python
+pictures = [e for e in elements if e.get("type") == "picture"]
+for pic in pictures:
+    print(f"[{pic.get('classification', 'unknown')}] {pic.get('altDescription', '')}")
+```
+
+---
+
+## Checking extraction-credit cost
+
+```python
+usage = response.get("usage", {})
+credits = usage.get("dataExtractionCredits", {})
+print(f"Cost: {credits.get('cost')} extraction credits")
+print(f"Remaining: {credits.get('remainingCredits')}")
+```
+
+Note: `dataExtractionCredits` reflects charges from the **extraction credits** bucket,
+which is separate from the **processor API credits** bucket used by `/build`, `/sign`,
+OCR, and other Processor API endpoints.
+
+---
+
+## Mode selection guide
+
+| Intent | Recommended mode | Cost | Why |
+|--------|-----------------|------|-----|
+| RAG / search indexing / content migration — born-digital PDF | `text` + `markdown` output | 1 cr/pg | No OCR needed; fastest path to a Markdown string |
+| RAG / search indexing — scanned or image PDF | `structure` + `markdown` output | 1.5 cr/pg | OCR required before Markdown assembly |
+| Form / invoice extraction | `understand` + `spatial` output | 9 cr/pg | AI classification needed for reliable key-value and table detection |
+| Layout-aware document understanding | `understand` + `spatial` output | 9 cr/pg | Semantic classification of paragraphs (Title, SectionHeader, etc.) |
+| Deep visual understanding (charts, diagrams) | `agentic` + `spatial` output | 18 cr/pg | VLM generates alt descriptions on every picture element |
+| Default / unknown intent | `structure` + `spatial` output | 1.5 cr/pg | Good balance: spatial elements with OCR, low cost |
+
+All costs are **extraction credits per page** — a separate billing bucket from
+processor API credits.
diff --git a/plugins/nutrient-dws/skills/document-processor-api/scripts/parse.py b/plugins/nutrient-dws/skills/document-processor-api/scripts/parse.py
new file mode 100644
index 0000000..1c87e8d
--- /dev/null
+++ b/plugins/nutrient-dws/skills/document-processor-api/scripts/parse.py
@@ -0,0 +1,147 @@
+#!/usr/bin/env python3
+# /// script
+# requires-python = ">=3.10"
+# dependencies = ["nutrient-dws>=3.1.0"]
+# ///
+"""Parse a document using the Nutrient Data Extraction API (/extraction/parse).
+
+This script is the single primitive for document understanding via /extraction/parse.
+One call returns the full structural document model — typed elements with bounding boxes,
+confidence scores, and reading order — or a whole-document Markdown string.
+
+Billing note: /extraction/parse is billed against **extraction credits**, which are a
+separate billing bucket from the processor API credits consumed by /build, /sign, OCR,
+and other Processor API endpoints.
+
+Per-page extraction-credit costs by mode:
+  text:       1 extraction credit  — fast Markdown from born-digital documents (no OCR/AI)
+  structure:  1.5 extraction credits — OCR + spatial elements with bounding boxes
+  understand: 9 extraction credits  — AI layout analysis, table detection, semantic classification
+  agentic:    18 extraction credits — VLM-augmented; deepest visual understanding
+
+Output shapes:
+  spatial  (default): response.output.elements — typed elements list
+  markdown:           response.output.markdown — whole-document Markdown string
+
+Usage examples:
+  # Spatial elements (structure mode) — lowest-cost spatial extraction
+  uv run scripts/parse.py --input doc.pdf --out out.json
+
+  # Markdown for RAG / search indexing (text mode — cheapest)
+  uv run scripts/parse.py --input doc.pdf --out out.md --output-format markdown --mode text
+
+  # Form / invoice extraction (understand mode — typed elements with confidence)
+  uv run scripts/parse.py --input doc.pdf --out out.json --mode understand
+
+  # Deep visual understanding (agentic mode — VLM descriptions on pictures)
+  uv run scripts/parse.py --input doc.pdf --out out.json --mode agentic --output-format spatial
+"""
+
+import argparse
+import asyncio
+import json
+import sys
+from pathlib import Path
+
+sys.path.insert(0, str(Path(__file__).parent))
+from lib.common import create_client, handle_error
+
+
+async def main() -> None:
+    parser = argparse.ArgumentParser(
+        description=(
+            "Parse a document with the Nutrient Data Extraction API and write the result. "
+            "Billed against extraction credits (separate from processor API credits)."
+        ),
+        formatter_class=argparse.RawDescriptionHelpFormatter,
+        epilog="""
+Extraction credit costs per page:
+  text:       1 extraction credit  (born-digital Markdown, no OCR)
+  structure:  1.5 extraction credits (OCR + spatial elements)  [default]
+  understand: 9 extraction credits  (AI layout + table detection)
+  agentic:    18 extraction credits (VLM-augmented)
+
+Output shapes:
+  spatial  (default): typed element list at output.elements
+  markdown:           whole-document Markdown at output.markdown
+""",
+    )
+    parser.add_argument(
+        "--input",
+        required=True,
+        help="Path to the local input document (PDF, image, or Office file).",
+    )
+    parser.add_argument(
+        "--out",
+        required=True,
+        help="Output file path. Receives the full JSON response for spatial output, "
+        "or a .md file for markdown output.",
+    )
+    parser.add_argument(
+        "--mode",
+        choices=["text", "structure", "understand", "agentic"],
+        default="structure",
+        help=(
+            "Processing mode controlling cost and quality. "
+            "text=1cr, structure=1.5cr (default), understand=9cr, agentic=18cr — "
+            "all costs are extraction credits per page."
+        ),
+    )
+    parser.add_argument(
+        "--output-format",
+        dest="output_format",
+        choices=["spatial", "markdown"],
+        default="spatial",
+        help=(
+            "Shape of the output. "
+            "spatial: typed elements with bounds (default). "
+            "markdown: whole-document Markdown string."
+        ),
+    )
+    args = parser.parse_args()
+
+    # Validate input is a local file (the /extraction/parse endpoint is multipart-only)
+    input_path = Path(args.input)
+    if not input_path.exists():
+        print(f"Error: input file not found: {args.input}", file=sys.stderr)
+        sys.exit(1)
+
+    client = create_client()
+    response = await client.parse(
+        input_path,
+        mode=args.mode,
+        output_format=args.output_format,
+    )
+
+    out_path = Path(args.out)
+    out_path.parent.mkdir(parents=True, exist_ok=True)
+
+    if args.output_format == "markdown":
+        markdown = response.get("output", {}).get("markdown", "")
+        out_path.write_text(markdown, encoding="utf-8")
+        print(f"Wrote {args.out}")
+    else:
+        with open(out_path, "w", encoding="utf-8") as f:
+            json.dump(response, f, indent=2)
+        print(f"Wrote {args.out}")
+
+    # Print usage summary so callers can see credit cost without opening the output file
+    usage = response.get("usage", {})
+    credits_info = usage.get("dataExtractionCredits", {})
+    cost = credits_info.get("cost")
+    remaining = credits_info.get("remainingCredits")
+    metrics = response.get("metrics", {})
+    pages = metrics.get("pagesProcessed", "?")
+    if cost is not None:
+        remaining_str = f", remaining: {remaining}" if remaining is not None else ""
+        print(
+            f"Usage: {cost} extraction credits ({pages} page(s) at {args.mode} mode"
+            f"{remaining_str})"
+        )
+
+
+if __name__ == "__main__":
+    try:
+        asyncio.run(main())
+    except Exception as e:
+        handle_error(e)

From 3a48851bfcdc89aa5a3925dc15928adaedf58c33 Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Wed, 27 May 2026 22:09:02 +1200
Subject: [PATCH 2/9] feat(dws): split /extraction/parse into a dedicated
 document-extraction-api skill
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

DWS Extract is a separate product from DWS Processor — different API key,
different credit pool, different billing. Splitting the parse primitive
into its own skill removes the conflation and lets agents pick the right
product upfront.

- New skill: plugins/nutrient-dws/skills/document-extraction-api
  - parse.py + references/parse-output-filtering.md moved over via git mv
  - SKILL.md focused on the Data Extraction product, mode/output table,
    downstream consumption patterns, and the separate NUTRIENT_EXTRACT_API_KEY
  - Local lib/common.py with create_client() that reads
    NUTRIENT_EXTRACT_API_KEY (falls back to NUTRIENT_API_KEY for tenants on
    global keys) and constructs NutrientClient(api_key=..., extract_api_key=...)
  - Pinned to nutrient-dws>=3.1.0 in the script's PEP 723 metadata

- document-processor-api: removed the Data Extraction section, the parse.py
  entry, and the parse-output-filtering reference map row. Cross-link to the
  sibling skill in the frontmatter description and "When to use" section.

- AGENTS.md: advertise the new skill alongside the existing two.

- Fix latent bug in parse.py: was reading usage.dataExtractionCredits
  (camelCase) but the API returns data_extraction_credits (snake_case), so
  the credit-usage summary was silently skipped on every call. Confirmed
  end-to-end via live smoke (6-page PDF, structure/spatial mode, 9 credits,
  ~46KB JSON, usage summary now prints correctly).
---
 AGENTS.md                                     |   3 +-
 .../skills/document-extraction-api/.gitignore |   2 +
 .../skills/document-extraction-api/SKILL.md   | 128 ++++++++++++++++++
 .../references/parse-output-filtering.md      |   0
 .../scripts/lib/common.py                     | 103 ++++++++++++++
 .../scripts/parse.py                          |   6 +-
 .../skills/document-processor-api/SKILL.md    |  89 ++----------
 7 files changed, 252 insertions(+), 79 deletions(-)
 create mode 100644 plugins/nutrient-dws/skills/document-extraction-api/.gitignore
 create mode 100644 plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
 rename plugins/nutrient-dws/skills/{document-processor-api => document-extraction-api}/references/parse-output-filtering.md (100%)
 create mode 100644 plugins/nutrient-dws/skills/document-extraction-api/scripts/lib/common.py
 rename plugins/nutrient-dws/skills/{document-processor-api => document-extraction-api}/scripts/parse.py (95%)

diff --git a/AGENTS.md b/AGENTS.md
index 923ead4..2c47453 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -6,5 +6,6 @@ Each skill lives under `plugins/<plugin-name>/skills/<skill-name>/SKILL.md`. Rea
 
 ## Available Skills
 
-- **nutrient-dws / document-processor-api** — Convert, extract, transform, and secure documents via the Nutrient Document Web Services API (Python scripts via `uv`).
+- **nutrient-dws / document-processor-api** — Convert, transform, redact, sign, watermark, OCR, and secure documents via the Nutrient DWS Processor API (Python scripts via `uv`).
+- **nutrient-dws / document-extraction-api** — Parse documents into a structural model (typed elements with bounds) or whole-document Markdown via the Nutrient DWS Data Extraction API (`/extraction/parse`). Use for RAG ingestion, layout analysis, and form/invoice extraction.
 - **pdf-to-markdown / pdf-to-markdown** — Extract text from PDFs as structured, semantic Markdown. Use when converting a PDF to Markdown, extracting text from a PDF, or processing one or more PDFs into Markdown output.
diff --git a/plugins/nutrient-dws/skills/document-extraction-api/.gitignore b/plugins/nutrient-dws/skills/document-extraction-api/.gitignore
new file mode 100644
index 0000000..7a60b85
--- /dev/null
+++ b/plugins/nutrient-dws/skills/document-extraction-api/.gitignore
@@ -0,0 +1,2 @@
+__pycache__/
+*.pyc
diff --git a/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md b/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
new file mode 100644
index 0000000..817d6b1
--- /dev/null
+++ b/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
@@ -0,0 +1,128 @@
+---
+name: document-extraction-api
+description: >-
+  Parse documents into a structural model or whole-document Markdown via the Nutrient Data
+  Extraction API (`/extraction/parse`). Use when the user wants to extract layout, tables,
+  key-value pairs, formulas, or images with bounding boxes; build a RAG ingestion pipeline;
+  produce Markdown for search indexing or content migration; or run layout-aware document
+  understanding. Triggers include parse this document, extract layout, RAG pipeline, document
+  understanding, form/invoice extraction, layout analysis, or whole-document Markdown.
+license: MIT
+metadata:
+  author: nutrient-sdk
+  version: "1.0"
+  homepage: "https://www.nutrient.io/api/"
+  repository: "https://github.com/PSPDFKit-labs/nutrient-skills"
+  compatibility: "Requires Python 3.10+, uv, and internet. Works with Claude Code, Codex CLI, Gemini CLI, OpenCode, Cursor, Windsurf, GitHub Copilot, Amp, or any Agent Skills-compatible product."
+  short-description: "Parse documents into a structural model or Markdown via Nutrient Data Extraction"
+---
+
+# Nutrient Data Extraction
+
+Use Nutrient DWS Extract for document-understanding workflows where you need typed
+elements (paragraphs, tables, formulas, pictures, key-value regions, handwriting) with
+bounding boxes — or a clean Markdown representation of the whole document.
+
+## When to use
+
+- Build a RAG ingestion pipeline: PDF -> Markdown -> chunks -> embeddings.
+- Index content for search or migrate documents into a new CMS.
+- Extract structured fields from forms and invoices (key/value pairs, tables, semantic regions).
+- Reconstruct page layout for downstream rendering or comparison.
+- Run layout-aware document understanding (semantic paragraph roles, table cell spans,
+  formulas in LaTeX, picture classification and alt descriptions).
+
+This skill is **only** for `/extraction/parse`. For PDF generation, conversion, OCR,
+redaction, signing, watermarking, or any `/build`-based workflow, use the sibling
+`document-processor-api` skill.
+
+## Setup
+
+DWS Extract is a separate product from DWS Processor and has its own API key.
+
+- Get a Nutrient DWS Extract API key at <https://dashboard.nutrient.io/>.
+- Export it as `NUTRIENT_EXTRACT_API_KEY`:
+  ```bash
+  export NUTRIENT_EXTRACT_API_KEY="pdf_live_..."
+  ```
+- Scripts live in `scripts/` relative to this SKILL.md. Use the directory containing this
+  SKILL.md as the working directory:
+  ```bash
+  cd <directory containing this SKILL.md> && uv run scripts/<script>.py --help
+  ```
+
+Calling `/extraction/parse` with a DWS Processor key returns `403`. If your tenant has been
+migrated to global DWS API keys, a single key set as either `NUTRIENT_EXTRACT_API_KEY` or
+`NUTRIENT_API_KEY` will work for both products.
+
+## `/extraction/parse` — one primitive, two output shapes
+
+One call returns the full structural document model — typed elements with bounding boxes,
+confidence scores, and reading order — or a whole-document Markdown string. You always
+receive all element types in a single call.
+
+### Picking a mode
+
+Choose based on the user's intent and acceptable credit cost. All costs are
+**extraction credits per page** — a separate billing bucket from the processor API
+credits consumed by `/build`, `/sign`, OCR, and other DWS Processor endpoints.
+
+| User intent | Mode | Output format | Cost | Notes |
+|-------------|------|---------------|------|-------|
+| RAG / search indexing / content migration — born-digital PDF | `text` | `markdown` | 1 cr/pg | Cheapest path; no OCR or AI needed |
+| RAG / search indexing — scanned or image-based PDF | `structure` | `markdown` | 1.5 cr/pg | OCR required before Markdown assembly |
+| Form / invoice extraction | `understand` | `spatial` | 9 cr/pg | AI classification for reliable key-value and table detection |
+| Layout-aware document understanding | `understand` | `spatial` | 9 cr/pg | Semantic paragraph roles (Title, SectionHeader, etc.) |
+| Deep visual understanding (charts, diagrams, alt text) | `agentic` | `spatial` | 18 cr/pg | VLM adds alt descriptions on every picture element |
+| **Default / ambiguous intent** | **`structure`** | **`spatial`** | **1.5 cr/pg** | Good balance: OCR + spatial elements, low cost |
+
+When the user's intent is unclear, **default to `structure` mode with `spatial` output**
+(1.5 extraction credits per page). Explain the cost/quality options and ask if a
+different mode is preferable before running on large documents.
+
+`mode='text'` is incompatible with `output_format='spatial'`; the client rejects the
+combination before the network call.
+
+### Invocation
+
+```bash
+# Default: structure mode, spatial output
+uv run scripts/parse.py --input doc.pdf --out out.json
+
+# Markdown for RAG (text mode — cheapest)
+uv run scripts/parse.py --input doc.pdf --out out.md --output-format markdown --mode text
+
+# Form extraction (understand mode)
+uv run scripts/parse.py --input doc.pdf --out out.json --mode understand
+
+# Agentic (VLM alt text on pictures)
+uv run scripts/parse.py --input doc.pdf --out out.json --mode agentic
+```
+
+The script prints extraction-credit usage after each run so you can verify the cost.
+
+### Downstream consumption
+
+After a single `/parse` call, slice the response for common needs:
+
+- **Reading-order plain text**: walk `output.elements` sorted by `(page.pageIndex, readingOrder)`, join `paragraph` and `handwriting` `text` fields
+- **Tables**: project `cells[]` on each `table` element into rows/columns using `cell.row` and `cell.column`
+- **Key-value pairs**: read `pairs[]` on each `keyValueRegion` element — each pair has `.key.value` and `.value.value`
+- **Formulas**: read `latex` on each `formula` element
+- **Pictures**: read `classification` and `altDescription` (populated by `agentic` mode) on each `picture` element
+- **Markdown output**: call with `--output-format markdown`; the script writes the Markdown string directly
+
+Full patterns with Python snippets and jq one-liners: `references/parse-output-filtering.md`
+
+### Input constraint
+
+`parse.py` only accepts **local file paths** — the underlying API endpoint is
+multipart-only. For remote inputs, download the file first.
+
+## Rules
+
+- One script per skill: `scripts/parse.py`. Do not add new committed scripts for /build
+  workflows here — those belong in the sibling `document-processor-api` skill.
+- Always preserve the printed credit-usage summary in script output so the operator can
+  observe per-call cost.
+- Do not add a URL-fetch shortcut; the endpoint is multipart-only.
diff --git a/plugins/nutrient-dws/skills/document-processor-api/references/parse-output-filtering.md b/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md
similarity index 100%
rename from plugins/nutrient-dws/skills/document-processor-api/references/parse-output-filtering.md
rename to plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md
diff --git a/plugins/nutrient-dws/skills/document-extraction-api/scripts/lib/common.py b/plugins/nutrient-dws/skills/document-extraction-api/scripts/lib/common.py
new file mode 100644
index 0000000..48d2ded
--- /dev/null
+++ b/plugins/nutrient-dws/skills/document-extraction-api/scripts/lib/common.py
@@ -0,0 +1,103 @@
+import json
+import os
+import re
+import sys
+from pathlib import Path
+from typing import Any, NoReturn
+
+_NEGATIVE_VALUE_RE = re.compile(r"^-\d")
+
+
+def create_client():
+    """Create and return a NutrientClient configured for DWS Extract.
+
+    DWS Extract is a separate product from DWS Processor and has its own
+    API key. Reads NUTRIENT_EXTRACT_API_KEY (required); falls back to
+    NUTRIENT_API_KEY if the former is unset, so a single global key works
+    once DWS rolls those out.
+    """
+    extract_api_key = os.environ.get("NUTRIENT_EXTRACT_API_KEY")
+    fallback_key = os.environ.get("NUTRIENT_API_KEY")
+    if not extract_api_key and not fallback_key:
+        raise RuntimeError(
+            "NUTRIENT_EXTRACT_API_KEY is not set. DWS Extract requires its own "
+            "API key (separate from the DWS Processor key). Export it before "
+            "running this skill's scripts."
+        )
+    try:
+        from nutrient_dws import NutrientClient
+    except ImportError as e:
+        raise RuntimeError(
+            "Unable to import nutrient_dws. Install with: uv add 'nutrient-dws>=3.1.0'\n"
+            f"Original error: {e}"
+        ) from e
+    # api_key is required by the constructor; parse() will swap to
+    # extract_api_key when set. Pass the Extract key through both so the
+    # client works whether or not it's also a global key.
+    primary = extract_api_key or fallback_key
+    return NutrientClient(api_key=primary, extract_api_key=extract_api_key)
+
+
+def write_json_output(result: dict, path: str) -> None:
+    """Write a JSON-serialisable result to disk."""
+    out = Path(path)
+    out.parent.mkdir(parents=True, exist_ok=True)
+    with open(out, "w", encoding="utf-8") as f:
+        json.dump(result, f, indent=2)
+    print(f"Wrote {path}")
+
+
+def parse_csv(value: str) -> list[str]:
+    """Split a comma-separated string into a list of trimmed, non-empty strings."""
+    if not value:
+        return []
+    return [item.strip() for item in str(value).split(",") if item.strip()]
+
+
+def assert_local_file(value: str, arg: str) -> str:
+    """Raise if value looks like a URL; otherwise return the path."""
+    v = str(value).strip()
+    if v.startswith("http://") or v.startswith("https://"):
+        raise ValueError(f"--{arg} must be a local file path for this operation.")
+    return v
+
+
+def read_json_file(path: str) -> Any:
+    """Read and parse a JSON file."""
+    try:
+        with open(path, encoding="utf-8") as f:
+            return json.load(f)
+    except json.JSONDecodeError as e:
+        raise ValueError(f"Invalid JSON in file ({path}): {e}") from e
+
+
+def fix_negative_args() -> list[str]:
+    """Return sys.argv[1:] with negative numeric values joined to their flag.
+
+    argparse treats values like ``-1`` or ``-1:3`` as unknown option flags when
+    passed as a separate token. This helper reattaches them using ``=`` so that
+    ``--pages -1`` becomes ``--pages=-1`` before argparse sees the arguments.
+    """
+    argv = sys.argv[1:]
+    result = []
+    i = 0
+    while i < len(argv):
+        arg = argv[i]
+        if (
+            arg.startswith("--")
+            and "=" not in arg
+            and i + 1 < len(argv)
+            and _NEGATIVE_VALUE_RE.match(argv[i + 1])
+        ):
+            result.append(f"{arg}={argv[i + 1]}")
+            i += 2
+        else:
+            result.append(arg)
+            i += 1
+    return result
+
+
+def handle_error(e: Exception) -> NoReturn:
+    """Print the error message and exit with code 1."""
+    print(str(e), file=sys.stderr)
+    sys.exit(1)
diff --git a/plugins/nutrient-dws/skills/document-processor-api/scripts/parse.py b/plugins/nutrient-dws/skills/document-extraction-api/scripts/parse.py
similarity index 95%
rename from plugins/nutrient-dws/skills/document-processor-api/scripts/parse.py
rename to plugins/nutrient-dws/skills/document-extraction-api/scripts/parse.py
index 1c87e8d..64cb7c9 100644
--- a/plugins/nutrient-dws/skills/document-processor-api/scripts/parse.py
+++ b/plugins/nutrient-dws/skills/document-extraction-api/scripts/parse.py
@@ -9,6 +9,10 @@
 One call returns the full structural document model — typed elements with bounding boxes,
 confidence scores, and reading order — or a whole-document Markdown string.
 
+DWS Extract is a separate product from DWS Processor. It uses its own API key, supplied
+via the NUTRIENT_EXTRACT_API_KEY environment variable. Calls to /extraction/parse with a
+DWS Processor key return 403.
+
 Billing note: /extraction/parse is billed against **extraction credits**, which are a
 separate billing bucket from the processor API credits consumed by /build, /sign, OCR,
 and other Processor API endpoints.
@@ -127,7 +131,7 @@ async def main() -> None:
 
     # Print usage summary so callers can see credit cost without opening the output file
     usage = response.get("usage", {})
-    credits_info = usage.get("dataExtractionCredits", {})
+    credits_info = usage.get("data_extraction_credits", {})
     cost = credits_info.get("cost")
     remaining = credits_info.get("remainingCredits")
     metrics = response.get("metrics", {})
diff --git a/plugins/nutrient-dws/skills/document-processor-api/SKILL.md b/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
index 66bfacc..d283235 100644
--- a/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
+++ b/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
@@ -1,13 +1,13 @@
 ---
 name: document-processor-api
 description: >-
-  Process documents with Nutrient DWS. Use when the user wants to generate PDFs from HTML or URLs,
-  convert Office/images/PDFs, assemble or split packets, OCR scans, extract text/tables/key-value
-  pairs, parse documents into a structural model or Markdown (for RAG indexing, form/invoice
-  extraction, or layout-aware understanding), redact PII, watermark, sign, fill forms, optimize
-  PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF, merge
-  these PDFs, OCR this scan, extract tables, parse this document, extract for RAG, redact PII,
-  sign this PDF, make this PDF/A, or linearize for web delivery.
+  Process documents with Nutrient DWS Processor. Use when the user wants to generate PDFs
+  from HTML or URLs, convert Office/images/PDFs, assemble or split packets, OCR scans,
+  extract text/tables/key-value pairs, redact PII, watermark, sign, fill forms, optimize
+  PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF,
+  merge these PDFs, OCR this scan, extract tables, redact PII, sign this PDF, make this
+  PDF/A, or linearize for web delivery. For document understanding / RAG / Markdown / layout
+  via `/extraction/parse`, use the sibling `document-extraction-api` skill.
 license: MIT
 metadata:
   author: nutrient-sdk
@@ -38,85 +38,22 @@ Use Nutrient DWS for managed document workflows where fidelity, compliance, or m
 ## When to use
 - Generate PDFs from HTML templates, uploaded assets, or remote URLs.
 - Convert Office, HTML, image, and PDF files between supported formats.
-- OCR scans and extract text, tables, or key-value pairs.
-- Parse a document into its structural model or whole-document Markdown for RAG indexing, form/invoice extraction, or layout-aware understanding.
+- OCR scans and extract text, tables, or key-value pairs (via `/build` actions).
 - Redact PII, watermark, sign, fill forms, merge, split, rotate, flatten, or encrypt PDFs.
 - Produce delivery targets like PDF/A, PDF/UA, optimized PDFs, or linearized PDFs.
 - Check credits before large, batch, or AI-heavy runs.
 
+For document understanding via `/extraction/parse` (RAG, layout analysis, form/invoice
+extraction with bounding boxes, whole-document Markdown), use the sibling
+`document-extraction-api` skill instead — it is a separate DWS product with its own API key.
+
 ## Tool preference
 1. Prefer `scripts/*.py` for covered single-operation workflows.
 2. Use `assets/templates/custom-workflow-template.py` for multi-step jobs that should still run through the Python client.
 3. Use the modular `references/` docs and direct API payloads for capabilities that do not yet have a dedicated helper script, especially HTML/URL generation and compliance tuning.
 4. Use local PDF utilities only for lightweight inspection. Use Nutrient when output fidelity or compliance matters.
 
-## Data Extraction (`/extraction/parse`)
-
-Use `scripts/parse.py` for any task involving document understanding, content extraction,
-RAG indexing, form data extraction, or layout analysis.
-
-**`/extraction/parse` is a document-understanding primitive**: one call returns the full
-structural document model — typed elements with bounding boxes, confidence scores, and
-reading order — or a whole-document Markdown string. You always receive all element types
-in a single call.
-
-### Picking a mode
-
-Choose based on the user's intent and acceptable credit cost. All costs are
-**extraction credits per page** — a separate billing bucket from the processor API
-credits consumed by `/build`, `/sign`, OCR, and other Processor API endpoints.
-
-| User intent | Mode | Output format | Cost | Notes |
-|-------------|------|---------------|------|-------|
-| RAG / search indexing / content migration — born-digital PDF | `text` | `markdown` | 1 cr/pg | Cheapest path; no OCR or AI needed |
-| RAG / search indexing — scanned or image-based PDF | `structure` | `markdown` | 1.5 cr/pg | OCR required before Markdown assembly |
-| Form / invoice extraction | `understand` | `spatial` | 9 cr/pg | AI classification for reliable key-value and table detection |
-| Layout-aware document understanding | `understand` | `spatial` | 9 cr/pg | Semantic paragraph roles (Title, SectionHeader, etc.) |
-| Deep visual understanding (charts, diagrams, alt text) | `agentic` | `spatial` | 18 cr/pg | VLM adds alt descriptions on every picture element |
-| **Default / ambiguous intent** | **`structure`** | **`spatial`** | **1.5 cr/pg** | Good balance: OCR + spatial elements, low cost |
-
-When the user's intent is unclear, **default to `structure` mode with `spatial` output**
-(1.5 extraction credits per page). Explain the cost/quality options and ask if a
-different mode is preferable before running on large documents.
-
-### Invocation
-
-```bash
-# Default: structure mode, spatial output
-uv run scripts/parse.py --input doc.pdf --out out.json
-
-# Markdown for RAG (text mode — cheapest)
-uv run scripts/parse.py --input doc.pdf --out out.md --output-format markdown --mode text
-
-# Form extraction (understand mode)
-uv run scripts/parse.py --input doc.pdf --out out.json --mode understand
-
-# Agentic (VLM alt text on pictures)
-uv run scripts/parse.py --input doc.pdf --out out.json --mode agentic
-```
-
-The script prints extraction-credit usage after each run so you can verify the cost.
-
-### Downstream consumption
-
-After a single `/parse` call, slice the response for common needs:
-
-- **Reading-order plain text**: walk `output.elements` sorted by `(page.pageIndex, readingOrder)`, join `paragraph` and `handwriting` `text` fields
-- **Tables**: project `cells[]` on each `table` element into rows/columns using `cell.row` and `cell.column`
-- **Key-value pairs**: read `pairs[]` on each `keyValueRegion` element — each pair has `.key.value` and `.value.value`
-- **Formulas**: read `latex` on each `formula` element
-- **Pictures**: read `classification` and `altDescription` (populated by `agentic` mode) on each `picture` element
-- **Markdown output**: call with `--output-format markdown`; the script writes the Markdown string directly
-
-Full patterns with Python snippets and jq one-liners: `references/parse-output-filtering.md`
-
-### Input constraint
-
-`parse.py` only accepts **local file paths** — the underlying API endpoint is
-multipart-only. For remote inputs, download the file first.
-
 ## Single-operation scripts
-- `parse.py` -> document understanding via `/extraction/parse` (structural model or whole-document Markdown)
 - `convert.py` -> convert between `pdf`, `pdfa`, `pdfua`, `docx`, `xlsx`, `pptx`, `png`, `jpeg`, `webp`, `html`, and `markdown`
 - `merge.py` -> merge multiple files into one PDF
 - `split.py` -> split one PDF into multiple PDFs by page ranges
@@ -148,7 +85,6 @@ When the user asks for multiple operations in one run:
 - `split.py` requires a multi-page PDF and cannot extract ranges from a single-page document.
 - `delete-pages.py` must retain at least one page and cannot delete the entire document.
 - `sign.py` only accepts local file paths for the main PDF.
-- `parse.py` only accepts local file paths (the `/extraction/parse` endpoint is multipart-only).
 
 ## Decision rules
 - Prefer a helper script when one already covers the requested operation cleanly.
@@ -177,7 +113,6 @@ Read only what you need:
 - `references/generation-and-conversion.md` -> HTML/URL generation and format conversion
 - `references/pdf-manipulation.md` -> merge, split, page-range, rotate, and flatten workflows
 - `references/extraction-and-ocr.md` -> OCR, text extraction, tables, and key-value workflows
-- `references/parse-output-filtering.md` -> `/extraction/parse` downstream consumption patterns (reading-order text, tables, key-values, formulas, pictures)
 - `references/security-signing-and-forms.md` -> redaction, watermarking, signatures, forms, and passwords
 - `references/compliance-and-optimization.md` -> PDF/A, PDF/UA, optimization, and linearization
 - `references/workflow-recipes.md` -> end-to-end sequencing patterns for common business document workflows

From 7ed44700878fbe7c5f503e46b93b8f729aef90df Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Thu, 28 May 2026 06:02:46 +1200
Subject: [PATCH 3/9] Revert SKILL.md changes in document-processor-api
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The split into document-extraction-api is purely additive — the processor
skill doesn't need cross-links or trimming. Leave it untouched.
---
 .../skills/document-processor-api/SKILL.md    | 19 +++++++------------
 1 file changed, 7 insertions(+), 12 deletions(-)

diff --git a/plugins/nutrient-dws/skills/document-processor-api/SKILL.md b/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
index d283235..7df3bd0 100644
--- a/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
+++ b/plugins/nutrient-dws/skills/document-processor-api/SKILL.md
@@ -1,13 +1,11 @@
 ---
 name: document-processor-api
 description: >-
-  Process documents with Nutrient DWS Processor. Use when the user wants to generate PDFs
-  from HTML or URLs, convert Office/images/PDFs, assemble or split packets, OCR scans,
-  extract text/tables/key-value pairs, redact PII, watermark, sign, fill forms, optimize
-  PDFs, or produce compliance outputs like PDF/A or PDF/UA. Triggers include convert to PDF,
-  merge these PDFs, OCR this scan, extract tables, redact PII, sign this PDF, make this
-  PDF/A, or linearize for web delivery. For document understanding / RAG / Markdown / layout
-  via `/extraction/parse`, use the sibling `document-extraction-api` skill.
+  Process documents with Nutrient DWS. Use when the user wants to generate PDFs from HTML or URLs,
+  convert Office/images/PDFs, assemble or split packets, OCR scans, extract text/tables/key-value
+  pairs, redact PII, watermark, sign, fill forms, optimize PDFs, or produce compliance outputs like
+  PDF/A or PDF/UA. Triggers include convert to PDF, merge these PDFs, OCR this scan, extract tables,
+  redact PII, sign this PDF, make this PDF/A, or linearize for web delivery.
 license: MIT
 metadata:
   author: nutrient-sdk
@@ -38,15 +36,11 @@ Use Nutrient DWS for managed document workflows where fidelity, compliance, or m
 ## When to use
 - Generate PDFs from HTML templates, uploaded assets, or remote URLs.
 - Convert Office, HTML, image, and PDF files between supported formats.
-- OCR scans and extract text, tables, or key-value pairs (via `/build` actions).
+- OCR scans and extract text, tables, or key-value pairs.
 - Redact PII, watermark, sign, fill forms, merge, split, rotate, flatten, or encrypt PDFs.
 - Produce delivery targets like PDF/A, PDF/UA, optimized PDFs, or linearized PDFs.
 - Check credits before large, batch, or AI-heavy runs.
 
-For document understanding via `/extraction/parse` (RAG, layout analysis, form/invoice
-extraction with bounding boxes, whole-document Markdown), use the sibling
-`document-extraction-api` skill instead — it is a separate DWS product with its own API key.
-
 ## Tool preference
 1. Prefer `scripts/*.py` for covered single-operation workflows.
 2. Use `assets/templates/custom-workflow-template.py` for multi-step jobs that should still run through the Python client.
@@ -133,3 +127,4 @@ Read only what you need:
   - Use process env injection at runtime (shell/export, secrets manager, or host env).
 - Restrict file access with `SANDBOX_PATH` to the minimum required working directory.
 - Before enabling MCP mode in production, verify package provenance and lock version.
+

From eb1ae0e95e3ef345b973c985069b825c00768e4c Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Thu, 28 May 2026 06:06:58 +1200
Subject: [PATCH 4/9] Tighten extraction-api skill from review feedback
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

- references/parse-output-filtering.md: snake_case `data_extraction_credits`
  to match the actual response shape (it was camelCase in three places —
  the schema diagram, the Python snippet, and the prose note). Anyone
  following the reference's Python snippet would silently get nothing
  back. Verified against the live API.
- scripts/lib/common.py: use `is None` instead of truthiness for the env
  var checks, so `export NUTRIENT_EXTRACT_API_KEY=` (explicit empty) is
  treated as a misconfiguration to surface, not as "fall back to the
  Processor key". Also drop helpers carried over from the sibling skill's
  common.py that this skill never uses (write_json_output, parse_csv,
  read_json_file, fix_negative_args).
- scripts/parse.py: call assert_local_file() on `--input` so URL inputs
  produce a clear error message instead of leaking through to a
  misleading FileNotFoundError.
---
 .../references/parse-output-filtering.md      |  6 +-
 .../scripts/lib/common.py                     | 69 ++-----------------
 .../document-extraction-api/scripts/parse.py  |  5 +-
 3 files changed, 13 insertions(+), 67 deletions(-)

diff --git a/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md b/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md
index 9c7e525..ae6eda3 100644
--- a/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md
+++ b/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md
@@ -20,7 +20,7 @@ ParseResponse
 │   ├── pagesProcessed
 │   └── processingTimeMs
 └── usage
-    └── dataExtractionCredits
+    └── data_extraction_credits
         ├── cost            — extraction credits used by this call
         └── remainingCredits
 ```
@@ -210,12 +210,12 @@ for pic in pictures:
 
 ```python
 usage = response.get("usage", {})
-credits = usage.get("dataExtractionCredits", {})
+credits = usage.get("data_extraction_credits", {})
 print(f"Cost: {credits.get('cost')} extraction credits")
 print(f"Remaining: {credits.get('remainingCredits')}")
 ```
 
-Note: `dataExtractionCredits` reflects charges from the **extraction credits** bucket,
+Note: `data_extraction_credits` reflects charges from the **extraction credits** bucket,
 which is separate from the **processor API credits** bucket used by `/build`, `/sign`,
 OCR, and other Processor API endpoints.
 
diff --git a/plugins/nutrient-dws/skills/document-extraction-api/scripts/lib/common.py b/plugins/nutrient-dws/skills/document-extraction-api/scripts/lib/common.py
index 48d2ded..a7b6972 100644
--- a/plugins/nutrient-dws/skills/document-extraction-api/scripts/lib/common.py
+++ b/plugins/nutrient-dws/skills/document-extraction-api/scripts/lib/common.py
@@ -1,11 +1,6 @@
-import json
 import os
-import re
 import sys
-from pathlib import Path
-from typing import Any, NoReturn
-
-_NEGATIVE_VALUE_RE = re.compile(r"^-\d")
+from typing import NoReturn
 
 
 def create_client():
@@ -15,10 +10,14 @@ def create_client():
     API key. Reads NUTRIENT_EXTRACT_API_KEY (required); falls back to
     NUTRIENT_API_KEY if the former is unset, so a single global key works
     once DWS rolls those out.
+
+    Uses `is None` rather than truthiness so an explicitly empty
+    NUTRIENT_EXTRACT_API_KEY (`export NUTRIENT_EXTRACT_API_KEY=`) is treated
+    as a misconfiguration to surface, not as "fall back to the other key".
     """
     extract_api_key = os.environ.get("NUTRIENT_EXTRACT_API_KEY")
     fallback_key = os.environ.get("NUTRIENT_API_KEY")
-    if not extract_api_key and not fallback_key:
+    if extract_api_key is None and fallback_key is None:
         raise RuntimeError(
             "NUTRIENT_EXTRACT_API_KEY is not set. DWS Extract requires its own "
             "API key (separate from the DWS Processor key). Export it before "
@@ -31,29 +30,10 @@ def create_client():
             "Unable to import nutrient_dws. Install with: uv add 'nutrient-dws>=3.1.0'\n"
             f"Original error: {e}"
         ) from e
-    # api_key is required by the constructor; parse() will swap to
-    # extract_api_key when set. Pass the Extract key through both so the
-    # client works whether or not it's also a global key.
-    primary = extract_api_key or fallback_key
+    primary = extract_api_key if extract_api_key is not None else fallback_key
     return NutrientClient(api_key=primary, extract_api_key=extract_api_key)
 
 
-def write_json_output(result: dict, path: str) -> None:
-    """Write a JSON-serialisable result to disk."""
-    out = Path(path)
-    out.parent.mkdir(parents=True, exist_ok=True)
-    with open(out, "w", encoding="utf-8") as f:
-        json.dump(result, f, indent=2)
-    print(f"Wrote {path}")
-
-
-def parse_csv(value: str) -> list[str]:
-    """Split a comma-separated string into a list of trimmed, non-empty strings."""
-    if not value:
-        return []
-    return [item.strip() for item in str(value).split(",") if item.strip()]
-
-
 def assert_local_file(value: str, arg: str) -> str:
     """Raise if value looks like a URL; otherwise return the path."""
     v = str(value).strip()
@@ -62,41 +42,6 @@ def assert_local_file(value: str, arg: str) -> str:
     return v
 
 
-def read_json_file(path: str) -> Any:
-    """Read and parse a JSON file."""
-    try:
-        with open(path, encoding="utf-8") as f:
-            return json.load(f)
-    except json.JSONDecodeError as e:
-        raise ValueError(f"Invalid JSON in file ({path}): {e}") from e
-
-
-def fix_negative_args() -> list[str]:
-    """Return sys.argv[1:] with negative numeric values joined to their flag.
-
-    argparse treats values like ``-1`` or ``-1:3`` as unknown option flags when
-    passed as a separate token. This helper reattaches them using ``=`` so that
-    ``--pages -1`` becomes ``--pages=-1`` before argparse sees the arguments.
-    """
-    argv = sys.argv[1:]
-    result = []
-    i = 0
-    while i < len(argv):
-        arg = argv[i]
-        if (
-            arg.startswith("--")
-            and "=" not in arg
-            and i + 1 < len(argv)
-            and _NEGATIVE_VALUE_RE.match(argv[i + 1])
-        ):
-            result.append(f"{arg}={argv[i + 1]}")
-            i += 2
-        else:
-            result.append(arg)
-            i += 1
-    return result
-
-
 def handle_error(e: Exception) -> NoReturn:
     """Print the error message and exit with code 1."""
     print(str(e), file=sys.stderr)
diff --git a/plugins/nutrient-dws/skills/document-extraction-api/scripts/parse.py b/plugins/nutrient-dws/skills/document-extraction-api/scripts/parse.py
index 64cb7c9..6cfa824 100644
--- a/plugins/nutrient-dws/skills/document-extraction-api/scripts/parse.py
+++ b/plugins/nutrient-dws/skills/document-extraction-api/scripts/parse.py
@@ -48,7 +48,7 @@
 from pathlib import Path
 
 sys.path.insert(0, str(Path(__file__).parent))
-from lib.common import create_client, handle_error
+from lib.common import assert_local_file, create_client, handle_error
 
 
 async def main() -> None:
@@ -104,7 +104,8 @@ async def main() -> None:
     )
     args = parser.parse_args()
 
-    # Validate input is a local file (the /extraction/parse endpoint is multipart-only)
+    # Validate input is a local file (the /extraction/parse endpoint is multipart-only).
+    assert_local_file(args.input, "input")
     input_path = Path(args.input)
     if not input_path.exists():
         print(f"Error: input file not found: {args.input}", file=sys.stderr)

From 5176c7bf8fdebf6e951bdbde617eead37d3c5624 Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Thu, 28 May 2026 08:42:48 +1200
Subject: [PATCH 5/9] chore: gitignore skill-creator eval workspace for the new
 skill

---
 .gitignore | 4 ++++
 1 file changed, 4 insertions(+)
 create mode 100644 .gitignore

diff --git a/.gitignore b/.gitignore
new file mode 100644
index 0000000..7cff90e
--- /dev/null
+++ b/.gitignore
@@ -0,0 +1,4 @@
+
+# Skill-creator eval workspaces (local-only)
+plugins/nutrient-dws/skills/document-extraction-api-workspace/
+plugins/nutrient-dws/skills/document-extraction-api/evals/

From 4f77d3a59afd860e860d5c9269a498052bc45391 Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Thu, 28 May 2026 10:09:33 +1200
Subject: [PATCH 6/9] Clarify mode selection in extraction-api skill
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Add an inference principle that walks the request, filename, and intent
to pick the cheapest mode that satisfies every floor — explicitly no
clarifying questions to the user. Replace the vague "ask before large
documents" prose with a concrete 200-credit confirmation threshold.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 .../skills/document-extraction-api/SKILL.md   | 31 +++++++++++++++++--
 1 file changed, 28 insertions(+), 3 deletions(-)

diff --git a/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md b/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
index 817d6b1..0e6eab3 100644
--- a/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
+++ b/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
@@ -67,6 +67,30 @@ Choose based on the user's intent and acceptable credit cost. All costs are
 **extraction credits per page** — a separate billing bucket from the processor API
 credits consumed by `/build`, `/sign`, OCR, and other DWS Processor endpoints.
 
+**Principle — decide from the request alone; do not ask the user clarifying questions.**
+Walk the checks below in order. Each rule that fires sets a minimum mode — the final
+pick is the highest minimum across all rules that fired. If none fired, use the default
+(rule 5).
+
+1. **Explicit features named in the request** are non-negotiable.
+   - Key-value pairs, form fields, semantic role classification (Title / SectionHeader /
+     etc.), formulas, or handwriting → at minimum `understand` (9 cr/pg).
+   - Alt text on pictures, charts, or diagrams → `agentic` (18 cr/pg).
+2. **Document type implied by the request or filename.**
+   - `form`, `invoice`, `receipt`, `application`, `claim` → likely contains key-value
+     pairs → `understand`.
+   - `chart`, `infographic`, or diagram-heavy doc + the user wants descriptions →
+     `agentic`.
+3. **OCR signal from filename or request** (`scanned`, `image-based`, `photographed`,
+   `handwritten`, `screenshot`) → `structure` minimum; `text` mode silently fails on
+   image-only input.
+4. **Output format from intent.** RAG, search indexing, embeddings, or content migration
+   → `markdown`. Layout overlay, per-element processing, or bounded extraction →
+   `spatial`.
+5. **No cues match anything above** → documented default `structure` + `spatial`
+   (1.5 cr/pg). Handles both born-digital and scanned, gives bounded typed elements
+   with table cells, never silently drops content.
+
 | User intent | Mode | Output format | Cost | Notes |
 |-------------|------|---------------|------|-------|
 | RAG / search indexing / content migration — born-digital PDF | `text` | `markdown` | 1 cr/pg | Cheapest path; no OCR or AI needed |
@@ -76,9 +100,10 @@ credits consumed by `/build`, `/sign`, OCR, and other DWS Processor endpoints.
 | Deep visual understanding (charts, diagrams, alt text) | `agentic` | `spatial` | 18 cr/pg | VLM adds alt descriptions on every picture element |
 | **Default / ambiguous intent** | **`structure`** | **`spatial`** | **1.5 cr/pg** | Good balance: OCR + spatial elements, low cost |
 
-When the user's intent is unclear, **default to `structure` mode with `spatial` output**
-(1.5 extraction credits per page). Explain the cost/quality options and ask if a
-different mode is preferable before running on large documents.
+**Confirm before running when the estimated cost exceeds 200 extraction credits** —
+roughly 11 pages of `agentic`, 22 of `understand`, 133 of `structure`, or 200 of `text`.
+Surface the estimate (`pages × cost_per_page`) and ask the operator to confirm before
+invoking. Under that threshold, just run.
 
 `mode='text'` is incompatible with `output_format='spatial'`; the client rejects the
 combination before the network call.

From ff2ec128402e0fae84c4f08768f9ff92f7dd4a23 Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Thu, 28 May 2026 11:50:05 +1200
Subject: [PATCH 7/9] Remove top-level gitignore

The eval workspace it covered is local-only and doesn't need to be
masked from a repo-level rule. Anything regenerated by future
skill-creator runs lands untracked, the same as any other ephemeral
local artefact.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 .gitignore | 4 ----
 1 file changed, 4 deletions(-)
 delete mode 100644 .gitignore

diff --git a/.gitignore b/.gitignore
deleted file mode 100644
index 7cff90e..0000000
--- a/.gitignore
+++ /dev/null
@@ -1,4 +0,0 @@
-
-# Skill-creator eval workspaces (local-only)
-plugins/nutrient-dws/skills/document-extraction-api-workspace/
-plugins/nutrient-dws/skills/document-extraction-api/evals/

From 249a9317f90728b289ce36ff849539ca2fc1b59d Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Thu, 28 May 2026 12:07:14 +1200
Subject: [PATCH 8/9] Reference: point to official docs, suggest filtering
 tools
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Drop the duplicated schema walkthrough from the references doc and link
to the canonical pages on nutrient.io instead. The reference now lists
which tools we suggest for reshaping a `/parse` response (jq, json,
pandas, a LaTeX renderer, a markdown parser) — rather than re-stating
field shapes that are already authoritative upstream.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 .../skills/document-extraction-api/SKILL.md   |   2 +-
 .../references/parse-output-filtering.md      | 264 ++----------------
 2 files changed, 31 insertions(+), 235 deletions(-)

diff --git a/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md b/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
index 0e6eab3..f629e23 100644
--- a/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
+++ b/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
@@ -137,7 +137,7 @@ After a single `/parse` call, slice the response for common needs:
 - **Pictures**: read `classification` and `altDescription` (populated by `agentic` mode) on each `picture` element
 - **Markdown output**: call with `--output-format markdown`; the script writes the Markdown string directly
 
-Full patterns with Python snippets and jq one-liners: `references/parse-output-filtering.md`
+For the canonical response schema and per-mode field availability, see the official docs linked from `references/parse-output-filtering.md`; that file also lists the tools we suggest for filtering and reshaping the response.
 
 ### Input constraint
 
diff --git a/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md b/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md
index ae6eda3..4c306ee 100644
--- a/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md
+++ b/plugins/nutrient-dws/skills/document-extraction-api/references/parse-output-filtering.md
@@ -1,236 +1,32 @@
 # Parse Output — Filtering and Downstream Patterns
 
-`/extraction/parse` returns a single document model in one call. You always receive all
-element types at once — there is no per-type call. This document shows how to slice the
-response into the shapes that downstream pipelines commonly need.
-
-All examples below assume you have already run `parse.py` with `--output-format spatial`
-and saved the response to `out.json`.
-
----
-
-## Response structure
-
-```
-ParseResponse
-├── output
-│   ├── elements[]          (spatial mode)  — typed element list
-│   └── markdown            (markdown mode) — whole-document Markdown string
-├── metrics
-│   ├── pagesProcessed
-│   └── processingTimeMs
-└── usage
-    └── data_extraction_credits
-        ├── cost            — extraction credits used by this call
-        └── remainingCredits
-```
-
-### Element types (discriminated on `type`)
-
-| type             | Key fields                                                      | Modes that produce it         |
-|------------------|-----------------------------------------------------------------|-------------------------------|
-| `paragraph`      | `text`, `role`, `words[]`, `bounds`, `readingOrder`             | all                           |
-| `table`          | `rowCount`, `columnCount`, `cells[]`, `bounds`, `readingOrder`  | structure / understand / agentic |
-| `formula`        | `latex`, `bounds`                                               | understand / agentic          |
-| `picture`        | `classification`, `altDescription`, `bounds`                    | all (agentic adds VLM alt text) |
-| `keyValueRegion` | `pairs[]` (each with `key`/`value` entities + bounds)           | understand / agentic          |
-| `handwriting`    | `text`, `words[]`, `bounds`                                     | understand / agentic          |
-
----
-
-## Reading-order plain text
-
-Walk elements in `(page.pageIndex, readingOrder)` order, collect `text` from
-`paragraph` and `handwriting` elements, join with newlines.
-
-```python
-import json
-
-with open("out.json") as f:
-    response = json.load(f)
-
-elements = response["output"]["elements"]
-
-text_elements = [
-    e for e in elements
-    if e.get("type") in ("paragraph", "handwriting") and e.get("text")
-]
-
-text_elements.sort(
-    key=lambda e: (e.get("page", {}).get("pageIndex", 0), e.get("readingOrder", 0))
-)
-
-plain_text = "\n\n".join(e["text"] for e in text_elements)
-print(plain_text)
-```
-
-### jq equivalent
-
-```bash
-jq -r '
-  [.output.elements[]
-   | select(.type == "paragraph" or .type == "handwriting")
-   | select(.text != null)
-  ]
-  | sort_by([.page.pageIndex // 0, .readingOrder // 0])
-  | .[].text
-' out.json | paste -sd '\n\n' /dev/stdin
-```
-
----
-
-## Tables — rows and columns dict
-
-Each `TableElement` carries a flat `cells[]` list. Reconstruct rows/columns by grouping
-on `row` and `column` (both 0-indexed). Multi-span cells span `rowSpan` rows and
-`colSpan` columns.
-
-```python
-def table_to_grid(table: dict) -> list[list[str]]:
-    """Return a list-of-rows, each row a list of cell text values."""
-    rows = table.get("rowCount", 0)
-    cols = table.get("columnCount", 0)
-    grid = [[""] * cols for _ in range(rows)]
-    for cell in table.get("cells") or []:
-        r, c = cell.get("row", 0), cell.get("column", 0)
-        if r < rows and c < cols:
-            grid[r][c] = cell.get("text", "")
-    return grid
-
-
-tables = [e for e in elements if e.get("type") == "table"]
-for i, table in enumerate(tables):
-    print(f"Table {i} (page {table.get('page', {}).get('pageIndex', 0)}):")
-    for row in table_to_grid(table):
-        print(" | ".join(row))
-```
-
-### jq — extract all table cells as JSON
-
-```bash
-jq '[
-  .output.elements[]
-  | select(.type == "table")
-  | {
-      page: .page.pageIndex,
-      readingOrder,
-      rowCount,
-      columnCount,
-      rows: (
-        [ .cells[]? | {row, col: .column, text} ]
-        | group_by(.row)
-        | map(sort_by(.col) | map(.text))
-      )
-    }
-]' out.json
-```
-
----
-
-## Key-value regions — key/value dict
-
-`keyValueRegion` elements carry a `pairs[]` list. Each pair has a `key` entity and a
-`value` entity, both with a `value` string field.
-
-```python
-kv_regions = [e for e in elements if e.get("type") == "keyValueRegion"]
-for region in kv_regions:
-    for pair in region.get("pairs") or []:
-        key_text = pair.get("key", {}).get("value", "")
-        val_text = pair.get("value", {}).get("value", "")
-        confidence = pair.get("relationshipConfidence")
-        print(f"{key_text!r}: {val_text!r}  (confidence={confidence})")
-```
-
-### jq equivalent
-
-```bash
-jq '[
-  .output.elements[]
-  | select(.type == "keyValueRegion")
-  | .pairs[]?
-  | { key: .key.value, value: .value.value, confidence: .relationshipConfidence }
-]' out.json
-```
-
----
-
-## Filtering by element type
-
-```python
-from typing import Literal
-
-def filter_elements(elements: list[dict], type_: str) -> list[dict]:
-    return [e for e in elements if e.get("type") == type_]
-
-paragraphs   = filter_elements(elements, "paragraph")
-tables       = filter_elements(elements, "table")
-formulas     = filter_elements(elements, "formula")
-pictures     = filter_elements(elements, "picture")
-kv_regions   = filter_elements(elements, "keyValueRegion")
-handwriting  = filter_elements(elements, "handwriting")
-```
-
-### jq
-
-```bash
-# Count by type
-jq '.output.elements | group_by(.type) | map({(.[0].type): length}) | add' out.json
-
-# All tables on page 0
-jq '[.output.elements[] | select(.type == "table" and .page.pageIndex == 0)]' out.json
-```
-
----
-
-## Formulas (LaTeX)
-
-```python
-formulas = [e for e in elements if e.get("type") == "formula" and e.get("latex")]
-for f in formulas:
-    print(f["latex"])
-```
-
----
-
-## Pictures with alt descriptions (agentic mode)
-
-`agentic` mode uses a vision language model to generate `altDescription` on every
-`picture` element. Other modes leave `altDescription` absent or empty.
-
-```python
-pictures = [e for e in elements if e.get("type") == "picture"]
-for pic in pictures:
-    print(f"[{pic.get('classification', 'unknown')}] {pic.get('altDescription', '')}")
-```
-
----
-
-## Checking extraction-credit cost
-
-```python
-usage = response.get("usage", {})
-credits = usage.get("data_extraction_credits", {})
-print(f"Cost: {credits.get('cost')} extraction credits")
-print(f"Remaining: {credits.get('remainingCredits')}")
-```
-
-Note: `data_extraction_credits` reflects charges from the **extraction credits** bucket,
-which is separate from the **processor API credits** bucket used by `/build`, `/sign`,
-OCR, and other Processor API endpoints.
-
----
-
-## Mode selection guide
-
-| Intent | Recommended mode | Cost | Why |
-|--------|-----------------|------|-----|
-| RAG / search indexing / content migration — born-digital PDF | `text` + `markdown` output | 1 cr/pg | No OCR needed; fastest path to a Markdown string |
-| RAG / search indexing — scanned or image PDF | `structure` + `markdown` output | 1.5 cr/pg | OCR required before Markdown assembly |
-| Form / invoice extraction | `understand` + `spatial` output | 9 cr/pg | AI classification needed for reliable key-value and table detection |
-| Layout-aware document understanding | `understand` + `spatial` output | 9 cr/pg | Semantic classification of paragraphs (Title, SectionHeader, etc.) |
-| Deep visual understanding (charts, diagrams) | `agentic` + `spatial` output | 18 cr/pg | VLM generates alt descriptions on every picture element |
-| Default / unknown intent | `structure` + `spatial` output | 1.5 cr/pg | Good balance: spatial elements with OCR, low cost |
-
-All costs are **extraction credits per page** — a separate billing bucket from
-processor API credits.
+The response shape of `/extraction/parse` — element types, field-by-field
+schemas, coordinate spaces, per-mode field availability — is documented
+upstream. Use those pages as the source of truth; this reference only
+suggests which tools to reach for when slicing and reshaping the response.
+
+## Official documentation
+
+- [Document element extraction (spatial output)](https://www.nutrient.io/guides/dws-data-extraction/parsing/extract-document-elements/) —
+  schema for `output.elements`, element types, bounding-box conventions.
+- [Markdown extraction](https://www.nutrient.io/guides/dws-data-extraction/parsing/extract-markdown/) —
+  shape of `output.markdown`.
+- [Processing modes](https://www.nutrient.io/guides/dws-data-extraction/parsing/processing-modes/) —
+  which fields each mode populates (e.g. `altDescription` only with
+  `agentic`; `keyValueRegion` and `formula` only with `understand` or
+  higher).
+- [Coordinate spaces](https://www.nutrient.io/guides/dws-data-extraction/parsing/coordinate-spaces/) —
+  how `bounds` relate to `page.width` / `page.height`.
+
+## Suggested tools
+
+| Task | Tool | Why |
+|---|---|---|
+| Filter or project the spatial JSON response | `jq` | Discriminate on `type` (`paragraph`, `table`, `picture`, …), select by `page.pageIndex` / `readingOrder`, or pull nested fields without writing code. |
+| Walk the response programmatically | the standard `json` module | The response is plain JSON; a recursive walk over `output.elements` is enough for type filtering, reading-order sort, and bounds extraction. |
+| Project tables into rows / columns | `pandas` | Tables come as a flat `cells[]` list with `row` / `column` indices; `pd.DataFrame` reshapes them cleanly. |
+| Render formulas | any LaTeX renderer (MathJax, KaTeX, matplotlib) | `formula` elements carry `latex` strings ready to feed a renderer. |
+| Post-process markdown output (chunk on headings, strip tables, etc.) | `markdown-it-py`, `mistune`, or a regex on `#` lines | `output.markdown` uses standard heading hierarchy. |
+
+For the per-call extraction-credit cost, read `usage.data_extraction_credits.cost`
+directly from the response — no tool needed.

From 9ac402c314f2a87fae413f2060c3cf06386c93e7 Mon Sep 17 00:00:00 2001
From: nickwinder <nfxdevelopment@gmail.com>
Date: Thu, 28 May 2026 12:12:26 +1200
Subject: [PATCH 9/9] Drop one-script-per-skill rule from extraction-api
 SKILL.md
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The rule was overly prescriptive — there's no architectural reason the
skill must stay single-script forever, and the sibling-skill boundary
between data extraction and /build workflows is already implicit in the
skill's purpose. Other rules in the section remain.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 plugins/nutrient-dws/skills/document-extraction-api/SKILL.md | 2 --
 1 file changed, 2 deletions(-)

diff --git a/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md b/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
index f629e23..2fc4f57 100644
--- a/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
+++ b/plugins/nutrient-dws/skills/document-extraction-api/SKILL.md
@@ -146,8 +146,6 @@ multipart-only. For remote inputs, download the file first.
 
 ## Rules
 
-- One script per skill: `scripts/parse.py`. Do not add new committed scripts for /build
-  workflows here — those belong in the sibling `document-processor-api` skill.
 - Always preserve the printed credit-usage summary in script output so the operator can
   observe per-call cost.
 - Do not add a URL-fetch shortcut; the endpoint is multipart-only.