[routing] Add LLM-powered ClusteringEngine implementation for semantic tool grouping

## Context

Issue #47 defines the `ClusteringEngine` protocol for partitioning tools into groups. The default implementation will be `JaccardClusterer`, wrapping the existing tag/Jaccard-based logic from `TreeBuilder`. However, no LLM-powered implementation is planned — and this is where the core Pillar 3 value lies: "use an LM to better understand the relationship between tools."

### Current state
- `TreeBuilder` clusters tools by tag overlap using Jaccard similarity — purely lexical.
- Tools with no tag overlap end up in catch-all groups, even when they are semantically related (e.g., "search_emails" and "find_messages" share zero tags but are clearly related).
- The `ClusteringEngine` protocol (#47) will create the extension point. This issue creates the intelligence that plugs into it.

## Why it matters

- **Vision centerpiece** — "Use an LM to understand the relationship between tools" requires semantic understanding of tool descriptions, not just tag matching.
- **Better ChoiceGraph quality** — LLM-grouped trees present more intuitive navigation to the agent: "Communication tools" vs "Data tools" vs "Admin tools" instead of arbitrary tag-based splits.
- **Scale** — At 500+ tools, manual tagging breaks down. Semantic clustering scales without human curation.

## Acceptance Criteria

- [ ] `LLMClusteringEngine` class implementing `ClusteringEngine` protocol (from #47)
- [ ] Accepts an `llm_fn: Callable[[str], str]` parameter — no dependency on any LLM provider
- [ ] Clusters tools by semantic similarity of descriptions using the LLM
- [ ] Prompt template: presents all tool names + descriptions, asks the LLM to propose ~k groups with names and membership
- [ ] Parses structured LLM output (JSON/YAML) into `dict[str, list[SelectableItem]]`
- [ ] Graceful fallback: if LLM output is unparseable, falls back to `JaccardClusterer`
- [ ] Deterministic for same inputs + same LLM response (no randomness in clustering logic itself)
- [ ] Registers in `EngineRegistry` as `"llm"` clustering engine
- [ ] Unit tests with mock `llm_fn` (valid response, invalid response, empty tools, single group)
- [ ] `pyproject.toml`: no new runtime dependencies (LLM is user-provided via callable)

## Implementation Notes

```python
class LLMClusteringEngine:
    def __init__(self, llm_fn: Callable[[str], str], *, fallback: ClusteringEngine | None = None) -> None: ...
    
    def cluster(self, items: list[SelectableItem], k: int) -> dict[str, list[SelectableItem]]:
        prompt = self._build_prompt(items, k)
        response = self.llm_fn(prompt)
        try:
            return self._parse_response(response, items)
        except ParseError:
            if self.fallback:
                return self.fallback.cluster(items, k)
            raise
```

**Files likely touched:**
- `src/contextweaver/engines.py` (or new `src/contextweaver/extras/clustering_llm.py`)
- `tests/test_engines.py`

## Dependencies

- **Requires #47** — `ClusteringEngine` protocol and `EngineRegistry`
- Related: #8 (embedding retrieval — could provide embeddings for clustering instead of full LLM calls)
- Enables: auto-generated ChoiceGraph (using LLM groups as graph structure)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[routing] Add LLM-powered ClusteringEngine implementation for semantic tool grouping #153

Context

Current state

Why it matters

Acceptance Criteria

Implementation Notes

Dependencies

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

[routing] Add LLM-powered ClusteringEngine implementation for semantic tool grouping #153

Description

Context

Current state

Why it matters

Acceptance Criteria

Implementation Notes

Dependencies

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions