feat: add mm-skill for multi-modal directory indexing and exploration by spillai · Pull Request #10 · vlm-run/skills

spillai · 2026-04-10T17:29:59Z

Summary

Adds a new mm-skill to the skills repository. This skill provides agent instructions for the mm CLI — a high-performance multi-modal context management tool for indexing, exploring, querying, and extracting content from directories containing images, videos, PDFs, code, and other files.

Changes:

New skills/mm-skill/SKILL.md with full command reference (find, cat, grep, sql, wc, bench, config, profile)
Registered the skill in .claude-plugin/marketplace.json
Added the skill to the available skills table in README.md

Review & Testing Checklist for Human

Review skills/mm-skill/SKILL.md content to confirm it is appropriate for this public repository and does not contain any sensitive internal details
Verify the install URL (https://vlm-run.github.io/mm/install/install.sh) is live and correct
Confirm marketplace.json parses as valid JSON and the new plugin entry looks correct

Notes

The SKILL.md was copied verbatim from an internal source; no modifications were made to the content.

Link to Devin session: https://app.devin.ai/sessions/b476c7817877422383b2f73aa1381bd6
Requested by: @spillai

Co-Authored-By: Sudeep Pillai <sudeep.pillai@gmail.com>

devin-ai-integration · 2026-04-10T17:30:02Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 1 additional finding.

gemini-code-assist

Code Review

This pull request introduces the mm-skill, a multi-modal context management CLI, by adding it to the marketplace configuration and providing a detailed SKILL.md guide. The documentation covers various commands for indexing, searching, and extracting content from diverse file types. Feedback focuses on correcting a contradiction regarding document extraction processing levels and consolidating several redundant entries in the tips and modes sections of the documentation.

gemini-code-assist · 2026-04-10T17:31:41Z

+Level 1 behavior by file type (<100ms target):
+
+- **PDF** (.pdf): text extraction via pypdfium2. Scanned/image-only PDFs return empty.
+- **Document** (.docx, .pptx): markdown extraction via docling at L2.


This item is listed under Level 1 behavior but explicitly mentions at L2. Additionally, docling extraction typically takes longer than the 100ms target mentioned in the section header. Please clarify the level and performance expectations for this file type.

gemini-code-assist · 2026-04-10T17:31:41Z

+- Start with `find --tree --depth 1` then `wc --by-kind` for the fastest directory overview.
+- Use `mm find . --tree` to explore project structure.
+- Use `--format json` when you need to parse output programmatically.
+- `find` returns paths only when piped, else it returns full metadata rows.
+- For exhaustive path lists, prefer `mm find <dir> --columns path | tail -n +2` and pipe/redirect as needed.
+- Start with `mm find <dir> --tree --depth 1` then `mm wc <dir> --by-kind` for the fastest directory overview.
+- Use `--format json` when you need to parse output programmatically.
+- `find` returns paths only when piped, full metadata rows in TTY.


This section contains several redundant tips. Lines 288-290 repeat information already provided in lines 283-286. Consolidating them improves the clarity of the instructions for the agent.

Suggested change

- Start with `find --tree --depth 1` then `wc --by-kind` for the fastest directory overview.

- Use `mm find . --tree` to explore project structure.

- Use `--format json` when you need to parse output programmatically.

- `find` returns paths only when piped, else it returns full metadata rows.

- For exhaustive path lists, prefer `mm find <dir> --columns path | tail -n +2` and pipe/redirect as needed.

- Start with `mm find <dir> --tree --depth 1` then `mm wc <dir> --by-kind` for the fastest directory overview.

- Use `--format json` when you need to parse output programmatically.

- `find` returns paths only when piped, full metadata rows in TTY.

- Start with mm find <dir> --tree --depth 1 then mm wc <dir> --by-kind for the fastest directory overview.

- Use mm find . --tree to explore project structure.

- Use --format json when you need to parse output programmatically.

- find returns paths only when piped, full metadata rows in TTY.

- For exhaustive path lists, prefer mm find <dir> --columns path | tail -n +2 and pipe/redirect as needed.

gemini-code-assist · 2026-04-10T17:31:41Z

+- L2 modes: (default, simple LLM captions), `--mode fast` (10 words + tags), `--mode accurate` (200 words + tags).
+- Use `--mode fast` for quick L2 summaries (10 words), `--mode accurate` for detailed ones (200 words).


These two tips provide redundant information about the L2 modes. They should be merged into a single concise instruction.

Suggested change

- L2 modes: (default, simple LLM captions), `--mode fast` (10 words + tags), `--mode accurate` (200 words + tags).

- Use `--mode fast` for quick L2 summaries (10 words), `--mode accurate` for detailed ones (200 words).

- L2 modes: (default, simple LLM captions), --mode fast (10 words + tags), --mode accurate (200 words + tags).

feat: add mm-skill for multi-modal directory indexing and exploration

bb5f0ba

Co-Authored-By: Sudeep Pillai <sudeep.pillai@gmail.com>

devin-ai-integration Bot assigned spillai Apr 10, 2026

devin-ai-integration Bot reviewed Apr 10, 2026

View reviewed changes

gemini-code-assist Bot reviewed Apr 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add mm-skill for multi-modal directory indexing and exploration#10

feat: add mm-skill for multi-modal directory indexing and exploration#10
spillai wants to merge 1 commit intomainfrom
devin/1775842106-add-mm-skill

spillai commented Apr 10, 2026 •

edited by devin-ai-integration Bot

Loading

Uh oh!

devin-ai-integration Bot commented Apr 10, 2026

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Apr 10, 2026

Uh oh!

gemini-code-assist Bot Apr 10, 2026

Uh oh!

gemini-code-assist Bot Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		- L2 modes: (default, simple LLM captions), `--mode fast` (10 words + tags), `--mode accurate` (200 words + tags).
		- Use `--mode fast` for quick L2 summaries (10 words), `--mode accurate` for detailed ones (200 words).

Conversation

spillai commented Apr 10, 2026 • edited by devin-ai-integration Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Review & Testing Checklist for Human

Notes

Uh oh!

devin-ai-integration Bot commented Apr 10, 2026

🤖 Devin AI Engineer

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Apr 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

spillai commented Apr 10, 2026 •

edited by devin-ai-integration Bot

Loading