feat: add mm-skill for multi-modal directory indexing and exploration #10

Open
spillai wants to merge 1 commit into main from devin/1775842106-add-mm-skill

@spillai commented Apr 10, 2026

Summary

Adds a new mm-skill to the skills repository. This skill provides agent instructions for the mm CLI — a high-performance multi-modal context management tool for indexing, exploring, querying, and extracting content from directories containing images, videos, PDFs, code, and other files.

Changes:

  • New skills/mm-skill/SKILL.md with full command reference (find, cat, grep, sql, wc, bench, config, profile)
  • Registered the skill in .claude-plugin/marketplace.json
  • Added the skill to the available skills table in README.md

Review & Testing Checklist for Human

  • Review skills/mm-skill/SKILL.md content to confirm it is appropriate for this public repository and does not contain any sensitive internal details
  • Verify the install URL (https://vlm-run.github.io/mm/install/install.sh) is live and correct
  • Confirm marketplace.json parses as valid JSON and the new plugin entry looks correct
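The JSON check in the last item can be scripted. Below is a minimal sketch in Python, not part of this PR. It assumes marketplace.json has a top-level `plugins` array whose entries carry a `name` field; that layout, the entry name `mm-skill`, and the helper name `find_plugin` are assumptions for illustration and should be checked against the actual file.

```python
import json
from typing import Any, Dict, Optional


def find_plugin(manifest_text: str, plugin_name: str) -> Optional[Dict[str, Any]]:
    """Parse manifest text and return the entry named plugin_name, or None.

    Raises json.JSONDecodeError when the text is not valid JSON.
    Assumed (hypothetical) layout: {"plugins": [{"name": ...}, ...]}.
    """
    manifest = json.loads(manifest_text)
    if not isinstance(manifest, dict):
        # A top-level list or scalar does not match the assumed layout.
        return None
    for entry in manifest.get("plugins", []):
        if isinstance(entry, dict) and entry.get("name") == plugin_name:
            return entry
    return None
```

To use it, pass the contents of `.claude-plugin/marketplace.json` as `manifest_text`. A parse error means the file is not valid JSON, and a `None` result means no entry with that name was found under the assumed layout.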

Notes

  • The SKILL.md was copied verbatim from an internal source; no modifications were made to the content.

Link to Devin session: https://app.devin.ai/sessions/b476c7817877422383b2f73aa1381bd6
Requested by: @spillai


Co-Authored-By: Sudeep Pillai <sudeep.pillai@gmail.com>
@devin-ai-integration

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

  • Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
  • Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

  • Disable automatic comment and CI monitoring

@devin-ai-integration (bot) left a comment

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 1 additional finding.

@gemini-code-assist (bot) left a comment

Code Review

This pull request introduces the mm-skill, a multi-modal context management CLI, by adding it to the marketplace configuration and providing a detailed SKILL.md guide. The documentation covers various commands for indexing, searching, and extracting content from diverse file types. Feedback focuses on correcting a contradiction regarding document extraction processing levels and consolidating several redundant entries in the tips and modes sections of the documentation.

Comment thread skills/mm-skill/SKILL.md
Level 1 behavior by file type (<100ms target):

- **PDF** (.pdf): text extraction via pypdfium2. Scanned/image-only PDFs return empty.
- **Document** (.docx, .pptx): markdown extraction via docling at L2.

medium

This item is listed under Level 1 behavior but explicitly says `at L2`. In addition, docling extraction typically takes longer than the 100ms target stated in the section header. Please clarify the level and performance expectations for this file type.

Comment thread skills/mm-skill/SKILL.md
Comment on lines +283 to +290
- Start with `find --tree --depth 1` then `wc --by-kind` for the fastest directory overview.
- Use `mm find . --tree` to explore project structure.
- Use `--format json` when you need to parse output programmatically.
- `find` returns paths only when piped, else it returns full metadata rows.
- For exhaustive path lists, prefer `mm find <dir> --columns path | tail -n +2` and pipe/redirect as needed.
- Start with `mm find <dir> --tree --depth 1` then `mm wc <dir> --by-kind` for the fastest directory overview.
- Use `--format json` when you need to parse output programmatically.
- `find` returns paths only when piped, full metadata rows in TTY.

medium

This section contains several redundant tips. Lines 288-290 repeat information already provided in lines 283-286. Consolidating them improves the clarity of the instructions for the agent.

Suggested change
- Start with `mm find <dir> --tree --depth 1` then `mm wc <dir> --by-kind` for the fastest directory overview.
- Use `mm find . --tree` to explore project structure.
- Use `--format json` when you need to parse output programmatically.
- `find` returns paths only when piped, full metadata rows in TTY.
- For exhaustive path lists, prefer `mm find <dir> --columns path | tail -n +2` and pipe/redirect as needed.
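
The piped-versus-TTY tip describes a common CLI pattern. As an illustration only, a tool can branch on whether stdout is a terminal. This sketch is not taken from mm's source; `render_rows` is a hypothetical helper, and the `kind` and `size` fields are made-up sample data.

```python
import sys
from typing import Dict, Iterable, List


def render_rows(rows: Iterable[Dict[str, object]], is_tty: bool) -> List[str]:
    """Return output lines: full metadata on a terminal, bare paths when piped."""
    lines = []
    for row in rows:
        if is_tty:
            # Interactive use: show every field, tab-separated.
            lines.append("\t".join(str(value) for value in row.values()))
        else:
            # Piped use: emit only the path so downstream tools get clean input.
            lines.append(str(row["path"]))
    return lines


def main() -> None:
    # Hypothetical sample row; a real tool would read these from its index.
    rows = [{"path": "docs/a.pdf", "kind": "pdf", "size": 1024}]
    for line in render_rows(rows, sys.stdout.isatty()):
        print(line)
```

Passing `is_tty` as a parameter rather than reading it inside the function keeps the formatting logic testable without a real terminal.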

Comment thread skills/mm-skill/SKILL.md
Comment on lines +297 to +298
- L2 modes: (default, simple LLM captions), `--mode fast` (10 words + tags), `--mode accurate` (200 words + tags).
- Use `--mode fast` for quick L2 summaries (10 words), `--mode accurate` for detailed ones (200 words).

medium

These two tips provide redundant information about the L2 modes. They should be merged into a single concise instruction.

Suggested change
- L2 modes: (default, simple LLM captions), `--mode fast` (10 words + tags), `--mode accurate` (200 words + tags).
