Problem Statement
Skillforge's live mode testing does not support SKILLs that use MCP (Model Context Protocol) tools. When running tests in --mode live:
- The AI API is called correctly
- But MCP tools (e.g.,
search, getIssue, updateIssue) are not passed to the AI model
- SKILLs that depend on MCP tools cannot be tested - the AI cannot execute required tool calls
- The AI either returns error or incomplete output
This limits testing to only SKILLs with pure logic/LLM generation, excluding integration-based SKILLs from live testing.
Proposed Solution
The simple solution of adding tools parameter to LLM API calls is insufficient because skillforge does not handle tool calling workflows:
- Tools parameter alone - LLM may call tools, but skillforge cannot process tool calls
- Multi-turn required - Tool calling requires multiple requests (tool call → execute → result → final answer)
- Result extraction - Skillforge only extracts
content, not tool_calls
What's needed (not implemented):
Changes required:
- Pass
tools parameter to LLM API calls
- Implement tool calling response processing
- Handle multi-turn conversations for tool execution
- Add tool execution endpoint for MCP tools
Alternatives Considered
- Mock mode only - Works for basic output validation, but cannot test real integration scenarios
- Separate integration tests - Requires maintaining separate test infrastructure outside skillforge
- Manual testing - Prone to human error, not suitable for CI/CD
Feature Category
Testing
Impact
Nice to have
Additional Context
SKILL examples requiring MCP tools:
redmine-duplicate-checker - search, getIssue, updateIssue (Redmine)
github-checker - searchIssues, getIssue, createComment (GitHub)
Current state comparison:
| Mode |
Tools Available |
Real Integration |
CI/CD Ready |
| Mock |
❌ No |
❌ No |
✅ Yes |
| Live (current) |
❌ No |
❌ No |
✅ Yes |
| Live (proposed) |
✅ Yes |
✅ Yes |
✅ Yes |
Impact:
- Without this: Teams cannot CI/CD test SKILLs with MCP tools, integration bugs only caught in production
- With this: Complete CI/CD for all SKILL types, early bug detection, single test framework
Problem Statement
Skillforge's live mode testing does not support SKILLs that use MCP (Model Context Protocol) tools. When running tests in
--mode live:search,getIssue,updateIssue) are not passed to the AI modelThis limits testing to only SKILLs with pure logic/LLM generation, excluding integration-based SKILLs from live testing.
Proposed Solution
The simple solution of adding
toolsparameter to LLM API calls is insufficient because skillforge does not handle tool calling workflows:content, nottool_callsWhat's needed (not implemented):
Changes required:
toolsparameter to LLM API callsAlternatives Considered
Feature Category
Testing
Impact
Nice to have
Additional Context
SKILL examples requiring MCP tools:
redmine-duplicate-checker-search,getIssue,updateIssue(Redmine)github-checker-searchIssues,getIssue,createComment(GitHub)Current state comparison:
Impact: