Current version uses deterministic rule-based heuristics (index.js, main.tsx, server.py, etc.).
Future improvement:
- Detect React entry via ReactDOM.render / createRoot
- Detect Express entry via express() initialization
- Detect Python entry via
if __name__ == "__main__" - Detect Next.js via pages/ or app/ directory structure
- Use lightweight AST parsing for higher accuracy
Currently supports common extensions: .js, .jsx, .ts, .tsx, .py, .java, .go, .rs, .cpp, etc.
Future improvement:
- Add support for 100+ language extensions
- Improve classification of config files
- Better separation of source vs build artifacts
Reference extension list: https://gist.github.com/ppisarczky/43962d06686722d26d176fad46879d41
Future enhancement:
- Detect MVC pattern
- Detect Monorepo structure (pnpm-workspace.yaml, turbo.json)
- Detect Microservice layout
- Detect frontend/backend separation automatically
Future improvement:
- Build file-level import graph
- Rank most central files
- Identify core modules
- Suggest learning order dynamically
Future enhancement:
- Detect Docker usage
- Detect GitHub Actions workflows
- Detect testing frameworks (Jest, Vitest, PyTest)
- Detect database usage (Prisma, Mongoose, SQLAlchemy)
After deterministic parsing:
- Feed structured metadata to LLM
- Generate step-by-step learning roadmap
- Suggest contribution strategy
- Explain architecture like a mentor
- Automatically detect if the repository follows a fullstack structure
(e.g., separate
frontend/andbackend/folders). - Classify architecture as:
- Fullstack Monorepo
- Frontend + API Server
- Microservices-based
- Provide separate learning paths for frontend and backend layers.
- Identify shared configuration files (e.g., Docker, CI/CD, env files).
This will allow CodePath to understand multi-layered repositories and generate structured learning guidance per layer.
In the next version, CodeXPath will move beyond structural detection and implement deeper static analysis capabilities.
- Parse file import/export relationships.
- Build dependency graph between modules.
- Identify central/high-impact files.
- Detect circular dependencies.
- Recommend learning order based on dependency depth.
This will allow CodeXPath to suggest which files must be understood first for maximum clarity.
- Detect MongoDB (mongoose)
- Detect PostgreSQL (pg, prisma, sequelize)
- Detect MySQL
- Detect Firebase
- Identify ORM usage
- Detect schema definition files
This enables:
- Database-specific learning recommendations
- Schema-focused roadmap generation
- Detect JWT usage
- Detect Passport.js
- Detect NextAuth
- Detect OAuth providers
- Detect middleware-based auth guards
This allows CodeXPath to:
- Suggest learning about auth flow
- Highlight security-critical files
- Detect REST route patterns
- Detect GraphQL schema files
- Detect Apollo Server usage
- Detect resolvers
This enables:
- API architecture classification
- Backend communication roadmap guidance
- Detect Jest
- Detect Vitest
- Detect Cypress
- Detect Playwright
- Detect testing directories
This allows:
- Quality maturity assessment
- Test-driven learning recommendations
Future roadmap engine will:
- Combine architecture + import graph + framework + database detection
- Generate structured multi-layer learning plans
- Estimate difficulty progression
- Highlight advanced topics automatically
Example:
Step 1: Understand project bootstrap
Step 2: Study routing and request lifecycle
Step 3: Learn authentication flow
Step 4: Analyze database modeling
Step 5: Explore testing patterns
- Estimate project complexity
- Classify as Beginner / Intermediate / Advanced
- Provide learning time estimate
- Detect frontend/backend split
- Generate separate learning tracks per layer
- Identify integration points