Skip to content

CMPN-CODECELL/Syrus2026_caffeine_and_commit

Repository files navigation

OnboardAI Logo

OnboardAI β€” Autonomous Developer Onboarding Agent

NovaByte Technologies Β Β·Β  Syrus 2026 Β Β·Β  Team Caffeine & Commit


πŸ“Œ Problem Statement β€” PS-03

Organizations lack an autonomous onboarding system that can guide new software developers through company processes, personalize their onboarding experience based on role and experience, track completion, and generate structured confirmation to HR β€” all using a structured company knowledge base.


πŸ’‘ What We Built

OnboardAI is an intelligent, agentic onboarding assistant that autonomously guides new developers at NovaByte Technologies through their entire onboarding journey β€” step by step, verified, and personalized.

It doesn't just answer questions. It plans, verifies, adapts, remembers, and acts β€” firing real integrations, growing an auto-generated knowledge base, scoring readiness, and notifying HR automatically on completion.


🎬 Demo & Presentation

Asset Link
πŸ“Ή Video Walkthrough https://youtu.be/xhb4uEM2kGg
πŸ“Š Presentation Slides https://drive.google.com/file/d/1jiV71kU49CwCPdyDc8lPhFUm-eTgfo8u/view?usp=drivesdk

✨ Core Features

🧠 Agent Intelligence

  • Personalized onboarding paths based on role (Backend / Frontend / DevOps), level (Intern / Junior / Senior), tech stack, and team
  • Step-by-step verified guidance β€” never marks a task complete without proof
    • Setup tasks β†’ asks for terminal output (version numbers, logs)
    • Knowledge tasks β†’ asks a quiz question from the actual KB docs
    • Attendance tasks β†’ asks for ticket number or PR link
  • Strike system β€” 3 failed verification attempts flags the task as TASK_STRUGGLE for HR review
  • Adaptive guidance by level β€” Interns get copy-paste commands with explanations; Seniors get high-level peer-style guidance
  • Proactive tips β€” context-aware tips delivered before the user asks, based on their role and team
  • Context memory β€” remembers earlier struggles and references them later in the session

πŸ”— Integrations (MCP-Style 3-Tier Architecture)

Integration Trigger What Happens
πŸ’¬ Slack All 5 profile fields collected Welcome message sent to #general, #onboarding, and team channel
πŸ™ GitHub User requests access or reaches clone step Repo access provisioned with clone instructions
πŸ“‹ Jira User requests Jira access Board access granted with sprint and permission details
🎫 Starter Tickets Checklist reaches ticket step GitHub issue + Jira ticket auto-assigned with full acceptance criteria

All integrations are simulated (no real API keys required) β€” designed for demo purposes per hackathon constraints.

πŸ“š Knowledge Base & RAG

  • 11 structured resource files from PS-03 loaded and searchable
  • Keyword-based RAG with source citations (from: filename.md)
  • Auto-generated FAQs β€” every question asked during onboarding is saved to the KB with developer context, so future hires benefit automatically
  • KB grows smarter with every onboarding session

πŸ’Ύ Memory & Persistence

  • Session memory β€” progress saved to memory/ on every task completion
  • Returning employees β€” automatically resume from where they left off with a personalised welcome-back message

πŸ“Š Scoring & HR Reporting

  • Readiness Score β€” based on task completion, penalised for strikes and struggles
  • Risk Score β€” flags if onboarding was rushed or suspiciously perfect
  • Session summary auto-saved to session_logs/ with full breakdown
  • HR email sent automatically on completion with name, role, team, completed tasks, pending tasks, scores, and flagged items

πŸ—οΈ Architecture

3-Tier Agent Design (MCP-Style)

Tier 1 β€” RAG (Always Searchable)
  company_overview Β· engineering_standards Β· architecture_documentation
  setup_guides Β· policies Β· org_structure Β· onboarding_faq

Tier 2 β€” Agent Logic (Read On-Demand)
  employee_personas β†’ persona matching
  onboarding_checklists β†’ checklist generation
  starter_tickets β†’ ticket assignment

Tier 3 β€” Templates (Read Once at Generation)
  email_templates β†’ HR completion email

Multi-Agent Router

User Message
    └─► Intent Router (Mistral)
            β”œβ”€β–Ί INFO_GATHERING   β†’ Onboarding Concierge persona
            β”œβ”€β–Ί TECH_QUERY       β†’ Senior Tech Lead persona
            β”œβ”€β–Ί TASK_COMPLETE    β†’ QA Verifier persona
            └─► GENERAL          β†’ General Mentor persona

File Structure

Syrus2026_caffeine_and_commit/
β”œβ”€β”€ agent.py                  # Core agent β€” logic, memory, FAQ, scoring
β”œβ”€β”€ app.py                    # Streamlit UI
β”œβ”€β”€ rag.py                    # Keyword-based RAG system
β”œβ”€β”€ email_sender.py           # HR email via Gmail SMTP
β”œβ”€β”€ mock_slack.py             # Slack integration (simulated)
β”œβ”€β”€ mock_jira.py              # Jira integration (simulated)
β”œβ”€β”€ mock_github.py            # GitHub integration (simulated)
β”œβ”€β”€ env_verifier.py           # Auto-verifies terminal output for setup tasks
β”œβ”€β”€ onboarding_flow.py        # Structured Q&A intro flow
β”œβ”€β”€ knowledge_base/           # 11 PS-03 resource files + auto-generated FAQs
β”œβ”€β”€ session_logs/             # Per-session summaries with scores (.md)
β”œβ”€β”€ memory/                   # Persistent employee progress (.json)
β”œβ”€β”€ logo.png                  # App favicon and chat avatar
β”œβ”€β”€ .env                      # API keys (not committed)
└── requirements.txt

πŸš€ Setup & Run

1. Clone the repository

git clone https://github.com/CMPN-CODECELL/Syrus2026_caffeine_and_commit
cd Syrus2026_caffeine_and_commit

2. Create and activate a virtual environment

python -m venv .venv

# Windows
.venv\Scripts\activate

# Mac / Linux
source .venv/bin/activate

3. Install dependencies

pip install -r requirements.txt

4. Configure environment variables

Create a .env file in the project root:

# Mistral AI
MISTRAL_API_KEY=your_mistral_api_key
MISTRAL_MODEL=mistral-small-latest

# Gmail SMTP (for HR email)
SENDER_EMAIL=your_gmail@gmail.com
SENDER_PASSWORD=your_gmail_app_password   # App Password, not account password
HR_EMAIL=hr@novabyte.com
SMTP_HOST=smtp.gmail.com
SMTP_PORT=587

Note on Gmail App Password: Go to Google Account β†’ Security β†’ 2-Step Verification β†’ App Passwords. Generate one for "Mail".

5. Run the application

streamlit run app.py

πŸ“¦ Dependencies

streamlit==1.43.0       # UI framework
mistralai==1.5.1        # LLM (Mistral AI)
chromadb==0.6.3         # Vector store
python-dotenv==1.0.1    # Environment variable management
Pillow                  # Logo/favicon support
# Email uses stdlib smtplib β€” no extra package needed

Install all at once:

pip install -r requirements.txt

🎬 Sample Demo Flow

  1. User introduces themselves

    "Hi, I'm Riya Sharma. I'm a Backend Intern on the Payments team working with Node.js."

  2. Agent detects persona β†’ loads personalized 6-step checklist β†’ fires Slack welcome automatically

  3. Agent guides step-by-step with verification

    • Node.js setup β†’ asks for node --version terminal output
    • Codebase walkthrough β†’ asks a quiz question from architecture_documentation.md
  4. Integrations fire automatically

    • GitHub repo access provisioned with clone instructions
    • Jira board access granted with sprint details
  5. Starter ticket auto-assigned

    • GitHub issue #47 + Jira ticket PAY-247 assigned with acceptance criteria
  6. On completion

    • Session summary saved to session_logs/
    • HR email sent with Readiness Score and Risk Score
    • Auto-generated FAQ added to knowledge_base/auto_generated_faqs.md

πŸ“Š Evaluation Criteria Coverage

Criteria How We Address It
Agent Intelligence (30%) Multi-agent router, verified task completion, adaptive guidance by level, proactive tips, strike system
Knowledge Retrieval (20%) Keyword RAG with source citations, hallucination prevention via strict KB scope
System Architecture (20%) 3-tier MCP-style design, modular agent + integrations + RAG separation
Completion Reporting (10%) Structured HTML/plain-text HR email with confidence + suspicion scores
Bonus Integrations (10%) GitHub, Slack, Jira simulation; env auto-verifier; auto-generated FAQ KB

πŸ‘₯ Team β€” Caffeine & Commit

Name
Mahek Hingorani
Navya Rangwani
Khushi Dayaramani
Twinkle Gupta

πŸ› οΈ Tech Stack

Layer Technology
LLM Mistral AI (mistral-small-latest)
UI Streamlit
RAG Custom keyword-based retrieval
Email Gmail SMTP (smtplib)
Integrations Simulated Slack, GitHub, Jira
Storage Local JSON (memory) Β· Markdown (session logs, FAQs)
Favicon & Avatar Pillow (logo.png)

Built with β˜• and commits by Team Caffeine & Commit Β Β·Β  Syrus 2026 Β Β·Β  NovaByte Technologies

About

Repository for caffeine and commit for Syrus-2026

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors