Organizations lack an autonomous onboarding system that can guide new software developers through company processes, personalize their onboarding experience based on role and experience, track completion, and generate structured confirmation to HR β all using a structured company knowledge base.
OnboardAI is an intelligent, agentic onboarding assistant that autonomously guides new developers at NovaByte Technologies through their entire onboarding journey β step by step, verified, and personalized.
It doesn't just answer questions. It plans, verifies, adapts, remembers, and acts β firing real integrations, growing an auto-generated knowledge base, scoring readiness, and notifying HR automatically on completion.
| Asset | Link |
|---|---|
| πΉ Video Walkthrough | https://youtu.be/xhb4uEM2kGg |
| π Presentation Slides | https://drive.google.com/file/d/1jiV71kU49CwCPdyDc8lPhFUm-eTgfo8u/view?usp=drivesdk |
- Personalized onboarding paths based on role (Backend / Frontend / DevOps), level (Intern / Junior / Senior), tech stack, and team
- Step-by-step verified guidance β never marks a task complete without proof
- Setup tasks β asks for terminal output (version numbers, logs)
- Knowledge tasks β asks a quiz question from the actual KB docs
- Attendance tasks β asks for ticket number or PR link
- Strike system β 3 failed verification attempts flags the task as
TASK_STRUGGLEfor HR review - Adaptive guidance by level β Interns get copy-paste commands with explanations; Seniors get high-level peer-style guidance
- Proactive tips β context-aware tips delivered before the user asks, based on their role and team
- Context memory β remembers earlier struggles and references them later in the session
| Integration | Trigger | What Happens |
|---|---|---|
| π¬ Slack | All 5 profile fields collected | Welcome message sent to #general, #onboarding, and team channel |
| π GitHub | User requests access or reaches clone step | Repo access provisioned with clone instructions |
| π Jira | User requests Jira access | Board access granted with sprint and permission details |
| π« Starter Tickets | Checklist reaches ticket step | GitHub issue + Jira ticket auto-assigned with full acceptance criteria |
All integrations are simulated (no real API keys required) β designed for demo purposes per hackathon constraints.
- 11 structured resource files from PS-03 loaded and searchable
- Keyword-based RAG with source citations (
from: filename.md) - Auto-generated FAQs β every question asked during onboarding is saved to the KB with developer context, so future hires benefit automatically
- KB grows smarter with every onboarding session
- Session memory β progress saved to
memory/on every task completion - Returning employees β automatically resume from where they left off with a personalised welcome-back message
- Readiness Score β based on task completion, penalised for strikes and struggles
- Risk Score β flags if onboarding was rushed or suspiciously perfect
- Session summary auto-saved to
session_logs/with full breakdown - HR email sent automatically on completion with name, role, team, completed tasks, pending tasks, scores, and flagged items
Tier 1 β RAG (Always Searchable)
company_overview Β· engineering_standards Β· architecture_documentation
setup_guides Β· policies Β· org_structure Β· onboarding_faq
Tier 2 β Agent Logic (Read On-Demand)
employee_personas β persona matching
onboarding_checklists β checklist generation
starter_tickets β ticket assignment
Tier 3 β Templates (Read Once at Generation)
email_templates β HR completion email
User Message
βββΊ Intent Router (Mistral)
βββΊ INFO_GATHERING β Onboarding Concierge persona
βββΊ TECH_QUERY β Senior Tech Lead persona
βββΊ TASK_COMPLETE β QA Verifier persona
βββΊ GENERAL β General Mentor persona
Syrus2026_caffeine_and_commit/
βββ agent.py # Core agent β logic, memory, FAQ, scoring
βββ app.py # Streamlit UI
βββ rag.py # Keyword-based RAG system
βββ email_sender.py # HR email via Gmail SMTP
βββ mock_slack.py # Slack integration (simulated)
βββ mock_jira.py # Jira integration (simulated)
βββ mock_github.py # GitHub integration (simulated)
βββ env_verifier.py # Auto-verifies terminal output for setup tasks
βββ onboarding_flow.py # Structured Q&A intro flow
βββ knowledge_base/ # 11 PS-03 resource files + auto-generated FAQs
βββ session_logs/ # Per-session summaries with scores (.md)
βββ memory/ # Persistent employee progress (.json)
βββ logo.png # App favicon and chat avatar
βββ .env # API keys (not committed)
βββ requirements.txt
git clone https://github.com/CMPN-CODECELL/Syrus2026_caffeine_and_commit
cd Syrus2026_caffeine_and_commitpython -m venv .venv
# Windows
.venv\Scripts\activate
# Mac / Linux
source .venv/bin/activatepip install -r requirements.txtCreate a .env file in the project root:
# Mistral AI
MISTRAL_API_KEY=your_mistral_api_key
MISTRAL_MODEL=mistral-small-latest
# Gmail SMTP (for HR email)
SENDER_EMAIL=your_gmail@gmail.com
SENDER_PASSWORD=your_gmail_app_password # App Password, not account password
HR_EMAIL=hr@novabyte.com
SMTP_HOST=smtp.gmail.com
SMTP_PORT=587Note on Gmail App Password: Go to Google Account β Security β 2-Step Verification β App Passwords. Generate one for "Mail".
streamlit run app.pystreamlit==1.43.0 # UI framework
mistralai==1.5.1 # LLM (Mistral AI)
chromadb==0.6.3 # Vector store
python-dotenv==1.0.1 # Environment variable management
Pillow # Logo/favicon support
# Email uses stdlib smtplib β no extra package needed
Install all at once:
pip install -r requirements.txt-
User introduces themselves
"Hi, I'm Riya Sharma. I'm a Backend Intern on the Payments team working with Node.js."
-
Agent detects persona β loads personalized 6-step checklist β fires Slack welcome automatically
-
Agent guides step-by-step with verification
- Node.js setup β asks for
node --versionterminal output - Codebase walkthrough β asks a quiz question from
architecture_documentation.md
- Node.js setup β asks for
-
Integrations fire automatically
- GitHub repo access provisioned with clone instructions
- Jira board access granted with sprint details
-
Starter ticket auto-assigned
- GitHub issue
#47+ Jira ticketPAY-247assigned with acceptance criteria
- GitHub issue
-
On completion
- Session summary saved to
session_logs/ - HR email sent with Readiness Score and Risk Score
- Auto-generated FAQ added to
knowledge_base/auto_generated_faqs.md
- Session summary saved to
| Criteria | How We Address It |
|---|---|
| Agent Intelligence (30%) | Multi-agent router, verified task completion, adaptive guidance by level, proactive tips, strike system |
| Knowledge Retrieval (20%) | Keyword RAG with source citations, hallucination prevention via strict KB scope |
| System Architecture (20%) | 3-tier MCP-style design, modular agent + integrations + RAG separation |
| Completion Reporting (10%) | Structured HTML/plain-text HR email with confidence + suspicion scores |
| Bonus Integrations (10%) | GitHub, Slack, Jira simulation; env auto-verifier; auto-generated FAQ KB |
| Name |
|---|
| Mahek Hingorani |
| Navya Rangwani |
| Khushi Dayaramani |
| Twinkle Gupta |
| Layer | Technology |
|---|---|
| LLM | Mistral AI (mistral-small-latest) |
| UI | Streamlit |
| RAG | Custom keyword-based retrieval |
Gmail SMTP (smtplib) |
|
| Integrations | Simulated Slack, GitHub, Jira |
| Storage | Local JSON (memory) Β· Markdown (session logs, FAQs) |
| Favicon & Avatar | Pillow (logo.png) |
