"Bridging the gap between scalable engineering and actionable data insights."
I am a Data-driven Software Engineer with a Master’s in Computer Science from the University of Southern California (GPA 3.7/4.0). I specialize in building high-performance data pipelines and real-time analytics systems. My unique edge is the ability to not just analyze data, but to architect scalable systems that handle it.
🎯 Current Focus:
- Mastering Low-Level Design (LLD) for data systems
- Real-time Stream Analytics (Kafka/Spark)
| Project | Project Details | Tech Stack |
|---|---|---|
| Startup Funding Analysis | Architected a Playwright-based ELT pipeline to centralize fragmented funding data into BigQuery, driving real-time investment insights via Power BI. | Python BigQuery PowerBI Playwright |
| Multi-AI Agent System | Deployed a multi-agent system using CrewAI and LangChain to automate financial reporting, slashing processing time from hours to minutes | crewAI Python LangChain |
| Pandas Open Source | Fixed core bug in Series.isin() affecting large-scale data processing and improved documentation (45k+ stars). |
Python pytest CI/CD |
- 3rd Place Winner at the Alteryx Datathon (Irvine, 2023).
- Open Source Contributor to Pandas (Fixed bugs in
Series.isin()and improved documentation). - Section Leader @ Stanford Code in Place 2025: Mentoring students in Python (Top 900 globally).
- CS Mentor @ Microsoft TEALS: Taught OOPS and Python to 30+ high school students.




