Complete end-to-end data analytics project demonstrating the full workflow from data cleaning to insights using Python, SQL, and Power BI.
This project analyzes customer shopping behavior to uncover patterns, trends, and insights that drive business decisions. It combines data engineering, exploratory analysis, and interactive visualization to tell a complete data story.
Key Value: Transforms raw customer data into actionable business insights through rigorous analysis and compelling visualizations.
- Understand Customer Behavior: Identify purchasing patterns across demographics and categories
- Segment Analysis: Discover distinct customer segments and their characteristics
- Trend Identification: Uncover seasonal and temporal trends in customer activity
- Business Insights: Provide data-driven recommendations for marketing and sales strategies
Customer_Behaviour_Analysis/
├── customer_shopping_behavior.csv # Raw data source
├── data_cleaning.py # Data preprocessing
├── customer_behaviour_analysis.ipynb # Complete analysis notebook
├── customer_behaviour_queries.sql # SQL queries
├── customer_behaviour_dashboard.pbix # Power BI dashboard
└── README.md # Documentation
- Python: Pandas, NumPy, Matplotlib, Seaborn
- Data Cleaning: Handling missing values, outliers, data type conversions
- Exploratory Data Analysis (EDA): Statistical analysis and visualization
- SQL: MySQL for querying and data aggregation
- Query Optimization: Efficient data retrieval and transformation
- Power BI: Interactive dashboards and reports
- Data Visualization: Creating compelling visual narratives
- Total Customers Analyzed: 3.9K
- Gender Distribution: Female: 1.248K Male: 2.652K
- Most Popular Product Categories: Clothing
- Average Purchase Value: $59.76
pip install pandas numpy matplotlib seaborn jupyterpython data_cleaning.pyjupyter notebook customer_behaviour_analysis.ipynbRun queries in customer_behaviour_queries.sql
Open customer_behaviour_dashboard.pbix in Power BI Desktop
✅ Customer Overview: Total customers, average age, gender distribution ✅ Purchase Analytics: Transaction volume, average order value ✅ Seasonal Trends: Month-over-month analysis ✅ Segment Performance: Metrics across customer segments
- Data Analysis & Statistical Analysis
- SQL Query Optimization
- Python Data Manipulation
- Data Visualization & Storytelling
- Business Intelligence
Interested in collaborating? Feel free to reach out on LinkedIn
Last Updated: January 11, 2026 Author: Charles Maina Irungu | GitHub | LinkedIn