Skip to content

anirva09/retail-data-sql-analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 

Repository files navigation

📊 retail_data-SQL-Analytics

📌 Overview

This project focuses on analyzing retail customer purchase data using SQL and Python to extract meaningful business insights. It covers revenue trends, customer behavior, product performance, and regional analysis.


🎯 Objectives

  • Analyze overall revenue and growth patterns
  • Identify high-value customers and segments
  • Evaluate product and category performance
  • Understand geographic distribution of sales
  • Estimate customer retention and churn

🛠️ Tech Stack

  • SQL (MySQL) – Data querying and transformation
  • Python (Pandas, Matplotlib) – Data analysis & visualization
  • Jupyter Notebook – Development environment

📂 Dataset Description

The dataset includes:


📊 Key Analysis Performed

🔹 Revenue Analysis

  • Total Revenue
  • Average Order Value (AOV)
  • Revenue by City and Product Category

🔹 Customer Analysis

  • Customer Lifetime Value (CLTV)
  • Top Customers by Revenue
  • Repeat Purchase Rate
  • Churn Rate (approximated)

🔹 Product Analysis

  • Top Product Categories
  • Brand-wise Performance

🔹 Geographic Analysis

  • Top Cities by Revenue

🔹 Growth Analysis

  • Monthly and Yearly Revenue Trends

📈 Visualizations

  • 📊 Bar Chart – Revenue by City
  • 📈 Line Chart – Monthly Revenue Trend

🔍 Key Insights

  • A small percentage of customers contribute a large portion of total revenue
  • Certain cities dominate overall sales, indicating regional concentration
  • Specific product categories consistently outperform others
  • Repeat customers significantly impact business revenue

⚠️ Limitations

  • Customer Acquisition Cost (CAC) not calculated due to lack of marketing data
  • Profit estimated as no cost data is available
  • Churn rate approximated based on inactivity

🚀 Future Improvements

  • Build an interactive dashboard using Power BI or Tableau
  • Include real profit and cost analysis
  • Perform advanced segmentation using RFM analysis
  • Add predictive analytics (sales forecasting)

📎 How to Run the Project

  1. Load the dataset into MySQL
  2. Connect MySQL with Python using SQLAlchemy
  3. Run the Jupyter Notebook for analysis and visualization

📌 Project Highlights

  • End-to-end data analysis using SQL + Python
  • Business-focused insights and metrics
  • Clean and structured analytical workflow

About

Retail Data Analytics using SQL and Python

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors