Analyze retail sales data to understand revenue trends, seasonal patterns, and product-level performance, and identify what drives revenue volatility.
Retail sales transactional dataset sourced from Kaggle.
- date
- product_ID
- quantity_sold
- sales_price
- Category
- customer_id
- age
- gender
- purchase_frequency
- total_purchase_amount
The dataset contains mixed date formats, missing values, and real-world data inconsistencies.
- Python
- Pandas
- NumPy
- Matplotlib
- Seaborn
Retail-Sales-Analysis/ ├── Data/ ├── Notebooks/ ├── Images/ ├── Report.pdf ├── README.md
- Parsed mixed-format date values
- Removed duplicate and invalid records
- Handled missing product, quantity, and price values
- Created revenue feature (quantity × price)
- Performed monthly revenue and sales volume analysis
- Identified top-performing products and categories
- Monthly revenue shows high volatility across time
- Sales volume remains comparatively stable month-to-month
- Revenue fluctuations are driven by seasonality and pricing, not order volume
- A small subset of products contributes disproportionately to total revenue
Revenue volatility in retail sales is primarily influenced by seasonal demand and price variation rather than changes in sales volume. Businesses can improve revenue stability through pricing optimization and category-level planning.



