Author: Jenax
Date: 2026-01-13
This repository contains a short exploratory data analysis (EDA) of the seaborn tips dataset (restaurant bills and tips). The notebook inspects the dataset, summarizes distributions, and visualizes comparisons and relationships between variables.
- Comparison: bar plots / count plots to compare groups (e.g.,
total_billbysex). - Distribution: summary statistics and distribution checks for numeric fields (
total_bill,tip,size). - Relationship: scatter plots and simple visual checks to explore relationships between variables (for example
total_billvstip).
tips-data-analyst-report (2).ipynb— primary notebook with EDA, visualizations and notes.- (Optional) exported CSV/XLSX saved inside the notebook if enabled:
tips.csv,tips.xlsx.
- Open the notebook
tips-data-analyst-report (2).ipynbin Jupyter / JupyterLab / VS Code. - Execute cells from top to bottom. The notebook uses seaborn's built-in
tipsdataset (no external data download required).
- Python 3.x
- pandas
- seaborn
- numpy
- matplotlib
You can install the main dependencies with:
pip install pandas seaborn numpy matplotlib
- The dataset is small and clean (no missing values). One duplicate row was detected.
- Consider statistical tests for group differences (e.g., t-test or non-parametric tests for
total_billbysex). - Additional visualizations: boxplots, violin plots, heatmap of correlations.
- If you want to produce a shareable report, export the notebook to HTML or PDF.