This project analyzes global layoffs across various industries and companies. The data highlights the impact of layoffs on employees, companies, and industries. The main goals are to:
- Identify trends in layoffs across time, regions, and industries.
- Quantify the severity of layoffs and understand their correlation with factors such as funding stage.
- Provide actionable insights through visualizations and statistical analyses.
The dataset contains 1,995 records with the following columns:
| Column | Description |
|---|---|
| company | Name of the company. |
| location | Location of the company (city). |
| industry | Type of industry (e.g., Media, Retail, Technology). |
| total_laid_off | Total number of employees laid off. |
| percentage_laid_off | Percentage of employees laid off. |
| date | Date when the layoffs occurred. |
| stage | Funding stage of the company (e.g., Series B, Post-IPO). |
| country | Country where the company is based. |
| funds_raised_millions | Total funds raised by the company in millions of dollars. |
The data cleaning process involved the following steps:
- Removing Duplicates: A staging table was created, and duplicate records were removed to maintain data integrity.
- Handling Null Values: Key columns such as
total_laid_offandindustrywere carefully imputed or cleaned to avoid skewing analyses. - Standardization: Date formats and text columns were standardized for consistency.
- Column Selection: Unnecessary columns were dropped for a cleaner dataset.
The EDA process included the following:
- Data Loading:
- The
layoffs_staging2.csvcleaned dataset was used.
-
Layoffs By Industry:
- Correlation between
industryandtotal_laid_off.
- Correlation between
-
Trends in Layoffs over Time:
- Layoffs By Month.
- Layoffs By Year.
-
Layoffs By Stage of Funding:
- Correlation between
stageandtotal_laid_off.
- Correlation between
-
Countries with the Most Layoffs
- Correlation between
countryandtotal_laid_off.
- Correlation between
-
Additional Insights:
- Companies with 100% layoffs were identified.
- Top companies with the highest total layoffs were listed.
- Industries Most Affected: The Consumer and Retail sectors experienced the highest layoffs, driven by economic downturns and shifts in business models.
- Funding Stages: Companies at the Post-IPO stage of funding were more vulnerable to layoffs, showing that companies that go public are most vulnerable to layoffs.
- Geographic Distribution: The United States and India were the most affected countries in terms of layoffs, reflecting their large startup ecosystems.
- Trends Over Time: Layoffs peaked during periods of economic uncertainty and decreased as market conditions stabilized.
- Support Vulnerable Sectors: Focus on providing support and funding to industries most affected by layoffs, especially during economic downturns.
- Monitor Company Funding Stages: Monitor companies at specific funding stages for early signs of mass layoffs. Especially companies at the Post-IPO stage.
- Geographic Focus: Policymakers should provide region-specific support, especially in countries with high layoff rates.
- Improve Data Collection: More granular data on employee roles and layoff reasons can enhance predictive models for future workforce planning.
-
Data Cleaning:
- Refer to
01_Data_Cleaning.sqlfor SQL scripts used in data cleaning. - Ensure the cleaned dataset is ready for analysis.
- Refer to
-
EDA:
- Refer to
02_EDA.sqlandworld_layoffs_eda.ipynbfor exploratory data analysis. - Review the notebook as well as the
world_layoffs_report.pdffor visualizations and insights.
- Refer to
-
Insights and Recommendations:
- Summarized in this README file, Jupyter Notebook as well as the
world_layoffs_report.pdfreport.
- Summarized in this README file, Jupyter Notebook as well as the
- Build predictive models to forecast future layoffs.
- Incorporate additional datasets for a more comprehensive analysis.
- Explore deeper causal relationships using advanced statistical methods.
- The dataset was curated from publicly available information on layoffs.
- Data cleaning and preparation scripts were authored to ensure data quality.