Skip to content

Latest commit

 

History

History
40 lines (27 loc) · 1.77 KB

File metadata and controls

40 lines (27 loc) · 1.77 KB

Stack Overflow Data Analysis Project

This project is a personal exploration of data analysis, data wrangling, web scraping, API usage, and data visualization using real-world datasets from Stack Overflow and job postings. The goal is to practice and demonstrate end-to-end data skills in Python, from data collection to insight generation.

Project Structure

  • data_collection/
    • 02_api_access_and_requests.ipynb: Demonstrates how to access APIs and use HTTP requests in Python.
    • 03_explore_dataset.ipynb: Explores and analyzes the Stack Overflow survey dataset.
    • job_data_collection.ipynb: Collects and processes job posting data from APIs and files.
    • web_scraping_basics.ipynb: Shows basic web scraping techniques in Python.
    • web_scraping_review.ipynb: Reviews and practices more advanced web scraping.
    • Jobs_API.ipynb: Implements a simple jobs API using Flask for demonstration.
  • data_wrangling/
    • 01_data_wrangling.ipynb: Covers data cleaning, deduplication, and handling missing values.
  • data_visualization/
    • data_visualization.ipynb: Visualizes survey and job data using Python libraries.
  • data_analysis/
    • exploratory_data_analysis.ipynb: Performs exploratory data analysis on the survey data.
  • data_dashboard/
    • Contains dashboard files and additional datasets for further analysis.

Datasets

  • Stack Overflow survey data (CSV, SQLite)
  • Job postings data (JSON, CSV)

How to Use

  1. Open the notebooks in the order above or jump to any topic of interest.
  2. Run the code cells to see data loading, cleaning, analysis, and visualization in action.
  3. Modify or extend the notebooks for your own experiments.

Created and maintained as a personal learning project. All code and analysis are for educational purposes.