Skip to content

Latest commit

 

History

History
13 lines (7 loc) · 620 Bytes

File metadata and controls

13 lines (7 loc) · 620 Bytes

The goal of this project was to explore and integrate heart disease datasets from four regions (California, Cleveland, Hungary, and Switzerland) using Python and relational database principles.

Key tasks included:

  1. Data exploration and preprocessing using Pandas in Python

  2. Designing an Entity–Relationship (ER) diagram to model how the datasets relate in a relational database system

  3. Creating a normalized heart disease database following 1NF, 2NF, and 3NF using SQLite3

  4. Inserting the regional datasets into the database

  5. Performing SQL queries to analyze and retrieve information from the database