Skip to content

DhanshreeMukade/SCT_DA_Task2

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

SCT_DA_Task2

Data Cleaning and Preparation using Python

Overview

This project demonstrates the process of cleaning and preparing a sales dataset using Python and Pandas. The dataset was analyzed to identify missing values, remove duplicate records, and convert data types to improve data quality and consistency.

Objectives

  • Load the sales dataset using Pandas.
  • Identify and handle missing values.
  • Remove duplicate records.
  • Convert columns to appropriate data types.
  • Export the cleaned dataset for further analysis.

Tools and Technologies

  • Python
  • Pandas
  • VS Code
  • CSV Dataset

Data Cleaning Process

  1. Loaded the dataset into Python.
  2. Inspected the dataset structure and data types.
  3. Identified missing values.
  4. Handled missing values.
  5. Removed duplicate rows.
  6. Converted date columns to datetime format.
  7. Exported the cleaned dataset as a new CSV file.

Key Outcomes

  • Improved data quality and consistency.
  • Removed duplicate records.
  • Standardized data formats.
  • Prepared the dataset for analysis and visualization.

About

Data cleaning and preprocessing project using Python and Pandas, including missing value handling, duplicate removal, data type conversion, and xlsx export.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors