Excel and Python Assignment
Have to install these libraries(Numpy, Pandas,scipy)
pip install numpy
pip install scipy
pip install pandas
pip install sklearn
Part 1: Here I created a pivot table using data of sheet1 & shoed the information following this structure -
a. The data table showed as the Sum of Income as value.
b. The columns is included as the value of Gender and MaritalStatus.
c. The rows is in the following order: Division; Customer Name; ID.
d. In sheet2 there are some IDs. Added a new column to sheet1 and name it “Matched”. Here I have shown the IDs of sheet2 with the IDs of sheet1 and showing the result as True or False Using Formula
- In this part, I have separated the sheet 1 data in a new excel sheet.
- Converted the excel file to csv.
- Loaded the csv file.
- Get rid of the column ID from the data frame.
- Encoded the data to have similar values.
- Useing K-means clustering based on their divisions.
- Submitting the file without omitting the output.