Skip to content

eperez3181/Myopia-Clusters

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

Myopia Clusters

By: Estela Perez

Background

The purpose of this assignment was to process raw data to fit the machine learning models. Several clustering algorithms were used to explore where the patients could be placed into distinct groups of patients that would be better to analyze separately. All in effort to predict myopia, or nearsightedness.

Steps

Part 1: Prepared the Data

  • Used Pandas DataFrame to read in the myopia.csv file
  • Removed unecessary column from dataset
  • Standardized dataset

Part 2: Applied Dimensionality Reduction

  • Performed dimensionality reduction with PCA
  • Reduced the dataset dimensions with t-SNE
  • Created a scatter plot of the t-SNE output
  • image

Part 3: Performed a Cluster Analysis with K-means

  • Created an elbow plot to identify the best number of clusters
  • image

Part 4: Made a Recommendation

Based on the data findings, the patients would be able to be clustered. By looking at the scatter plot and elbow plot, I would recommend patients be placed into 5 clusters

About

Used several clustering algorithms to explore whether the patients can be placed into distinct groups, all in the efforts to finding better ways to predict myopia, or nearsightedness

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors