This is a project to demonstrate ability to collect, work with, and clean a data set. The R script, 'run_analysis.R' , does the following:
- Downloads a file containing a dataset
- unzips the file and sets a working directory
- Loads the activity and feature info
- Loads both the training and test datasets, keeping only those columns which reflect a mean or standard deviation
- Loads the activity and subject data for each dataset, and merges those columns with the dataset
- Merges the two datasets
- Converts the 'activity' and 'subject' columns into factors
- Creates a tidy dataset that consists of the average (mean) value of each variable for each subject and activity pair.
The end result of a tidy dataset is shown in the file 'tidy.txt'.