Final project for the Coursera course "Getting and Cleaning Data" from JHU
The data source is available at the following link: https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip

This project contains one R script called run_analysis.R that does the following:
- Merges the training and the test sets to create one data set.
- Extracts only the measurements on the mean and standard deviation for each measurement.
- Uses descriptive activity names to name the activities in the data set.
- Appropriately labels the data set with descriptive variable names.
- Creates a second, independent tidy data set with the average of each variable for each activity and each subject.
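
The steps above can be sketched on a toy data set. This is a minimal illustration, not the contents of run_analysis.R: the data frames, their values, and the column names such as `tBodyAcc.mean.X` are hypothetical stand-ins, and the use of `melt()` and `dcast()` from reshape2 is an assumption based on the script's stated dependency.

```r
library(reshape2)

# Toy stand-ins for the training and test sets (hypothetical values).
train <- data.frame(subject = c(1, 1, 2),
                    activity = c(1, 2, 1),
                    tBodyAcc.mean.X = c(0.20, 0.30, 0.40),
                    tBodyAcc.std.X = c(0.01, 0.02, 0.03),
                    tBodyAcc.max.X = c(0.90, 0.80, 0.70))
test <- data.frame(subject = c(1, 2),
                   activity = c(1, 1),
                   tBodyAcc.mean.X = c(0.40, 0.60),
                   tBodyAcc.std.X = c(0.03, 0.05),
                   tBodyAcc.max.X = c(0.60, 0.50))

# Merge the training and test sets into one data set.
merged <- rbind(train, test)

# Keep only the mean and standard deviation measurements.
wanted <- grep("mean|std", names(merged), value = TRUE)
merged <- merged[, c("subject", "activity", wanted)]

# Replace activity codes with descriptive activity names.
merged$activity <- factor(merged$activity,
                          levels = c(1, 2),
                          labels = c("WALKING", "WALKING_UPSTAIRS"))

# Average each variable for each activity and each subject.
molten <- melt(merged, id.vars = c("subject", "activity"))
tidy <- dcast(molten, subject + activity ~ variable, mean)
print(tidy)
```

Melting to long form first means the averaging step does not need to know how many measurement columns were kept.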

To run the script:

- Download the data source and put it into a folder on your local drive. You will have a `UCI HAR Dataset` folder.
- Put `run_analysis.R` in the parent folder of `UCI HAR Dataset`, then set that folder as your working directory using the `setwd()` function in R.
- Run `source("run_analysis.R")`. It will generate a new file, `tiny_data.txt`, in your working directory.
- The script reads the data file back into the object `my_data` at the end. To check that this worked properly, run the R command `View(my_data)`.
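
The download step can also be done from R. The following is a sketch under stated assumptions: `fetch_dataset()` and the local file name `UCI_HAR_Dataset.zip` are hypothetical and not part of run_analysis.R, and the archive is assumed to unpack into a `UCI HAR Dataset` folder as described above.

```r
data_url <- "https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip"

# Hypothetical helper: download and unpack the archive into the current
# working directory unless the data folder is already present.
# Returns TRUE if the data folder exists afterwards.
fetch_dataset <- function(url,
                          zip_file = "UCI_HAR_Dataset.zip",
                          data_dir = "UCI HAR Dataset") {
  if (!dir.exists(data_dir)) {
    download.file(url, destfile = zip_file, mode = "wb")
    unzip(zip_file)
  }
  dir.exists(data_dir)
}

# Usage, from the folder that contains run_analysis.R:
# fetch_dataset(data_url)
# source("run_analysis.R")
# View(my_data)
```

Skipping the download when the folder already exists keeps repeated runs from fetching the archive again.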

`run_analysis.R` depends on the `reshape2` package.
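
If reshape2 is not yet installed, a guard like the following can be run once before sourcing the script. This is a suggested setup snippet, not part of run_analysis.R, and the CRAN mirror URL is an assumption that can be swapped for any preferred mirror.

```r
# Install reshape2 only if it is missing, then load it.
if (!requireNamespace("reshape2", quietly = TRUE)) {
  install.packages("reshape2", repos = "https://cloud.r-project.org")
}
library(reshape2)
```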