Cleaning Data Course Project

The raw data

The raw data contains 561 unlabeled features along with subject and activity identifiers.

features.txt - Contains the labels for the 561 features.
X_train.txt & X_test.txt - Contain the accelerometer and gyroscope measurements for the subjects and activities.
subject_train.txt & subject_test.txt - Contain a vector of numbers (1-30) that identify the subject for each observation.
y_train.txt & y_test.txt - Contain a vector of numbers (1-6) that identify the activity performed by each subject.
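
As a rough illustration of how these files fit together, the sketch below reads one split and attaches the feature labels as column names. The helper name read_split, the data_dir argument, and the assumed folder layout (features.txt at the top level, the other files inside train/ and test/) are illustrative assumptions, not taken from run_analysis.R.

```r
# Hypothetical helper (not from run_analysis.R): read one split of the raw data.
# Assumes `data_dir` holds features.txt plus train/ and test/ subfolders.
read_split <- function(data_dir, split) {
  # features.txt has two columns: a running index and the feature label
  features <- read.table(file.path(data_dir, "features.txt"),
                         stringsAsFactors = FALSE)

  # Build e.g. <data_dir>/train/X_train.txt from a prefix such as "X"
  path <- function(prefix) {
    file.path(data_dir, split, paste0(prefix, "_", split, ".txt"))
  }

  measures <- read.table(path("X"))
  names(measures) <- features[[2]]          # label the unlabeled features

  subject  <- read.table(path("subject"))[[1]]   # subject identifiers
  activity <- read.table(path("y"))[[1]]         # activity codes

  # cbind on a data frame keeps the original feature names unchanged
  cbind(subject = subject, activity = activity, measures)
}
```

Calling read_split(data_dir, "train") and read_split(data_dir, "test") would then give two tables with identical columns that can be stacked with rbind().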

The run_analysis.R script

Reads the UCI HAR data tables into R.
Binds the data tables.
Gives the activity numbers (1-6) descriptive labels.
Changes the activity class from character to factor.
Excludes features whose names do not contain mean() or std().
Melts the data set into a long table.
Calculates the average measurement for each signal by subject and activity.
Writes a .txt file of the data table to the working directory.
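
The steps above can be sketched on a toy table as follows. This is a minimal base-R approximation, not the actual contents of run_analysis.R: the toy values are made up, the melt step is done by hand rather than with a reshaping package, the activity labels are assumed to match the dataset's activity_labels.txt, and the output goes to a temporary directory instead of the working directory.

```r
# Toy stand-ins for the train and test tables (the real data has 561 features).
train <- data.frame(subject = c(1, 1, 1), activity = c(1, 1, 2),
                    `tBodyAcc-mean()-X` = c(0.2, 0.4, 0.6),
                    `tBodyAcc-std()-X`  = c(0.1, 0.3, 0.5),
                    `tBodyAcc-max()-X`  = c(0.9, 0.8, 0.7),
                    check.names = FALSE)
test <- train
test$subject <- 2

# Bind the tables.
har <- rbind(train, test)

# Give the activity numbers descriptive labels and store them as a factor
# (labels assumed to follow the dataset's activity_labels.txt).
har$activity <- factor(har$activity, levels = 1:6,
                       labels = c("WALKING", "WALKING_UPSTAIRS",
                                  "WALKING_DOWNSTAIRS", "SITTING",
                                  "STANDING", "LAYING"))

# Keep only features whose names contain mean() or std().
measure_cols <- grep("mean\\(\\)|std\\(\\)", names(har), value = TRUE)

# Melt into a long table: one row per subject, activity, signal and value.
long <- data.frame(
  subject  = rep(har$subject,  times = length(measure_cols)),
  activity = rep(har$activity, times = length(measure_cols)),
  signal   = rep(measure_cols, each = nrow(har)),
  value    = unlist(har[measure_cols], use.names = FALSE)
)

# Average each signal by subject and activity.
tidy <- aggregate(value ~ subject + activity + signal, data = long, FUN = mean)
names(tidy)[names(tidy) == "value"] <- "average"

# Write the result (to a temp directory here, to avoid cluttering the
# working directory).
out_file <- file.path(tempdir(), "HAR_tidy.txt")
write.table(tidy, out_file, row.names = FALSE)
```

The long layout keeps one averaged value per row, so adding or dropping signals changes the number of rows rather than the number of columns.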

To read the data table back into R, use the command read.table("./HAR_tidy.txt", header = TRUE).

Codebook for the tidy dataset

The repo contains a codebook, CODEBOOK.md, that describes the variables in the tidy dataset.
