Skip to content
This repository has been archived by the owner on Jul 20, 2021. It is now read-only.
/ sotu_cluster Public archive

Text Clustering on the text of each State of the Union address.

Notifications You must be signed in to change notification settings

frankdevans/sotu_cluster

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

State of the Union Text Clustering

Description: this repo is a text clustering project using the Presidential State of the Union addresses as the document corpus. The project utilizes the R language is R Markdown as a publish format. The project exists in 2 parallel iterations.

  • Script: the R script version consists of 3 files: preprocess.R, eda.R, and cluster.R. They contain only the code necessary to replicate the subject of the analysis.
  • Publication: the R Markdown file sotu_cluster.RMD contains the entire end-to-end analysis. The code consists entirely of code also available among the script files in addition to the analysis write-up. The published version of the final product is publically available at: RPubs.

Environment Notes: Though built to be platform agnostic via R, this analysis was developed on R 3.2.1 ("World-Famous Astronaut"), RStudio 0.99.442, and Windows 7.

About

Text Clustering on the text of each State of the Union address.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages