Word Cloud Visualization

Description:

This node allows you to create a word cloud from text data coming from one of three sources. The node can create a visualization for data in a Modeler data object, it can read a single URL and parse the text using a CSS selector, or it can read a local .txt file. The extension has options for text cleaning: removing punctuation, numbers, and English stop words. It also has options for displaying the word cloud.

User Interface 1:

The first tab for this node is Text Source.

First, choose the source of the text data: Modeler, Web, or Local file. The source is selected using the radio buttons on the left side.

For Modeler Data, select one or more columns from the dataset containing text. If one column is selected, an option is available for creating a word cloud for each row of text in that column. This option is enabled by clicking the check box below the field selector. If that box is unchecked, or multiple columns are selected, then the text in each column is concatenated into a single string and one word cloud is created per column.
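The column handling described above can be sketched as follows. This is a minimal Python illustration, not the node's actual R code: the function name `texts_for_clouds`, the `per_row` flag, the dict-of-lists table layout, and the use of a single space as the separator when concatenating are all assumptions made for the example.

```python
def texts_for_clouds(table, columns, per_row=False):
    """Return a list of (label, text) pairs, one pair per word cloud.

    table   -- hypothetical dataset: a dict mapping column name to a list of values
    columns -- the selected text columns
    per_row -- mirrors the check box; only honored when exactly one column is selected
    """
    if per_row and len(columns) == 1:
        col = columns[0]
        # One word cloud for each row of the single selected column.
        return [(f"{col}[{i}]", str(value)) for i, value in enumerate(table[col])]
    # Otherwise each column is concatenated into one string: one cloud per column.
    # Joining with a space is an assumption of this sketch.
    return [(col, " ".join(str(v) for v in table[col])) for col in columns]


table = {"review": ["good phone", "bad battery"], "title": ["A", "B"]}
for label, text in texts_for_clouds(table, ["review"], per_row=True):
    print(label, "->", text)
```

With two columns selected, the `per_row` flag is ignored and the sketch returns one concatenated string per column.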

For Web Text, enter the URL containing the text for analysis and the appropriate CSS selector. If you are unfamiliar with CSS selectors, I recommend a tool like http://selectorgadget.com/ to help find the correct HTML elements.
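To give a feel for what a selector does, here is a small Python sketch that collects the text inside every element matching a bare tag name such as p (the simplest kind of CSS selector). It is only an illustration under stated assumptions: the class `TagTextExtractor` and the function `extract_text` are hypothetical names, the HTML is an inline sample rather than a fetched URL, and real CSS selectors (classes, ids, nesting) are not handled. It is not how the node itself parses pages.

```python
from html.parser import HTMLParser


class TagTextExtractor(HTMLParser):
    """Collect text found inside elements with a given tag name."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.depth = 0      # how many matching elements are currently open
        self.chunks = []    # text pieces found inside matching elements

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth > 0:
            self.chunks.append(data.strip())


def extract_text(html, tag):
    """Return the text of all <tag> elements, joined by single spaces."""
    parser = TagTextExtractor(tag)
    parser.feed(html)
    parser.close()
    return " ".join(chunk for chunk in parser.chunks if chunk)


sample = "<h1>Title</h1><p>First para.</p><div>skip</div><p>Second <b>bold</b> para.</p>"
print(extract_text(sample, "p"))
```

Text in nested elements (the b tag above) is included, while text outside the selected elements (the heading and the div) is left out.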

Local Text source files must be in .txt format. One word cloud is generated per text file.

User Interface 2:

The second tab for this node is Display & Save Options.

The tab contains options for text preparation, defaulting to removing punctuation, numbers, and (‘english’) stop words from the text.
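The three preparation steps can be pictured with the short Python sketch below. It is an illustration only, not the node's R code: the function name `prepare_text`, its keyword flags, and the tiny `STOP_WORDS` set (a stand-in for a full English stop word list) are assumptions made for the example.

```python
import string

# Tiny stand-in for a full English stop word list (assumption for this sketch).
STOP_WORDS = {"a", "an", "and", "the", "is", "of", "to", "in", "it"}


def prepare_text(text, remove_punctuation=True, remove_numbers=True,
                 remove_stop_words=True):
    """Return the list of words left after the enabled cleaning steps."""
    if remove_punctuation:
        # Delete every ASCII punctuation character.
        text = text.translate(str.maketrans("", "", string.punctuation))
    if remove_numbers:
        # Delete digit characters.
        text = "".join(ch for ch in text if not ch.isdigit())
    words = text.split()
    if remove_stop_words:
        # Compare case-insensitively so "The" is dropped along with "the".
        words = [w for w in words if w.lower() not in STOP_WORDS]
    return words


print(prepare_text("The cat, and 2 dogs!"))
```

Each flag corresponds to one of the defaults named above, so switching a flag off leaves that kind of token in the output.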

The word cloud display group of parameters adjusts the colors used, based on the R Color Brewer package. This section also sets the minimum frequency a word must have to be used in the word cloud, the maximum number of words to display, and the rotation percent of words. You can also print the words with their respective frequencies by checking the box in this section.
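As a rough picture of how these display parameters interact, the Python sketch below filters words by minimum frequency, caps the result at a maximum number of words, and decides which words are drawn rotated. It is not the node's implementation: the names `select_words` and `choose_rotations` are hypothetical, and reading "rotation percent" as the fraction of words drawn rotated is an assumption of this sketch.

```python
import random
from collections import Counter


def select_words(words, min_freq=1, max_words=100):
    """Return (word, count) pairs, most frequent first.

    Words seen fewer than min_freq times are dropped, and at most
    max_words pairs are returned.
    """
    counts = Counter(words)
    frequent = [(w, n) for w, n in counts.most_common() if n >= min_freq]
    return frequent[:max_words]


def choose_rotations(n_words, rot_percent, seed=0):
    """Return one boolean per word: True means the word is drawn rotated.

    rot_percent is treated as a probability between 0.0 and 1.0
    (assumption); the seed only makes the sketch repeatable.
    """
    rng = random.Random(seed)
    return [rng.random() < rot_percent for _ in range(n_words)]


words = ["data", "cloud", "data", "word", "cloud", "data"]
selected = select_words(words, min_freq=2, max_words=5)
for word, count in selected:
    # Equivalent of the "print words with frequencies" check box.
    print(word, count)
```

Raising the minimum frequency or lowering the maximum word count both shrink the cloud; the first removes rare words, the second trims the tail of the ranked list.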

The Save option creates a .png file containing the word cloud(s) generated by the node. If multiple files are created (for multiple word clouds), a value at the end of the file name increments for each file. The width and height values in this section are in inches.
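The incrementing file names might look like the output of the Python sketch below. The exact naming pattern the node uses is not documented here, so the details are assumptions: the function name `output_paths` is hypothetical, the counter is assumed to start at 1 and to be appended directly to the file name, and a single word cloud is assumed to keep the name unchanged.

```python
from pathlib import Path


def output_paths(base_path, n_clouds):
    """Return one .png path per word cloud.

    A single cloud keeps the given name; several clouds get a counter
    appended to the file name (cloud1.png, cloud2.png, ...). Both
    choices are assumptions of this sketch.
    """
    base = Path(base_path)
    if n_clouds == 1:
        return [base.with_suffix(".png")]
    return [base.with_name(f"{base.stem}{i}.png") for i in range(1, n_clouds + 1)]


for path in output_paths("output/cloud.png", 3):
    print(path.name)
```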

Requirements

IBM SPSS Modeler v16 or later
‘R Essentials for SPSS Modeler’ plugin: Download here
R 2.15.x or R 3.1 (Use this link to find the correct version)

Installation instructions

Download the extension: Download
Close IBM SPSS Modeler. Save the .cfe file in the CDB directory, located by default on Windows in "C:\ProgramData\IBM\SPSS\Modeler\version\CDB" or under your IBM SPSS Modeler installation directory. Note: this is a hidden directory, so you need to type it in manually or copy/paste the file path.
Restart IBM SPSS Modeler. The node will now appear in the Output palette.

R Packages used

The R packages will be installed the first time the node is used as long as an Internet connection is available.

Documentation and samples

Find a PDF with the documentation of this extension in the Documentation directory
There is a sample available in the Example directory

Known issues

Modeler supports R 3.2.x only, but install.packages may install only the latest version of a package, which sometimes causes the installation to fail. As a workaround, manually install an older version of the dependency package. See http://stackoverflow.com/questions/17082341/installing-older-version-of-r-package for how to install an older version of a package.

License

Apache 2.0

Contributors

Greg Filla (gdfilla)
