Skip to content

Reduction :: Topic Models

Marielle Lange edited this page Aug 29, 2015 · 1 revision

In Python

  • (many)

In statsR

In Javascript

Techniques

Latent Semantic Analysis (LSA)

Latent semantic analysis (LSA) is a technique in natural language processing, in particular in vectorial semantics, of analyzing relationships between a set of documents and the terms they contain by producing a set of concepts related to the documents and terms. Wikipedia

Latent Dirichlet allocation (LDA)

In natural language processing, Latent Dirichlet allocation (LDA) is a generative model that allows sets of observations to be explained by unobserved groups that explain why some parts of the data are similar. Wikipedia

Topic Modeling

Topic modeling is a form of text mining, a way of identifying patterns in a corpus. You take your corpus and run it through a tool which groups words across the corpus into 'topics' journalofdigitalhumanities

Principal Component Analysis

Principal component analysis (PCA) is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. Wikipedia

Visualisations

Chord Diagram (interactive only)

Plant companions

Word clouds

Topics word clouds and associated blog post

LDA-Based Topic Modelling in Javascript, update

DNA Bar

LDA-Based Topic Modelling in Javascript

Topic correlations

jsLDA

Force directed graph

Visualizing topic models

Bubbles

Wikipedia Visualizations

Tree of clusters

Visualising Structure in Topic Models

Elastic lists (interactive only)

Elastic Lists for Faceted Search