t-SNE Explorer

This is a demo of the Dash interactive Python framework developed by Plotly.

Dash abstracts away all of the technologies and protocols required to build an interactive web-based application and is a simple and effective way to bind a user interface around your Python code. To learn more check out our documentation.

For an introductory and extensive explanation of t-SNE how to use it properly, please check out the demo app.

Getting Started

Using the demo

To get started, choose a dataset you want to visualize. When the scatter plot appears on the graph, you can see the original image by clicking on a data point.

Alternatively, you can explore the GloVe Word Vectors datasets, which are encoded vectors of large collection of texts from Wikipedia, Twitter, and acquired through Web Crawlers. Upon clicking a data point, you will be able to see the 5 closest neighbors of the word you clicked.

Running the app locally

First create a virtual environment with conda or venv inside a temp folder, then activate it.

virtualenv dash-tsne-venv

# Windows
dash-tsne-venv\Scripts\activate
# Or Linux
source venv/bin/activate

Clone the git repo, then install the requirements with pip

git clone https://github.com/plotly/dash-tsne.git
cd dash-tsne
pip install -r requirements.txt

Run the app

python app.py

How to use the local version

To train your own t-SNE algorithm, input a high-dimensional dataset with only numerical values, and the corresponding labels inside the upload fields. For convenience, small sample datasets are included inside the data directory. You can also download them here. The training can take a lot of time depending on the size of the dataset (the complete MNIST dataset could take 15-30 min), and it is not advised to refresh the webpage when you are doing so.

Generating data

generate_data.py is included to download, flatten and normalize datasets, so that they can be directly used in this app. It uses keras.datasets, which means that you need install keras. To use the script, simply go to the path containing it and run in terminal:

python generate_data.py [dataset_name] [sample_size]

which will create the csv file with the corresponding parameters. At the moment, we have the following datasets:

MNIST
CIFAR10
Fashion_MNIST

About the app

What is t-SNE?

t-distributed stochastic neighbor embedding, created by van der Maaten and Hinton in 2008, is a visualization algorithm that reduce a high-dimensional space (e.g. an image or a word embedding) into two or three dimensions, facilitating visualization of the data distribution.

A classical example is MNIST, a dataset of 60,000 handwritten digits, 28x28 grayscale. Upon reducing the set of images using t-SNE, you can see all the digit clustered together, with few outliers caused by poor calligraphy. You can read a detailed explanation of the algorithm on van der Maaten's personal blog.

Built With

Dash - Main server and interactive components
Plotly Python - Used to create the interactive plots
Scikit-Learn - Run the t-SNE algorithm

Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.

Authors

Xing Han Lu - Initial Work - @xhlulu

See also the list of contributors who participated in this project.

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments

Screenshots

The following are screenshots for the demo app:

The following are screenshots for the full (local) app:

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
assets		assets
data		data
demo_embeddings		demo_embeddings
notebooks		notebooks
screenshots		screenshots
.gitignore		.gitignore
LICENSE.md		LICENSE.md
Procfile		Procfile
README.md		README.md
app.py		app.py
config.py		config.py
custom_styles.css		custom_styles.css
demo.py		demo.py
demo_description.md		demo_description.md
generate_data.py		generate_data.py
generate_demo_embeddings.py		generate_demo_embeddings.py
local.py		local.py
requirements.txt		requirements.txt
runtime.txt		runtime.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

t-SNE Explorer

Getting Started

Using the demo

Running the app locally

How to use the local version

Generating data

About the app

What is t-SNE?

Built With

Contributing

Authors

License

Acknowledgments

Screenshots

About

Releases

Packages

Languages

License

gjerman/dash-tsne

Folders and files

Latest commit

History

Repository files navigation

t-SNE Explorer

Getting Started

Using the demo

Running the app locally

How to use the local version

Generating data

About the app

What is t-SNE?

Built With

Contributing

Authors

License

Acknowledgments

Screenshots

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages