Predicting on new data #6

zachmayer · 2015-12-02T15:26:11Z

It looks like the scikit learn folks are considering an implementation of Barnes-Hut t-SNE that allows for predictions on new data. (They're implementing fit and transform methods, rather than a single fit_transform method).

Would it be possible to do that here, and add a predict method to Rtsne?

The text was updated successfully, but these errors were encountered:

jkrijthe · 2015-12-08T14:50:38Z

Thanks for the suggestion! I haven't had enough time to look into how hard it is to implement this yet, but at first glance it seems to me they keep the locations of the training set fixed and try to find good location in the embedding for the objects to be 'transformed'. I'm not sure I think this makes a lot of sense and it may be worthwhile to see how their discussion plays out, since they also currently do not seem to have a transform method in master. But if people think this type of transform makes sense it could be worthwhile to implement it.

zachmayer · 2015-12-08T16:09:13Z

They HAD a transform method until very recently, but it looks like they just removed it. I'd really love to be able to try this out on new data, but it's probably best to see how their discussion plays out first.

dfalbel · 2016-11-21T13:58:59Z

It's a FAQ question here: https://lvdmaaten.github.io/tsne/
And the answer is this:

Once I have a t-SNE map, how can I embed incoming test points in that map?

t-SNE learns a non-parametric mapping, which means that it does not learn an explicit function that maps data from the input space to the map. Therefore, it is not possible to embed test points in an existing map (although you could re-run t-SNE on the full dataset). A potential approach to deal with this would be to train a multivariate regressor to predict the map location from the input data. Alternatively, you could also make such a regressor minimize the t-SNE loss directly, which is what I did in this paper.

jkrijthe closed this as completed Mar 20, 2020

m-muecke mentioned this issue Apr 23, 2024

Support for tSNE: t-Distributed Stochastic Neighbour Embeddings mlr-org/mlr3pipelines#756

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Predicting on new data #6

Predicting on new data #6

zachmayer commented Dec 2, 2015

jkrijthe commented Dec 8, 2015

zachmayer commented Dec 8, 2015

dfalbel commented Nov 21, 2016

Predicting on new data #6

Predicting on new data #6

Comments

zachmayer commented Dec 2, 2015

jkrijthe commented Dec 8, 2015

zachmayer commented Dec 8, 2015

dfalbel commented Nov 21, 2016