
[FEA] Promote FITSNE from experimental #3805

Closed
Tracked by #4139
divyegala opened this issue Apr 29, 2021 · 15 comments · Fixed by #4361
Labels: feature request (New feature or request), inactive-30d

Comments

@divyegala (Member)

I'll attach embeddings of some popular datasets using cuML t-SNE (FFT), cuML t-SNE (BH), and sklearn t-SNE (BH).
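For reference, embeddings like these can be generated as in the sketch below. The sklearn call is runnable on CPU; the cuML equivalents (commented out, GPU required) assume cuML's `TSNE` with its `method` parameter, and the iris dataset here is just a stand-in for the datasets listed in this thread.

```python
from sklearn.datasets import load_iris
from sklearn.manifold import TSNE

X = load_iris().data  # shape (150, 4)

# sklearn Barnes-Hut t-SNE, 2-D embedding
emb_sk = TSNE(n_components=2, method="barnes_hut", random_state=42).fit_transform(X)
print(emb_sk.shape)  # (150, 2)

# cuML equivalents (sketch, GPU required):
# from cuml.manifold import TSNE as cuTSNE
# emb_fft = cuTSNE(n_components=2, method="fft").fit_transform(X)
# emb_bh  = cuTSNE(n_components=2, method="barnes_hut").fit_transform(X)
```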

@divyegala added the "feature request (New feature or request)" and "? - Needs Triage (Need team to review and classify)" labels on Apr 29, 2021
@divyegala (Member, Author)

Boston Dataset:

  1. cuML FFT
    boston_90_30_fft

  2. cuML BH
    boston_90_30_barneshut

  3. sklearn BH
    boston_90_30_sklearn

@divyegala (Member, Author)

Breast Cancer Dataset:

  1. cuML FFT
    breast_cancer_90_30_fft

  2. cuML BH
    breast_cancer_90_30_barneshut

  3. sklearn BH
    breast_cancer_90_30_sklearn

@divyegala (Member, Author)

Diabetes Dataset:

  1. cuML FFT
    diabetes_90_30_fft

  2. cuML BH
    diabetes_90_30_barneshut

  3. sklearn BH
    diabetes_90_30_sklearn

@divyegala (Member, Author)

Digits Dataset:

  1. cuML FFT
    digits_90_30_fft

  2. cuML BH
    digits_90_30_barneshut

  3. sklearn BH
    digits_90_30_sklearn

@divyegala (Member, Author)

Iris Dataset:

  1. cuML FFT
    iris_90_30_fft

  2. cuML BH
    iris_90_30_barneshut

  3. sklearn BH
    iris_90_30_sklearn

@dantegd changed the title from "[FEA] Promote FITSNE from experimenatl" to "[FEA] Promote FITSNE from experimental" on Apr 29, 2021
@zbjornson (Contributor)

It looks like a lot of those differences can be reduced by adjusting late_exaggeration.

It also looks like there are flyaways in the Iris dataset in cuML B-H.

@github-actions bot commented Jun 2, 2021

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

@cjnolet (Member) commented Sep 22, 2021

Just want to provide an update on the tasks that should be completed before we officially promote the FFT t-SNE from experimental (and potentially make it the default option). We've shown that the FFT algorithm works well on toy datasets, but there is evidence suggesting that 1) it may not always provide better results than Barnes-Hut, and 2) it may not be faster. To my knowledge, we have not done any formal benchmarks or scale analysis of the FFT algorithm beyond the initial results that @zbjornson has been gracious enough to provide.

At minimum, the results should be at least as correct and performant as Barnes-Hut before we make the FFT variant the default option. If it gives more stable results but is not quite as performant, we could probably still move FFT out of experimental without making it the default.

From my perspective, two tasks really remain:

  1. Benchmark performance and scale on both real-world datasets and toy datasets
  2. Evaluate the stability / correctness on real-world datasets and at scale

One very easy place to try this is https://github.com/clara-parabricks/rapids-single-cell-examples/blob/master/notebooks/hlca_lung_gpu_analysis.ipynb

@lowener (Contributor) commented Nov 12, 2021

I ran TSNE on a few real-world datasets to observe the embeddings and the performance.

  • From what I found, FFT produces correct results, not consistently better or worse than Barnes-Hut's.
  • FFT is usually slower than Barnes-Hut when the number of iterations is below 1500; past that threshold, Barnes-Hut is slower than FFT.

I'll add the results here, starting with the dataset you linked (shape = (65462, 20)).

[Screenshots: hlca_lung_gpu_analysis2 notebook, embeddings and timings]

@lowener (Contributor) commented Nov 12, 2021

On a Spotify dataset from Kaggle (link), shape (2017, 8), with 500 iterations:

[Screenshot: tsne notebook, Spotify embeddings]

@lowener (Contributor) commented Nov 12, 2021

On a credit-card fraud detection dataset, also from Kaggle (link), with several iteration counts. Shape = (3492, 30):

[Screenshots: autoencoders-tsne notebook, embeddings at the different iteration counts]

@cjnolet (Member) commented Nov 15, 2021

@lowener, these plots and benchmarks look great and it's nice to see the performance gap isn't prohibitively large between FFT and BH on these datasets. So long as they are converging in a reasonable number of iterations (which look to be similar-ish across BH and FFT), this could be evidence for supporting FFT as the default.

Ideally, before we officially promote FFT to the default, we should also address this concern (since these datasets don't seem to show such a significant slowdown across differing numbers of iterations): #3865 (comment)

@zbjornson (Contributor)

The runtimes for these examples are only a few seconds. When I benchmarked larger datasets that take between 30 seconds and 10 minutes on a V100, FFT was 1.3x to 2.2x faster than B-H.

Now that K-L divergence can be calculated, it would be cool to plot it over each iteration. The plot in #3058 is partly wrong, but exact, FFT, and B-H all seemed to converge at different rates.
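A per-iteration convergence plot would only need a discrete KL helper like the sketch below. This is a generic numpy illustration, not cuML's internal implementation; in the t-SNE setting, P and Q would be the flattened high- and low-dimensional affinity matrices.

```python
import numpy as np

def kl_divergence(P, Q, eps=1e-12):
    """KL(P || Q) for two discrete distributions; both are normalized internally."""
    P = np.asarray(P, dtype=float).ravel()
    Q = np.asarray(Q, dtype=float).ravel()
    P = P / P.sum()
    Q = Q / Q.sum()
    # eps guards against log(0) / division by zero for empty affinity cells
    return float(np.sum(P * np.log((P + eps) / (Q + eps))))

print(kl_divergence([0.5, 0.5], [0.5, 0.5]))  # 0.0 (identical distributions)
```

Recording this value once per iteration for each method (exact, FFT, B-H) would make the differing convergence rates directly comparable.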

@cjnolet (Member) commented Nov 15, 2021

@zbjornson,

I fully agree we should be comparing larger datasets. It also looks like early stopping was removed from Barnes-Hut at some point, which invalidates the benchmarks above. Out of curiosity, what datasets did you use for benchmarking?

rapids-bot bot pushed a commit that referenced this issue Nov 18, 2021
@zbjornson (Contributor) commented Nov 18, 2021

@cjnolet I was using synthetic N-dimensional Gaussian blobs with between 4 and 50 dimensions, 100k and ~20M rows. I was using a fixed number of iterations without early stopping. (I have some concerns about the min grad norm for early stopping but haven't had time to dig into it yet.)

Edit: Your comment here (#4361 (comment)):

> While the FFT implementation is slower iteration for iteration

is the opposite of my observations.
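The synthetic setup described above can be reproduced with scikit-learn's `make_blobs`. This sketch is scaled down for illustration; the actual benchmark runs used 100k to ~20M rows and 4 to 50 dimensions.

```python
from sklearn.datasets import make_blobs

# Scaled-down stand-in for the benchmark data: isotropic Gaussian blobs.
# Bump n_samples / n_features toward the ranges above for a real benchmark.
X, y = make_blobs(n_samples=10_000, n_features=16, centers=10, random_state=0)
print(X.shape, y.shape)  # (10000, 16) (10000,)
```

Timing a fixed number of t-SNE iterations on this data with `method="fft"` versus `method="barnes_hut"`, without early stopping, would mirror the comparison described here.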

vimarsh6739 pushed a commit to vimarsh6739/cuml that referenced this issue Oct 9, 2023