Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MultiMAP using precomputed distance matrices #11

Open
ulo opened this issue Jul 5, 2024 · 1 comment
Open

MultiMAP using precomputed distance matrices #11

ulo opened this issue Jul 5, 2024 · 1 comment

Comments

@ulo
Copy link

ulo commented Jul 5, 2024

Hi!
I would like to try MultiMAP for integrating a variety of datasets on a specific protein family.
This includes classical omics data like RNA-seq or targeted metabolomics, and also structural information and protein annotations.
For each of the different data modalities, I derived distance matrices which I can generate nice individual UMAPs on (using the metric='precomputed' parameter).
And as MultiMAP is a generalization of UMAP, would it theoretically work to create a MultiMAP based on these distances?
I am aware that the current implementation does not support this, but I would like to know if conceptually this would be possible.
I tried to figure out by looking at your source code, but I am unfortunately neither a mathematician nor a Python expert...

Many thanks!
Ulrich

@ktpolanski
Copy link
Contributor

I had a root around Mika's source code and in principle this could be possible (not with the current code, like you noticed). The algorithm generates a set of distances and KNN indices for each individual dataset, and then for each pairwise combination of datasets. I'm not sure how that would work with your data though, as you need shared features to compute a pairwise representation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants