How to import dataset from collection? #151

Open
YXC33 opened this issue Aug 19, 2024 · 7 comments

@YXC33

YXC33 commented Aug 19, 2024

Hi! I ran into a problem when trying to import a dataset that belongs to a collection.
Normally, when the dataset is not part of a collection, I can get it with this code:
curated_seurat_object <- metadata |> dplyr::filter( dataset_id == '37b21763-7f0f-41ae-9001-60bad6e2841d' ) |> get_seurat()

However, if I change the dataset_id to that of a dataset that belongs to a collection, the same call does not return the dataset. For example:
curated_seurat_object <- metadata |> dplyr::filter( dataset_id == '9fcb0b73-c734-40a5-be9c-ace7eea401c9' ) |> get_seurat()

The web page for this dataset is: https://cellxgene.cziscience.com/e/9fcb0b73-c734-40a5-be9c-ace7eea401c9.cxg/

'9fcb0b73-c734-40a5-be9c-ace7eea401c9' %in% unique(dplyr::pull(metadata, "dataset_id"))
[1] FALSE
Since this check returns FALSE, the dataset ID is absent from the metadata altogether, so the failure is not caused by the dataset's size.

Could you please tell me how to get a dataset that belongs to a collection?
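
The membership check above can be wrapped in a small helper so it runs before any download is attempted. This is only a sketch: `has_dataset()` is a hypothetical name, not a function of the package, and `toy_metadata` is a made-up data frame standing in for the real `metadata` table (assumed here to have a `dataset_id` column, as in the code above).

```r
library(dplyr)

# Toy stand-in for the real metadata table (assumption: one row per cell).
toy_metadata <- data.frame(
  cell_id    = c("c1", "c2", "c3"),
  dataset_id = c("37b21763-7f0f-41ae-9001-60bad6e2841d",
                 "37b21763-7f0f-41ae-9001-60bad6e2841d",
                 "toy-dataset-2"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: TRUE if at least one row carries the given dataset_id.
# `!!id` injects the value of `id` so it is never mistaken for a column name.
has_dataset <- function(metadata, id) {
  n_hits <- metadata |>
    filter(dataset_id == !!id) |>
    count() |>
    pull(n)
  n_hits > 0
}

has_dataset(toy_metadata, "37b21763-7f0f-41ae-9001-60bad6e2841d")  # TRUE
has_dataset(toy_metadata, "9fcb0b73-c734-40a5-be9c-ace7eea401c9")  # FALSE
```

Because the helper uses only dplyr verbs, it should also work if `metadata` is a database-backed table rather than an in-memory data frame, though that has not been verified here.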

@stemangiola
Owner

Have you tried the column collection_id?

@YXC33
Author

YXC33 commented Aug 19, 2024

> have you tried the column collection_id?

Thanks for the suggestion. I have already tried collection_id, using this code:
'71f4bccf-53d4-4c12-9e80-e73bfb89e398' %in% unique(dplyr::pull(metadata, "collection_id"))
[1] FALSE
This is the collection page: https://cellxgene.cziscience.com/collections/71f4bccf-53d4-4c12-9e80-e73bfb89e398

I also checked, and the collection_id column contains only 76 collections.
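
For a collection that is present in the metadata, its datasets could be listed by filtering on collection_id. A minimal sketch, assuming the table has both `collection_id` and `dataset_id` columns; `datasets_in_collection()` is a hypothetical helper and the IDs in `toy_metadata` are made up for illustration.

```r
library(dplyr)

# Toy stand-in for the real metadata table; IDs are invented.
toy_metadata <- data.frame(
  dataset_id    = c("ds-a", "ds-a", "ds-b", "ds-c"),
  collection_id = c("col-1", "col-1", "col-1", "col-2"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: the distinct dataset IDs that belong to one collection.
# Returns character(0) when the collection is not in the table.
datasets_in_collection <- function(metadata, id) {
  metadata |>
    filter(collection_id == !!id) |>
    distinct(dataset_id) |>
    pull(dataset_id)
}

datasets_in_collection(toy_metadata, "col-1")  # "ds-a" "ds-b"
datasets_in_collection(toy_metadata, "col-9")  # character(0)
```

An empty result for a collection ID would match the FALSE shown above: the collection is simply not in the table.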

@stemangiola
Owner

@myushen ?

@myushen
Contributor

myushen commented Aug 20, 2024

Hi @YXC33, it seems that dataset_id 9fcb0b73-c734-40a5-be9c-ace7eea401c9 does not exist in our metadata, so get_seurat() cannot build a Seurat object for it.

This can happen if the dataset was published recently and our API hasn't incorporated it yet.

@stemangiola
Owner

We will have the updated CELLxGENE in a week or so.

@YXC33
Author

YXC33 commented Aug 30, 2024

May I ask how often the metadata is updated? Also, the metadata contains only 329 unique datasets, whereas CELLxGENE lists 1486.
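
Counts like the ones above can be reproduced with a single summary over the table. A small sketch, assuming `dataset_id` and `collection_id` columns; `toy_metadata` is a made-up stand-in, so the numbers it yields are not the real ones.

```r
library(dplyr)

# Toy stand-in for the real metadata table; IDs are invented.
toy_metadata <- data.frame(
  dataset_id    = c("ds-a", "ds-a", "ds-b", "ds-c"),
  collection_id = c("col-1", "col-1", "col-1", "col-2"),
  stringsAsFactors = FALSE
)

# Count distinct datasets and collections in one pass.
# collect() is a no-op on a data frame; it is there in case the table is lazy.
coverage <- toy_metadata |>
  summarise(
    n_datasets    = n_distinct(dataset_id),
    n_collections = n_distinct(collection_id)
  ) |>
  collect()

coverage  # n_datasets = 3, n_collections = 2 for the toy data
```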

@stemangiola
Owner

Hello, we are completing the most recent update. It is probably about 2 weeks from publication.


3 participants