Question regarding the dataset at HuggingFace #63

Open
anwai98 opened this issue May 29, 2024 · 1 comment

Comments

anwai98 commented May 29, 2024

Hi team,

Thanks for open-sourcing your amazing effort.

I have a question: I downloaded the dataset hosted at https://huggingface.co/datasets/OpenGVLab/SA-Med2D-20M, and it looks like only part of it has been released (~3.7M images and ~15.8M masks).

Do you plan to provide access to the rest of the images (the portion the paper marks as the "test set", comprising ~0.92M images and ~3.9M masks)? It would be nice to check it out as well.

Thanks in advance!

anwai98 commented May 30, 2024

Hi team,

I would like to share a few observations and check whether they are expected:

  1. Some input images from the Brain_PTM dataset look odd: the "input image" appears to be a binary mask of the brain rather than the scan itself (for example, images/mr_t1--Brain_PTM--case0005--x_0052.png and images/mr_t1--Brain_PTM--case0005--x_0056.png, two of many).
  2. Some images in the QUBIQ2020 dataset look similarly strange, especially the "brain-growth" samples: they show a binary-ish picture rather than the tissue region itself (for example, images/ct_00--QUBIQ2020--1_brain-growth_case01--2d_none.png and images/ct_00--QUBIQ2020--1_brain-growth_case25--2d_none.png, two of many).
  3. For the autoPET dataset, are the images formed from the CT scans only?
    • EDIT: Missed this one, apologies. The PET scans are available as a separate image paired with the lesions under the modality pet.
  4. Some images have shapes that do not match their respective ground truth (I have spotted only two so far): x_ray--covid_19_ct_cxr--auntminnie-2020_01_31_20_24_2322_2020_01_31_x-ray_coronavirus_US--2d_none.png and x_ray--covid_19_ct_cxr--radiopaedia-2019-novel-coronavirus-infected-pneumonia--2d_none.png
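
For anyone who wants to look for more cases like points 1, 2 and 4, here is a rough sketch of the kind of check that could flag them. It is not part of the dataset's tooling: the function names and the 0.99 threshold are hypothetical choices for illustration, and how each image is paired with its mask is left to the caller, since no particular file layout is assumed here. Pillow is needed only for the loader.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def dominant_two_level_fraction(pixels: Iterable[int]) -> float:
    """Fraction of pixels covered by the two most common intensity values."""
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    top_two = sum(n for _, n in counts.most_common(2))
    return top_two / total


def looks_binary(pixels: Iterable[int], threshold: float = 0.99) -> bool:
    """Flag an image whose intensities are almost entirely two values.

    The threshold is an assumed cut-off, not a value from the dataset authors.
    """
    return dominant_two_level_fraction(pixels) >= threshold


def shapes_match(image_size: Tuple[int, int], mask_size: Tuple[int, int]) -> bool:
    """True when an image and its mask have the same (width, height)."""
    return tuple(image_size) == tuple(mask_size)


def load_gray_pixels(path: str) -> Tuple[List[int], Tuple[int, int]]:
    """Load a PNG as flat 8-bit grayscale values plus its (width, height)."""
    # Pillow is imported lazily so the checks above stay stdlib-only.
    from PIL import Image

    with Image.open(path) as img:
        gray = img.convert("L")  # one byte per pixel
        return list(gray.tobytes()), gray.size
```

A typical use would be to call `load_gray_pixels` on an image and on its mask, pass the image pixels to `looks_binary`, and compare the two sizes with `shapes_match`. A high two-level fraction only suggests a mask-like image; it still needs a visual check.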

I'll come back with more questions if any come up.

Thanks!
