Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding a checksum when downloading figshare files #655

Merged
merged 3 commits into from
Jul 20, 2023

Conversation

flefebv
Copy link
Contributor

@flefebv flefebv commented Jul 19, 2023

Closes #642
Supersedes #654

Modifies the _fetch_figshare function to checksum the files when downloading.

Remark: For openml, the checksum is already done within the sklearn function fetch_openml.

Warning: does not deal with world_bank, because world_bank does not provide checksum, and we do not have a fixed list of datasets that the user can download from world_bank.

skrub/datasets/_fetching.py Outdated Show resolved Hide resolved
@GaelVaroquaux
Copy link
Member

LGTM. Merging

@GaelVaroquaux GaelVaroquaux enabled auto-merge (squash) July 20, 2023 12:42
@GaelVaroquaux GaelVaroquaux merged commit b421798 into skrub-data:main Jul 20, 2023
20 of 21 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

We should checksum the file to be sure that downloading went fine
3 participants