Disable checksum filtering #118

szabgab · 2022-09-15T10:35:09Z

I have slow external disks connected to the computer, holding thousands of large files (videos). Comparing the checksums takes a lot of time. Would it be possible to disable it?

fire-eggs · 2022-09-15T14:47:34Z

You might try the -maxsize option to ignore files larger than a given size.

szabgab · 2022-09-19T08:34:29Z

Thanks for the suggestion, but I think the -maxsize option would mean the large files are left out of the report. I would like to see whether I have 2-3 copies of the same 1 GB file so I can get rid of them.

einsteinx2 · 2024-03-17T16:03:50Z

I have this text in my PR, but figured it would be good to comment with it here as well for discussion.

To the maintainer(s), I think there is a real use case for this:

In my example, I have a NAS with dozens of TB of storage. I have a lot of files that are significantly large (dozens to hundreds of GB). I've been doing some file reorganization and my goal is just to get a short list of files that might be duplicates that I can then process later.

In my case, if these large files have the same name, size, and first/last bytes, I can be basically 100% sure they're the same file. Checksumming every file at discovery time would take far longer, since each file has to be read in full, only to find out what I already know: they're the same.
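To make the idea concrete, here is a minimal Python sketch of that kind of shortlist: files are grouped by size plus their first and last bytes (and optionally by file name), and no file is read in full. This is not rdfind's code or the code from the PR; the names `quick_fingerprint`, `candidate_duplicates`, `EDGE_BYTES`, and `match_name` are hypothetical and chosen only for illustration.

```python
import os
from collections import defaultdict

EDGE_BYTES = 64  # hypothetical sample size taken from each end of the file


def quick_fingerprint(path, edge_bytes=EDGE_BYTES):
    """Return (size, first bytes, last bytes) without reading the whole file."""
    size = os.path.getsize(path)
    with open(path, "rb") as f:
        head = f.read(edge_bytes)
        # Jump to the final edge_bytes (or to the start, for short files).
        f.seek(max(size - edge_bytes, 0))
        tail = f.read(edge_bytes)
    return size, head, tail


def candidate_duplicates(paths, match_name=False):
    """Group paths whose quick fingerprints match; no full checksum is computed."""
    groups = defaultdict(list)
    for path in paths:
        key = quick_fingerprint(path)
        if match_name:
            # Optionally require the same base file name as well.
            key += (os.path.basename(path),)
        groups[key].append(path)
    # Only groups with at least two members are duplicate candidates.
    return [group for group in groups.values() if len(group) > 1]
```

The result is only a list of candidates: two files can agree on size and edge bytes and still differ in the middle, so anything about to be deleted would still need a full comparison.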

Later if needed I can manually checksum files in the results list, or even do something like test/checksum random byte ranges in the files rather than checksumming the whole thing to save a ton of time, but in my case (and I assume others' as well) I already know they're the same so even that isn't needed.
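As a rough illustration of checksumming sampled byte ranges instead of the whole file, the following Python sketch hashes the file size plus a handful of pseudo-random chunks. It is an assumption about how such a check could look, not something rdfind provides; `sampled_digest` and its parameters (`samples`, `chunk`, `seed`) are hypothetical names.

```python
import hashlib
import os
import random


def sampled_digest(path, samples=8, chunk=4096, seed=0):
    """Hash the file size plus a few pseudo-random byte ranges of the file."""
    size = os.path.getsize(path)
    # A fixed seed means files of equal size are sampled at the same offsets.
    rng = random.Random(seed)
    h = hashlib.sha256()
    h.update(str(size).encode())
    with open(path, "rb") as f:
        for _ in range(samples):
            # Pick a start offset so the chunk fits inside the file when possible.
            offset = rng.randrange(max(size - chunk, 0) + 1)
            f.seek(offset)
            h.update(f.read(chunk))
    return h.hexdigest()
```

With the defaults this reads at most 32 KiB per file regardless of file size. Equal digests make a match more likely but do not prove it, since differences outside the sampled ranges go unnoticed.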

To be clear, I think the default should stay the same (or even go up to sha256/512), but at least having the option to disable checksumming can be very useful and the change is only a few lines.

szabgab added a commit to szabgab/rdfind that referenced this issue Sep 15, 2022

allow for '-checksum none' to disable checksum filtering. pauldreik#118

d3c6a52

einsteinx2 linked a pull request Mar 17, 2024 that will close this issue

add option to disable checksum verification #153

Open

robfrawley linked a pull request May 11, 2024 that will close this issue

add option to disable checksum verification robfrawley/rdfind#4

Open
