-
-
Notifications
You must be signed in to change notification settings - Fork 975
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[furaffinity] Downloading some already downloaded files #776
Comments
I'm sure it has to do with the fact that there are times when they've re-encoded filetypes for size or something. I cant tell you how many times I've seen periods from a sizable gallery, while getting ripped, go from being identified initially as a PNG, to being ripped as a JPG, because of whatever crap the admin team was doing at the time. It's able to get PNG's, but stuff from, say like 3 years ago, will have a period where that happens. So maybe, the ripper gets confused by this behavior. I've seen it happen to with stuff I've already got as well. |
This happens because of https://www.furaffinity.net/view/32690490 appears to be a |
TIL Anyway is it then checked if the file exists with the new filename? Maybe it should perform another check once |
I think the issue here is that adjust-extension happens after it has been downloaded. This actually explains the issue I'm having with e-hentai/exhentai because all retrieved items have no extension then are given an extension based on the content. |
Check file headers against a list of file signatures before downloading the whole file and writing it to disk. The file signature check needs some improvements (*), but it produces usable results for the most part. (*) - 'webp', 'wav', and others start with 'RFFI' - 'svg' uses the same "signature" as all XML documents - 'webm' has the same signature as 'mkv' files - only 'mp3' files in an ID3v2 container get recognized
Should be fixed in 536c088, but these are some rather significant changes to the HTTP downloader code and there are most likely some bugs in there. I'll leave this open for now so you can report any crashes etc in here. The file signature check also needs some work, but it is good enough for now. |
Sadly it completely breaks deviantart #1144 |
There seems to be an issue with JPEG extensions, gallery-dl doesn't recognize previously downloaded images that have a .jpeg file extension. I just upgraded to the latest dev version and images that were previously downloaded with .jpeg extensions are now downloaded with .jpg extensions. This would result in a lot of duplicates, every old .jpeg image would now have an identical .jpg copy. An example of this can be seen with This could be resolved by running something like |
@mikf Ah, so that's what |
Seemingly at random (but always the same URLs, can't see any pattern tho) gallery-dl will download an already downloaded file, I compared the output of a URL that works as expected and one that re-downloads if you run the command a second time:
(both files exist in output directory, no new files appear)
The text was updated successfully, but these errors were encountered: