bug: files on chan archives with four-letter extensions are saved with three-letter extensions and 404 out #5116
Comments
I downloaded ~50 Regarding renaming those files, you could use the |
Let's use this thread as an example: https://archived.moe/b/thread/912594917/ There are two webms in it, and when you attempt to open the links, they don't exist. I see now that it's a failure on the part of the archive sites to serve the correct URL, since you can reproduce it on the web without gallery-dl. The files are actually accessible if the ending is changed to .web, and will load then. archived.moe is the only /b/ archive I know of that lets you search /b/; the others don't, so one is more or less forced to go through it to wade through the content. Were you getting the webms from the 4chan site, and not an archive site? |
Oh, you are using
Well, at least I've got a reproducible error now. I'll look into it. |
Yes, all the content on archived.moe is hosted on thebarchive; archived.moe only saves the HTML and thumbnails. But people are forced to use archived.moe to find content anyway, since it is the only /b/ archive that has indexed searching. |
I noticed that scraping some 4chan archives that host webm files causes the downloader to 404 on those files. It occurs on thebarchive.com.
However, those files do exist on the archive; for some reason they are saved as .web, not .webm.
https://thebarchive.com/b/full_image/9999999999999.webm
will cause the downloader to fail with a 404 error, but the file exists as
https://thebarchive.com/b/full_image/9999999999999.web
which can be downloaded (and renamed manually by me afterwards).
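
To make the workaround concrete, here is a minimal Python sketch of the idea: try the .webm URL first, fall back to the truncated .web URL, and keep the original .webm name for the saved file. This is not how gallery-dl handles it; `TRUNCATED_EXTENSIONS`, `candidate_urls` and `local_filename` are hypothetical names made up for illustration, and the only mapping assumed is the .webm to .web one described above.

```python
from __future__ import annotations

from pathlib import PurePosixPath
from urllib.parse import urlsplit, urlunsplit

# Assumption: the only truncation observed so far is .webm -> .web
TRUNCATED_EXTENSIONS = {".webm": ".web"}


def candidate_urls(url: str) -> list[str]:
    """Return the original URL, followed by a truncated-extension fallback if one applies."""
    parts = urlsplit(url)
    path = PurePosixPath(parts.path)
    candidates = [url]
    short = TRUNCATED_EXTENSIONS.get(path.suffix.lower())
    if short is not None:
        # Swap only the extension in the path; scheme, host and query stay the same
        fallback_path = str(path.with_suffix(short))
        candidates.append(urlunsplit(parts._replace(path=fallback_path)))
    return candidates


def local_filename(url: str) -> str:
    """Name the saved file after the original URL, so it keeps its .webm extension."""
    return PurePosixPath(urlsplit(url).path).name
```

A downloader would request each candidate in order and stop at the first one that does not return 404, then save the result under `local_filename(original_url)`, which removes the manual rename step.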