-
-
Notifications
You must be signed in to change notification settings - Fork 975
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug?] [Imgur] [Reddit] Downloading an imgur link uses reddit keywords in --list-keywords
but not when actually downloading
#1687
Comments
Not sure if I can follow... Do you use the Because I seem to get the expected keywords for reddit, e.g: PS E:\> $exampleURL = "https://www.reddit.com/r/tumblr/comments/oi6hf0/all_librarian_lives_matter/"
PS E:\> gallery-dl -K -v $exampleURL | sls -NoEmphasis -Context 0,1 "author|category|subreddit|subcategory"
[gallery-dl][debug] Version 1.18.1
[gallery-dl][debug] Python 3.9.6 - Windows-10-10.0.19042-SP0
[gallery-dl][debug] requests 2.25.1 - urllib3 1.26.6
[gallery-dl][debug] Starting KeywordJob for 'https://www.reddit.com/r/tumblr/comments/oi6hf0/all_librarian_lives_matter/'
[reddit][debug] Using RedditSubmissionExtractor for 'https://www.reddit.com/r/tumblr/comments/oi6hf0/all_librarian_lives_matter/'
[reddit][info] Refreshing private access token
[urllib3.connectionpool][debug] Starting new HTTPS connection (1): www.reddit.com:443
[urllib3.connectionpool][debug] https://www.reddit.com:443 "POST /api/v1/access_token HTTP/1.1" 200 201
[urllib3.connectionpool][debug] Starting new HTTPS connection (1): oauth.reddit.com:443
[urllib3.connectionpool][debug] https://oauth.reddit.com:443 "GET /comments/oi6hf0/.json?limit=20&raw_json=1 HTTP/1.1" 200 4216
> author
Thryloz
> author_flair_background_color
None
> author_flair_css_class
None
> author_flair_richtext[]
> author_flair_template_id
None
> author_flair_text
None
> author_flair_text_color
None
> author_flair_type
text
> author_fullname
t2_bdthneg5
> author_patreon_flair
False
> author_premium
True
> category
reddit
> removed_by_category
None
> subcategory
submission
> subreddit
tumblr
> subreddit_id
t5_2r7hk
> subreddit_name_prefixed
r/tumblr
> subreddit_subscribers
1019946
> subreddit_type
public
> author
Thryloz
> author_flair_background_color
None
> author_flair_css_class
None
> author_flair_richtext[]
> author_flair_template_id
None
> author_flair_text
None
> author_flair_text_color
None
> author_flair_type
text
> author_fullname
t2_bdthneg5
> author_patreon_flair
False
> author_premium
True
> category
reddit
> removed_by_category
None
> subcategory
submission
> subreddit
tumblr
> subreddit_id
t5_2r7hk
> subreddit_name_prefixed
r/tumblr
> subreddit_subscribers
1019946
> subreddit_type
public
PS E:\> |
If you try actually downloading the provided link, it uses the imgur extractor and thus the imgur keywords |
Yes, I mean the image in this case is hosted on Imgur. This is how the reddit extractor usually worked, I believe.. |
I wasn't aware of that option. It works pretty much perfectly for my purposes Though this doesn't fix the issue of Either way, thanks for your help! |
It does more or less the same as $ gallery-dl -g https://www.reddit.com/r/tumblr/comments/oi6hf0
https://i.imgur.com/CtXWfp2.jpg
$ gallery-dl https://i.imgur.com/CtXWfp2.jpg
/tmp/imgur/imgur_CtXWfp2_All Librarian lives matter.jpg and you can get all available keywords by using There are several
You aren't, no need to worry, but quite a lot of your feature requests are quite complicated or at least not as simple as, for example, changing a regex. |
gallery-dl -K https://www.reddit.com/r/tumblr/comments/oi6hf0/all_librarian_lives_matter/
gives reddit keywords despitegallery-dl https://www.reddit.com/r/tumblr/comments/oi6hf0/all_librarian_lives_matter/
using imgur keywords. This probably happens with other sites as well but I haven't tested itI'm not too sure how to properly resolve the issue of this messing up archives though. The only real solution I can think of is letting reddit access the linked site's keywords, but trying to put that in an
extractors.*.filename
would be a mess even with conditional filenames. Alternatively you could pass the entire config into the filename and do something obscene like"filename":"reddit-{subreddit}-{config[extractors][{linked_category}][filename]}"
. Look me in the eye and tell me that is a good ideaFor the time being I'm only downloading one user so I can just hard-code them into a special directory, but there's almost certainly people for whom this causes problems
Side note: I hope I'm not being annoying with how many issues/feature requests I'm submitting. I'm trying to keep them high quality but I wouldn't be surprised if I get caught up in a spam filter sometimes
The text was updated successfully, but these errors were encountered: