Support url-encoded characters in URL credentials #3732

BrownTruck · 2016-05-26T10:52:56Z

This was automatically migrated from #3237 to reparent it to the master branch. Please see original pull request for any previous discussion.

Original Submitter: @mjwillson

This change is

qdamian · 2017-03-26T04:11:39Z

Is the lack of a news entry what's blocking this PR? I am too experiencing #3236 and would appreciate it if this fix was merged.

Ivoz · 2017-03-26T10:38:47Z

pip/download.py

@@ -117,6 +118,13 @@ def user_agent():
    )


+def unquote(s):
+    if six.PY2:
+        return urllib_unquote(s.encode("utf-8")).decode("utf-8")


Since internally urllib.unquote will simply re-en/decode unicode characters given to it as latin1, I can't see the point of the little utf-8 jig that's written here, but maybe I've thought about it wrong?

Thanks for reviewing.
I think because of the encode("utf-8") the argument of urllib_unquote will be of type str in this case, so the _is_unicode condition will not be met and the bytes will not be decoded as latin1.

This means that unicode characters like the £ used in the unit tests are decoded:

>>> urllib.unquote(u'%C2%A3'.encode("utf-8")).decode("utf-8") u'\xa3' >>> urllib.unquote('%C2%A3'.encode("utf-8")).decode("utf-8") u'\xa3'

Which wouldn't be decoded otherwise:

>>> urllib.unquote(u'%C2%A3') u'\xc2\xa3' >>> urllib.unquote('%C2%A3') '\xc2\xa3'

Ivoz · 2017-03-26T10:44:50Z

tests/unit/test_download.py

+
+def test_parse_credentials():
+    auth = MultiDomainBasicAuth()
+    assert auth.parse_credentials(u"foo:[email protected]") == (u'foo', u'bar')


In python 3, strings are natively unicode anyway; in python 2, I don't think this should be unicode here (either expecting to receive it, nor outputting). What you've output in the new function is unicode but that's because of the utf-8 jig I'm not sure about.

OK. We can change it to use str in Python 2. I don't know if the author of this patch would be interested to work on this, because the original patch is from Nov '15. If not, I volunteer to create a new pull request with these changes, if that's fine with you.

Support url-encoded characters in URL credentials

d880921

BrownTruck added the migrated from develop label May 26, 2016

BrownTruck mentioned this pull request May 26, 2016

Support url-encoded characters in URL credentials #3237

Closed

xavfernandez closed this Jan 6, 2017

xavfernandez reopened this Jan 6, 2017

9nix00 mentioned this pull request Feb 7, 2017

Failure to authenticate private repository when URL-encoded character in password #3236

Closed

xavfernandez mentioned this pull request Mar 22, 2017

Can't install via custom index when username or password ends with a pound sign #4364

Closed

Ivoz reviewed Mar 26, 2017

View reviewed changes

qdamian mentioned this pull request Mar 31, 2017

Finish PR 3732: Handle url encoded credentials #4393

Merged

dstufft closed this Apr 1, 2017

lock bot added the auto-locked Outdated issues that have been locked by automation label Jun 3, 2019

lock bot locked as resolved and limited conversation to collaborators Jun 3, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support url-encoded characters in URL credentials #3732

Support url-encoded characters in URL credentials #3732

BrownTruck commented May 26, 2016 •

edited by dstufft

Loading

qdamian commented Mar 26, 2017

Ivoz Mar 26, 2017 •

edited

Loading

qdamian Mar 26, 2017

Ivoz Mar 26, 2017

qdamian Mar 26, 2017

Support url-encoded characters in URL credentials #3732

Support url-encoded characters in URL credentials #3732

Conversation

BrownTruck commented May 26, 2016 • edited by dstufft Loading

qdamian commented Mar 26, 2017

Ivoz Mar 26, 2017 • edited Loading

Choose a reason for hiding this comment

qdamian Mar 26, 2017

Choose a reason for hiding this comment

Ivoz Mar 26, 2017

Choose a reason for hiding this comment

qdamian Mar 26, 2017

Choose a reason for hiding this comment

BrownTruck commented May 26, 2016 •

edited by dstufft

Loading

Ivoz Mar 26, 2017 •

edited

Loading