Extracting EncodedString specs from #134 #151

bf4 · 2015-01-07T05:14:47Z

I need help with the matcher

Also, I decided not to include the new differ specs in here,
to reduce complexity, but am open to adding it.

bf4 · 2015-01-07T06:14:15Z

lib/rspec/support/encoded_string.rb

+        #
+        # Raised by byte <-> char conversions
+        #  RangeError: out of char range
+        #   e.g. the UTF-16LE emoji: 128169.chr


Is this too much detail in the comments?

The detail is helpful, actually.

Nice docs. There's not enough about ruby encodings around the place so this is great.
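The RangeError described in the comment under review can be reproduced in plain Ruby. This is a minimal sketch, assuming `Encoding.default_internal` is unset (the usual default); `chr_or_error` is a hypothetical helper used only for illustration and is not part of rspec-support.

```ruby
# Hypothetical helper: returns the character for a codepoint, or the
# RangeError that was raised while converting it.
def chr_or_error(codepoint, encoding = nil)
  encoding ? codepoint.chr(encoding) : codepoint.chr
rescue RangeError => e
  e
end

# Without an explicit encoding, Integer#chr only covers 0..255 (unless
# Encoding.default_internal is set), so 128169 is out of char range.
p chr_or_error(128169)

# With an explicit Unicode encoding the same codepoint converts cleanly.
p chr_or_error(128169, Encoding::UTF_8)

# A codepoint outside the target encoding's range also raises RangeError.
p chr_or_error(256, Encoding::US_ASCII)
```

Returning the exception instead of re-raising keeps the contrast between the failing and succeeding calls visible in one run.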

bf4 · 2015-01-07T06:39:35Z

lib/rspec/support/encoded_string.rb

+        #     and "\x80".encode('UTF-8','ASCII-8BIT', undef: :replace, replace: '<undef>')
+        #     # => '<undef>'
+        #   Encoding::CompatibilityError
+        #    when Enconding.compatbile?(str1, str2) is false


This is technically incorrect. As written, the error is raised when the result is nil, not false. I'll revise if everything else is ok.

I was confusing enc.ascii_compatible?

    * Returns whether ASCII-compatible or not.
    *
    *   Encoding::UTF_8.ascii_compatible?     #=> true
    *   Encoding::UTF_16BE.ascii_compatible?  #=> false

with rb_enc_check

    rb_enc_check(VALUE str1, VALUE str2)
    {
        rb_encoding *enc = rb_enc_compatible(str1, str2);
        if (!enc)
            rb_raise(rb_eEncCompatError, "incompatible character encodings: %s and %s",

where `rb_enc_compatible` corresponds to `Encoding.compatible?(str1, str2)`

Yeah, please fix this. (Good catch!). Also, s/compatbile/compatible/
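The distinction discussed in this thread can be checked directly. The sketch below uses only core Ruby; the constant names are made up for the example. It shows that `Encoding.compatible?` returns nil (not false) for incompatible strings, that combining such strings raises `Encoding::CompatibilityError`, and that `ascii_compatible?` answers a different question about a single encoding.

```ruby
UTF8_TEXT  = "caf\u00E9"                    # UTF-8 with a non-ASCII character
UTF16_TEXT = UTF8_TEXT.encode("UTF-16LE")   # same text, incompatible encoding

# Returns an Encoding when the strings can be combined...
p Encoding.compatible?(UTF8_TEXT, "ascii only")
# ...and nil when they cannot.
p Encoding.compatible?(UTF8_TEXT, UTF16_TEXT)

# Combining incompatible strings is what raises the error.
begin
  UTF8_TEXT + UTF16_TEXT
rescue Encoding::CompatibilityError => e
  puts e.message
end

# ascii_compatible? is asked of one encoding, not of a pair of strings.
p Encoding::UTF_8.ascii_compatible?
p Encoding::UTF_16BE.ascii_compatible?

# The contrast from the comment under review: the undefined conversion
# raises unless a replacement is requested.
begin
  "\x80".encode("UTF-8", "ASCII-8BIT")
rescue Encoding::UndefinedConversionError => e
  puts e.class
end
p "\x80".encode("UTF-8", "ASCII-8BIT", undef: :replace, replace: "<undef>")
```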

myronmarston · 2015-01-07T08:00:19Z

I opened a PR against your branch that refactors the matcher and does a couple other small changes:

bf4#1

bf4 · 2015-01-07T14:18:54Z

> I opened a PR against your branch that refactors the matcher and does a couple other small changes:

oh fun. Will review

bf4 · 2015-01-08T06:40:11Z

Didn't finish addressing comments tonight. Merged and amended last commit in your PR. Will finish and push code 'tomorrow' 💤

bf4 · 2015-01-09T04:16:36Z

Didn't work on it today. Probably won't again till Sunday.

myronmarston · 2015-01-09T05:10:23Z

That's fine :).

- Add tests for EncodedString#to_s, #split, #<<
- For each Encoding failure
  - assert the expected failure is raised on a String, but not on an 'EncodedString'
  - assert invalid bytes or unconvertible characters are replaced

Currently one test is failing (pending)
- EncodedString#split when the string has an invalid byte sequence incorrectly raises an ArgumentError

Use 'expect_identical_string' to avoid running expectation failures through the differ, which also uses EncodedString
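The contrast these specs assert (a plain String raises, while the wrapped string gets a replacement) can be illustrated without rspec-support. This is a sketch only: `replace_unencodable` is a hypothetical helper, not EncodedString's implementation, and the "?" replacement is an arbitrary choice.

```ruby
INVALID_UTF8 = "abc\xAEdef"   # 0xAE alone is not a valid UTF-8 byte sequence

# Hypothetical helper: convert to the target encoding, substituting "?"
# for invalid bytes and unconvertible characters instead of raising.
def replace_unencodable(string, encoding)
  string.encode(encoding, invalid: :replace, undef: :replace, replace: "?")
end

# A plain String raises on the invalid byte...
begin
  INVALID_UTF8.encode("UTF-16LE")
rescue Encoding::InvalidByteSequenceError => e
  puts e.class
end

# ...and on a character the target encoding cannot represent.
begin
  "caf\u00E9".encode("US-ASCII")
rescue Encoding::UndefinedConversionError => e
  puts e.class
end

# With replacement requested, both conversions succeed.
p replace_unencodable(INVALID_UTF8, "UTF-16LE").encode("UTF-8")
p replace_unencodable("caf\u00E9", "US-ASCII")
```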

It’s actually not quite the hot spot the comment says it is. Encoded strings are only created when an expectation that uses a diffable matcher fails. I believe that expectations normally pass (since people usually try to keep their test suite green and only have a small number of failing specs at a time), so encoded strings are not created by every expectation.

These cases are contrasts (as one example raises the error, but the other does not), so using `and` was confusing since they don’t do the same thing. `vs` makes more sense.

This reads better, provides better failure output, and is composable.

bf4 · 2015-01-15T06:34:24Z

Rebased off of current master and force pushed

Added new commits to the PR per discussion for easy review. If that's ok, I would like to discuss which concepts in the PR deserve their own commit and how your PR into my PR should appear. (If you don't care, I'm comfortable just doing something.)

Also, I looked into Windows encodings more and think I'd like to document how to set the default external encoding that the suite expects on POSIX and Windows; see #151 (comment) and #151 (comment).
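As a rough illustration of what such documentation might cover, the default external encoding can be inspected and overridden from Ruby itself. This is a sketch under assumptions: `with_default_external` is a hypothetical helper, and nothing here is taken from what the suite actually settled on.

```ruby
# Report the encodings Ruby picked up from the environment.
puts "external: #{Encoding.default_external}"
puts "internal: #{Encoding.default_internal.inspect}"

# Hypothetical helper: run a block with a fixed default external encoding
# and restore the previous value afterwards.
def with_default_external(encoding)
  previous = Encoding.default_external
  Encoding.default_external = encoding
  yield
ensure
  Encoding.default_external = previous
end

with_default_external(Encoding::UTF_8) do
  puts "inside block: #{Encoding.default_external}"
end
```

The same default can also be set for a whole process from the command line with `ruby -E UTF-8`.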

bf4 · 2015-01-23T12:13:08Z

@myronmarston This is ready for review, when you get the chance. (Also see previous comment).


myronmarston · 2015-01-23T18:10:52Z

Merged. Thanks, @bf4!

bf4 · 2015-01-23T20:59:19Z

🌈 🎆 🐴 💯 yay!

sj26 · 2015-01-25T17:23:24Z

lib/rspec/support/encoded_string.rb

+        #     vs "\x80".encode('UTF-8','ASCII-8BIT', undef: :replace, replace: '<undef>')
+        #     # => '<undef>'
+        #   Encoding::CompatibilityError
+        #    when Enconding.compatbile?(str1, str2) is false


Maybe should be Encoding. :-)

bf4 force-pushed the encoding_specs branch 2 times, most recently from e6e04bc to 601924a on January 7, 2015 06:04

bf4 reviewed Jan 7, 2015

bf4 mentioned this pull request Jan 7, 2015

Test EncodedSring#to_s for undefined conversion / invalid byte sequence #134

Closed

bf4 reviewed Jan 7, 2015

bf4 and others added 5 commits January 15, 2015 00:29

Add be_identical_string matcher

e3231f4

Make comments more clear.

012e558

These cases are contrasts (as one example raises the error, but the other does not), so using `and` was confusing since they don’t do the same thing. `vs` makes more sense.

Refactor encoding helpers into a custom matcher.

0b61339

This reads better, provides better failure output, and is composable.
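To make the idea concrete, here is a minimal sketch of what a matcher such as be_identical_string might check. It is not the matcher from this PR: `IdenticalStringMatcher` is a hypothetical plain-Ruby object that implements only the `matches?`/`failure_message` protocol RSpec expects of a matcher, and it assumes "identical" means same encoding and same bytes.

```ruby
# Hypothetical matcher object, written without the RSpec DSL so it runs
# on its own.
class IdenticalStringMatcher
  def initialize(expected)
    @expected = expected
  end

  # Strings are identical only if both encoding and raw bytes agree.
  def matches?(actual)
    @actual = actual
    actual.encoding == @expected.encoding && actual.bytes == @expected.bytes
  end

  # Reports bytes and encodings directly, so no differ is involved.
  def failure_message
    "expected #{describe(@actual)} to be identical to #{describe(@expected)}"
  end

  def description
    "be identical to #{describe(@expected)}"
  end

  private

  def describe(string)
    "#{string.bytes.inspect} (#{string.encoding})"
  end
end

# Hypothetical factory method mirroring the matcher name in the commit.
def be_identical_string(expected)
  IdenticalStringMatcher.new(expected)
end

matcher = be_identical_string("abc")
p matcher.matches?("abc")
p matcher.matches?("abc".encode("UTF-16LE"))
puts matcher.failure_message
```

A matcher defined through RSpec's matcher DSL also gains composability, which this standalone sketch leaves out.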

bf4 added 4 commits January 15, 2015 00:29

Remove failing spec from this PR per discussion

494e3e7

Explain change in behavior of :invalid => :replace

d7c1885

Clarify string << spec

9a13068

Add expectation per discussion; add an existing char to split on

9eeedb9

bf4 force-pushed the encoding_specs branch from 34a98d3 to 9eeedb9 on January 15, 2015 06:30

myronmarston added a commit that referenced this pull request Jan 23, 2015

Merge pull request #151 from bf4/encoding_specs

eb27b60

Extracting EncodedString specs from #134

myronmarston merged commit eb27b60 into rspec:master Jan 23, 2015

sj26 reviewed Jan 25, 2015

bf4 deleted the encoding_specs branch January 25, 2015 18:22

myronmarston mentioned this pull request Jan 31, 2015

Spec ./spec/rspec/support/encoded_string_spec.rb:204 will likely never pass on Rubinius #162

Closed

This was referenced Feb 3, 2015

Test Differ#pick_encoding; move to EncodedString #167

Merged

Document how to test #171

Closed

Differ tests no longer use Differ to report diff expectation #174

Merged
