JSON: Add support for Int128, UInt128 and Half #88962

jozkee · 2023-07-16T07:14:58Z

~~...and add Number support for Utf8JsonReader.CopyString(...).~~
EDIT: Instead of changing CopyString to accept numbers, we will just enable it on an internal helper since we could regret the decission.

Fixes #87994

… for Utf8JsonReader.CopyString(...)

dotnet-issue-labeler · 2023-07-16T07:15:06Z

Note regarding the new-api-needs-documentation label:

This serves as a reminder for when your PR is modifying a ref *.cs file and adding/modifying public APIs, please make sure the API implementation in the src *.cs file is documented with triple slash comments, so the PR reviewers can sign off that change.

ghost · 2023-07-16T07:15:22Z

Tagging subscribers to this area: @dotnet/area-system-text-json, @gregsdennis
See info in area-owners.md if you want to be subscribed.

Issue Details

...and add Number support for Utf8JsonReader.CopyString(...).

Fixes #87994
Fixes #84375

Author:	Jozkee
Assignees:	Jozkee
Labels:	`area-System.Text.Json`, `new-api-needs-documentation`
Milestone:	8.0.0

jozkee · 2023-07-16T07:24:51Z

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs

+        private const int MaxFormatLength = 16;
+        private const int MaxEscapedFormatLength = MaxFormatLength * JsonConstants.MaxExpansionFactorWhileEscaping;


I'm not entirely what's a good number here since I couldn't find a good way to determine what's the max amount of bytes Half.TryFormat can consume.

Presumably you mean Half.TryParse? @tannergooding might know

What are you trying to do here in particular? The underlying parsing algorithm needs to be able to track up to 20 significant digits (for Half, it's 113 for Single, and 768 for Double) to ensure a correct result.

However, the entire input string must always be passed in such that it can process non-significant digits (such as leading zeros) and all trailing digits so that it can determine if the rounding goes up or down.

Imagine for example if the user defines 000...005 or 0.500...1, etc. All the zero digits that represented by the ... must be processed to ensure the result is correct and to ensure the relevant end of string is located.

What are you trying to do here in particular?

MaxFormatLength and MaxEscapedFormatLength are meant to be upper limits on the amount of utf8 bytes that can be parsed as Half.

However, the entire input string must always be passed in such that it can process non-significant digits (such as leading zeros) and all trailing digits so that it can determine if the rounding goes up or down.

This is probably what we need to do, we should not do length constraints and instead try to parse the whole number. I think that the Utf8JsonReader does not limit the lenght of the number tokens either e.g:

var s = new string('1', 100_000); byte[] encodedS = Encoding.UTF8.GetBytes(s); var r = new Utf8JsonReader(encodedS); Console.WriteLine(r.Read()); // prints True Console.WriteLine(r.TokenType); // prints Number

We can use pooling for large buffers and regular byte arrays for even larger ones.

@tannergooding, given that Half.TryParse(ROS<byte>, out Half) is not available on .NET 7 (and this code targets it), is BitConverter.ToHalf a good substitute?

Actually I suspect BitConverter.ToHalf doesn't have the TryParse wiggle room.

That's not an equivalent API. BitConverter.ToHalf simply does a reinterpret cast of raw bytes into a Half.

You'd need to do something similar to the default interface implementation for IUtf8SpanParsable done by INumberBase<T> here: https://source.dot.net/#System.Private.CoreLib/src/libraries/System.Private.CoreLib/src/System/Numerics/INumberBase.cs,555

Which is to say, you have to transcode the input string to UTF-16, then try to parse.

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs

...ries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/Int128Converter.cs

eiriktsarpalis · 2023-07-16T21:07:51Z

...ries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/Int128Converter.cs

+#if NET8_0_OR_GREATER
+            Span<byte> buffer = stackalloc byte[MaxFormatLength];
+#else
+            Span<char> buffer = stackalloc char[MaxFormatLength];


Is this because UTF-8 parsing overloads don't exist in .NET 7 presumably? Perhaps a comment explaining that might help (since it's difficult to tell without intellisense).

src/libraries/System.Text.Json/tests/Common/JsonTestHelper.cs

src/libraries/System.Text.Json/tests/Common/JsonNumberTestData.cs

src/libraries/System.Text.Json/src/System/Text/Json/Reader/Utf8JsonReader.TryGet.cs

…o an internal helper

eiriktsarpalis · 2023-07-17T18:08:13Z

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs

            }
+            finally


We generally don't use try/finally to guard against exceptions in code that uses rented buffers. Not returning a buffer because of an exception is not a big problem all things considered (the problems start if a buffer gets used after being returns or gets returned more than once). You should still move the returning logic above the if (!success) statement above though.

eiriktsarpalis

Other than a few pending issues that should be addressed this looks good to me. Great work David!

Fix handling of floating-point literals on HalfConverter Remove CopyString tests related to Number support

jozkee · 2023-07-18T01:25:19Z

@eiriktsarpalis can you please take another look at the last commits, I found a couple of issues:

For formating infinites we were writing them as ∞ and -∞, specifying CultureInfo.InvariantCulture fixed it.
For parsing NaN and infinites, Half.TryParse was more lax than the current S.T.Json policy and accepted them with any casing i.e: InFiNiTy could be parsed correctly; I fixed it by SequenceEquals the exact bytes we want when the TryParse method returned NaN, PositiveInfinity or NegativeInfinity.

eiriktsarpalis · 2023-07-18T12:59:15Z

For formating infinites we were writing them as ∞ and -∞, specifying CultureInfo.InvariantCulture fixed it.

What configuration is being used when the corresponding values for double and float are being used?

For parsing NaN and infinites, Half.TryParse was more lax than the current S.T.Json policy and accepted them with any casing i.e: InFiNiTy could be parsed correctly; I fixed it by SequenceEquals the exact bytes we want when the TryParse method returned NaN, PositiveInfinity or NegativeInfinity.

Is this an issue though? Citing Postel's law and whatnot perhaps there is a case to be made for tolerating case insensitive identifiers. I'd be surprised if the Half.TryParse behavior is uninentional so perhaps it's there to address valid representations? Perhaps it's an indication that we should also make the double and float parsing logic case insensitive as well. cc @tannergooding

src/libraries/System.Text.Json/tests/Common/NumberHandlingTests.cs

src/libraries/System.Text.Json/src/Resources/Strings.resx

eiriktsarpalis · 2023-07-18T13:16:52Z

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs

+                ArrayPool<byte>.Shared.Return(rentedByteBuffer);
+            }
+
+            if (rentedCharBuffer != null)


rentedCharBuffer is only being used in non-net8.0 targets, so perhaps this transcode and parse logic could be moved inside the TryParse helper so that it only gets used in the relevant targets.

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs

eiriktsarpalis · 2023-07-18T14:01:47Z

...ries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/Int128Converter.cs

+#endif
+            out Int128 result)
+        {
+            return Int128.TryParse(buffer, CultureInfo.InvariantCulture, out result);


Nit: the size of this method body seems too tiny to warrant extracting to a separate helper.

jozkee · 2023-07-18T14:58:20Z

What configuration is being used when the corresponding values for double and float are being used?

InvariantCulture as well, that's the default for Utf8Formatter.TryFormat:

runtime/src/libraries/System.Private.CoreLib/src/System/Buffers/Text/Utf8Formatter/FormattingHelpers.cs

Line 19 in f7ad726

    
           return value.TryFormat(utf8Destination, out bytesWritten, formatText, CultureInfo.InvariantCulture);

formatText is also equivalent to what we use for the new Converters.

jozkee · 2023-07-18T15:01:37Z

Is this an issue though?

I think it is currently an issue since it differs from the current strictness used on double and float. For consistency I suggest we follow the current convention, we should however evaluate changing it in a separate issue/thread.

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs

jozkee · 2023-07-18T15:40:51Z

One of our tests found an assertion failure on Half.TryParse that has reproed consistently only on OSX.

https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-runtime-refs-pull-88962-merge-9757e201f6374e20b5/System.Text.Json.Tests/1/console.78feb75a.log?helixlogtype=result

/private/tmp/helix/working/9EC208C7/w/A9F108F7/e /private/tmp/helix/working/9EC208C7/w/A9F108F7/e
  Discovering: System.Text.Json.Tests (method display = ClassAndMethod, method display options = None)
  Discovered:  System.Text.Json.Tests (found 7160 of 7231 test cases)
  Starting:    System.Text.Json.Tests (parallel test collections = on, max threads = 6)
Process terminated. Assertion failed.
   at System.Globalization.Ordinal.EqualsIgnoreCaseUtf8_Scalar(Byte& charA, Int32 lengthA, Byte& charB, Int32 lengthB) in /_/src/libraries/System.Private.CoreLib/src/System/Globalization/Ordinal.Utf8.cs:line 308
   at System.Number.TryParseFloat[TChar,TFloat](ReadOnlySpan`1 value, NumberStyles styles, NumberFormatInfo info, TFloat& result) in /_/src/libraries/System.Private.CoreLib/src/System/Number.Parsing.cs:line 1229
   at System.Half.TryParse(ReadOnlySpan`1 utf8Text, NumberStyles style, IFormatProvider provider, Half& result) in /_/src/libraries/System.Private.CoreLib/src/System/Half.cs:line 2238
   at System.Text.Json.Serialization.Converters.HalfConverter.TryParse(ReadOnlySpan`1 buffer, Half& result) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs:line 197
   at System.Text.Json.Serialization.Converters.HalfConverter.ReadCore(Utf8JsonReader& reader) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs:line 50
   at System.Text.Json.Serialization.Converters.HalfConverter.ReadNumberWithCustomHandling(Utf8JsonReader& reader, JsonNumberHandling handling, JsonSerializerOptions options) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs:line 115
   at System.Text.Json.Serialization.JsonConverter`1.TryRead(Utf8JsonReader& reader, Type typeToConvert, JsonSerializerOptions options, ReadStack& state, T& value, Boolean& isPopulatedValue) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/JsonConverterOfT.cs:line 193
   at System.Text.Json.Serialization.Metadata.JsonPropertyInfo`1.ReadJsonAndSetMember(Object obj, ReadStack& state, Utf8JsonReader& reader) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/Metadata/JsonPropertyInfoOfT.cs:line 308
   at System.Text.Json.Serialization.Converters.ObjectDefaultConverter`1.OnTryRead(Utf8JsonReader& reader, Type typeToConvert, JsonSerializerOptions options, ReadStack& state, T& value) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Object/ObjectDefaultConverter.cs:line 49
   at System.Text.Json.Serialization.JsonConverter`1.TryRead(Utf8JsonReader& reader, Type typeToConvert, JsonSerializerOptions options, ReadStack& state, T& value, Boolean& isPopulatedValue) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/JsonConverterOfT.cs:line 258
   at System.Text.Json.Serialization.JsonConverter`1.ReadCore(Utf8JsonReader& reader, JsonSerializerOptions options, ReadStack& state) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/JsonConverterOfT.ReadCore.cs:line 51
   at System.Text.Json.Serialization.Metadata.JsonTypeInfo`1.Deserialize(Utf8JsonReader& reader, ReadStack& state) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/Metadata/JsonTypeInfoOfT.ReadHelper.cs:line 22
   at System.Text.Json.JsonSerializer.ReadFromSpan[TValue](ReadOnlySpan`1 utf8Json, JsonTypeInfo`1 jsonTypeInfo, Nullable`1 actualByteCount) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/JsonSerializer.Read.Span.cs:line 160
   at System.Text.Json.JsonSerializer.ReadFromSpan[TValue](ReadOnlySpan`1 json, JsonTypeInfo`1 jsonTypeInfo) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/JsonSerializer.Read.String.cs:line 443
   at System.Text.Json.JsonSerializer.Deserialize[TValue](String json, JsonSerializerOptions options) in /_/src/libraries/System.Text.Json/src/System/Text/Json/Serialization/JsonSerializer.Read.String.cs:line 55
   at System.Text.Json.Serialization.Tests.JsonSerializerWrapper.StringSerializerWrapper.DeserializeWrapper[T](String json, JsonSerializerOptions options) in /_/src/libraries/System.Text.Json/tests/System.Text.Json.Tests/Serialization/JsonSerializerWrapper.Reflection.cs:line 129
   at System.Text.Json.Serialization.Tests.NumberHandlingTests.<>c__DisplayClass39_0.<<FloatingPointConstants_Fail>b__1>d.MoveNext() in /_/src/libraries/System.Text.Json/tests/Common/NumberHandlingTests.cs:line 883
   at System.Runtime.CompilerServices.AsyncMethodBuilderCore.Start[TStateMachine](TStateMachine& stateMachine) in /_/src/libraries/System.Private.CoreLib/src/System/Runtime/CompilerServices/AsyncMethodBuilderCore.cs:line 38
   at System.Text.Json.Serialization.Tests.NumberHandlingTests.<>c__DisplayClass39_0.<FloatingPointConstants_Fail>b__1()

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs

src/libraries/System.Text.Json/tests/Common/NumberHandlingTests.cs

JSON: Add support for Int128, UInt128 and Half and add Number support…

b4b569e

… for Utf8JsonReader.CopyString(...)

jozkee added this to the 8.0.0 milestone Jul 16, 2023

jozkee requested review from krwq, eiriktsarpalis and layomia July 16, 2023 07:14

jozkee self-assigned this Jul 16, 2023

dotnet-issue-labeler bot added area-System.Text.Json new-api-needs-documentation labels Jul 16, 2023

jozkee commented Jul 16, 2023

View reviewed changes

eiriktsarpalis reviewed Jul 16, 2023

View reviewed changes

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs Outdated Show resolved Hide resolved

eiriktsarpalis reviewed Jul 16, 2023

View reviewed changes

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs Outdated Show resolved Hide resolved

eiriktsarpalis reviewed Jul 16, 2023

View reviewed changes

...ries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/Int128Converter.cs Outdated Show resolved Hide resolved

eiriktsarpalis reviewed Jul 16, 2023

View reviewed changes

src/libraries/System.Text.Json/tests/Common/JsonTestHelper.cs Show resolved Hide resolved

eiriktsarpalis reviewed Jul 16, 2023

View reviewed changes

src/libraries/System.Text.Json/tests/Common/JsonNumberTestData.cs Show resolved Hide resolved

krwq reviewed Jul 17, 2023

View reviewed changes

src/libraries/System.Text.Json/src/System/Text/Json/Reader/Utf8JsonReader.TryGet.cs Outdated Show resolved Hide resolved

jozkee added 2 commits July 17, 2023 12:46

Remove parsing limits on Read and move Number support of CopyString t…

d42cf66

…o an internal helper

Fix AllowNamedFloatingPointLiterals on Write for Half

8b71bf3

eiriktsarpalis reviewed Jul 17, 2023

View reviewed changes

eiriktsarpalis approved these changes Jul 17, 2023

View reviewed changes

eiriktsarpalis mentioned this pull request Jul 17, 2023

Extend Utf8JsonReader.CopyString to also accept JsonTokenType.Number values #84375

Open

jozkee added 2 commits July 17, 2023 19:33

Specify InvariantCulture on TryParse and TryFormat

38483e5

Fix handling of floating-point literals on HalfConverter Remove CopyString tests related to Number support

Add test for invalid number input format

2c5cb22

jozkee changed the title ~~JSON: Add support for Int128, UInt128 and Half…~~ JSON: Add support for Int128, UInt128 and Half Jul 18, 2023

Fix net6.0 build error about missing Half.TryParse overload

22bc1f7

build-analysis bot mentioned this pull request Jul 18, 2023

Tests.System.TimeProviderTests.TestProviderTimer test failure #87477

Closed

eiriktsarpalis reviewed Jul 18, 2023

View reviewed changes

src/libraries/System.Text.Json/tests/Common/NumberHandlingTests.cs Show resolved Hide resolved

eiriktsarpalis reviewed Jul 18, 2023

View reviewed changes

src/libraries/System.Text.Json/src/Resources/Strings.resx Show resolved Hide resolved

eiriktsarpalis reviewed Jul 18, 2023

View reviewed changes

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs Show resolved Hide resolved

eiriktsarpalis reviewed Jul 18, 2023

View reviewed changes

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs Show resolved Hide resolved

eiriktsarpalis reviewed Jul 18, 2023

View reviewed changes

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs Outdated Show resolved Hide resolved

Move rentedCharBuffer logic to TryParse helper

43fe8a8

jozkee mentioned this pull request Jul 18, 2023

Half.TryParse assertion failure on System.Globalization.Ordinal.EqualsIgnoreCaseUtf8_Scalar #89094

Closed

eiriktsarpalis reviewed Jul 18, 2023

View reviewed changes

...raries/System.Text.Json/src/System/Text/Json/Serialization/Converters/Value/HalfConverter.cs Outdated Show resolved Hide resolved

eiriktsarpalis reviewed Jul 18, 2023

View reviewed changes

src/libraries/System.Text.Json/tests/Common/NumberHandlingTests.cs Show resolved Hide resolved

jozkee added 2 commits July 18, 2023 12:08

Address feedback

02e83b3

Disable test for OSX

8c3e143

eiriktsarpalis approved these changes Jul 18, 2023

View reviewed changes

jozkee merged commit e2c04e0 into dotnet:main Jul 18, 2023

jozkee deleted the json-new-numbers-support branch July 18, 2023 19:35

ghost locked as resolved and limited conversation to collaborators Aug 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JSON: Add support for Int128, UInt128 and Half #88962

JSON: Add support for Int128, UInt128 and Half #88962

jozkee commented Jul 16, 2023 •

edited

Loading

dotnet-issue-labeler bot commented Jul 16, 2023

ghost commented Jul 16, 2023

jozkee Jul 16, 2023 •

edited

Loading

eiriktsarpalis Jul 16, 2023

tannergooding Jul 17, 2023

jozkee Jul 17, 2023

jozkee Jul 17, 2023

tannergooding Jul 17, 2023

eiriktsarpalis Jul 16, 2023 •

edited

Loading

eiriktsarpalis Jul 17, 2023

eiriktsarpalis left a comment

jozkee commented Jul 18, 2023 •

edited

Loading

eiriktsarpalis commented Jul 18, 2023 •

edited

Loading

eiriktsarpalis Jul 18, 2023

eiriktsarpalis Jul 18, 2023

jozkee commented Jul 18, 2023

jozkee commented Jul 18, 2023

jozkee commented Jul 18, 2023

		private const int MaxFormatLength = 16;
		private const int MaxEscapedFormatLength = MaxFormatLength * JsonConstants.MaxExpansionFactorWhileEscaping;

JSON: Add support for Int128, UInt128 and Half #88962

JSON: Add support for Int128, UInt128 and Half #88962

Conversation

jozkee commented Jul 16, 2023 • edited Loading

dotnet-issue-labeler bot commented Jul 16, 2023

ghost commented Jul 16, 2023

jozkee Jul 16, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

eiriktsarpalis Jul 16, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

eiriktsarpalis left a comment

Choose a reason for hiding this comment

jozkee commented Jul 18, 2023 • edited Loading

eiriktsarpalis commented Jul 18, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jozkee commented Jul 18, 2023

jozkee commented Jul 18, 2023

jozkee commented Jul 18, 2023

jozkee commented Jul 16, 2023 •

edited

Loading

jozkee Jul 16, 2023 •

edited

Loading

eiriktsarpalis Jul 16, 2023 •

edited

Loading

jozkee commented Jul 18, 2023 •

edited

Loading

eiriktsarpalis commented Jul 18, 2023 •

edited

Loading