*_ndjson 'utf-8' encoding issue #10034
Labels
accepted: Ready for implementation
bug: Something isn't working
python: Related to Python Polars
regression: Issue introduced by a new release
Checks
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of Polars.
Reproducible example
The test.jsonl file is available here: https://gist.github.com/getorca/d3a6460f0d14b573c1d38322828d34d8#file-test-jsonl
Issue description
The example throws a Unicode error.
It seems to be related to read_ndjson and scan_ndjson, as when loading with from_dicts the string is encoded properly.
Also, adding several characters to the end of the JSON value, like "...France: €10.57\n* Germany: €10.45\n some more random words make it work", results in it being encoded properly.
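Since from_dicts handles the string correctly, one possible interim workaround is to parse the file with the standard library and hand the rows to from_dicts. The sketch below is only an illustration under that assumption: the helper name load_ndjson_rows is hypothetical and is not part of Polars, and it has not been checked against the gist file.

```python
import json
from pathlib import Path


def load_ndjson_rows(path):
    """Parse an NDJSON file into a list of dicts (hypothetical helper, stdlib only)."""
    rows = []
    # Open in text mode with an explicit UTF-8 encoding so multi-byte
    # characters such as "€" are decoded by Python, not by the NDJSON reader.
    with Path(path).open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:  # skip blank lines between records
                rows.append(json.loads(line))
    return rows
```

The resulting list can then be passed to Polars, for example pl.from_dicts(load_ndjson_rows("test.jsonl")). This bypasses read_ndjson and scan_ndjson entirely, so it does not fix the regression and will be slower on large files.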
Expected behavior
The string is encoded properly.
Installed versions