Use `RowBinaryWithNamesAndTypes` #10

loyd · 2021-04-20T07:52:46Z

Use `RowBinaryWithNamesAndTypes` instead of `RowBinary` as the primary format in order to support `deserialize_any` and, hence, `serde(flatten)`, type conversions, and validation.
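For context, here is a minimal sketch of what reading the `RowBinaryWithNamesAndTypes` header involves, going by the format description in the ClickHouse docs: a LEB128-encoded column count N, then N column names, then N type names, each string prefixed with its LEB128 length. The `Header` struct and the function names are made up for illustration and are not part of this crate's API.

```rust
/// Column names and type names read from a RowBinaryWithNamesAndTypes header.
/// (Hypothetical struct, for illustration only.)
#[derive(Debug, PartialEq)]
struct Header {
    names: Vec<String>,
    types: Vec<String>,
}

/// Reads an unsigned LEB128 integer and advances `buf` past it.
fn read_varuint(buf: &mut &[u8]) -> Result<u64, String> {
    let mut value = 0u64;
    for shift in (0..64).step_by(7) {
        let (&byte, rest) = buf.split_first().ok_or("not enough data")?;
        *buf = rest;
        value |= u64::from(byte & 0x7f) << shift;
        // The high bit is a continuation flag; a clear bit ends the number.
        if byte & 0x80 == 0 {
            return Ok(value);
        }
    }
    Err("varuint is too long".to_string())
}

/// Reads a length-prefixed UTF-8 string and advances `buf` past it.
fn read_string(buf: &mut &[u8]) -> Result<String, String> {
    let len = read_varuint(buf)? as usize;
    if buf.len() < len {
        return Err("not enough data".to_string());
    }
    let (bytes, rest) = buf.split_at(len);
    *buf = rest;
    String::from_utf8(bytes.to_vec()).map_err(|e| e.to_string())
}

/// Reads the header; afterwards `buf` points at the first data row.
fn read_header(buf: &mut &[u8]) -> Result<Header, String> {
    let count = read_varuint(buf)? as usize;
    let names = (0..count)
        .map(|_| read_string(buf))
        .collect::<Result<Vec<_>, _>>()?;
    let types = (0..count)
        .map(|_| read_string(buf))
        .collect::<Result<Vec<_>, _>>()?;
    Ok(Header { names, types })
}
```

Once the type names are known, a deserializer has enough information to drive `deserialize_any` or to reject a mismatched target type before touching the row data.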


makorne · 2021-10-01T05:55:00Z

Hi! Any progress with this?
Maybe you need a beta tester? :)

Looks like a very important enhancement.
Sometimes I get `NotEnoughData` on a query that works with a different time bind argument and works in clickhouse-client without any issues, and there is no hint as to why.

wspeirs · 2023-10-18T18:53:26Z

@loyd do you have a branch for the RowBinaryWithNamesAndTypes work you can share? Happy to help contribute, but also don't want to start-from-scratch if you're already close :-)

**SUPER HACKY**
* Allows for fetching cell-by-cell, with a different type for each
* This is a bit of a hack, but can be a solution for ClickHouse#10

wspeirs · 2023-10-18T20:43:42Z

@loyd this is a super-hack (but min number of changes) that can get us cell-by-cell fetching of data. I see a few issues with this approach (besides the hackiness, but some of it can be cleaned up):

* The `?fields` marker for the SELECT statement is no longer supported because there is no type to get the fields from. I don't think this is a huge deal, but I'm open to feedback.
* If you attempt to get a cell with the wrong type, it could start reading data from the next cell or return a "not enough data" error. I don't think this is a huge deal either, because the same thing happens if you specify the wrong type now.

Going with RowBinaryWithNamesAndTypes is going to be tough because we'd have to translate the string names to the actual types... not impossible, but annoying.

I welcome any/all feedback... thanks!
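To give a rough idea of the "translate the string names to the actual types" step mentioned above, here is a small sketch that parses a handful of type names into an enum. The `ColumnType` enum, `unwrap_call`, and `parse_type` are hypothetical names, and only a few types are covered; parameterized types such as `Tuple(...)` or `DateTime64(...)` would need a fuller parser.

```rust
/// A small subset of ClickHouse column types, enough to show the idea.
/// (Hypothetical enum, not part of the crate.)
#[derive(Debug, Clone, PartialEq)]
enum ColumnType {
    UInt8,
    UInt32,
    UInt64,
    Int32,
    Int64,
    Float64,
    Str,
    Nullable(Box<ColumnType>),
    Array(Box<ColumnType>),
}

/// Returns the text inside `wrapper(...)` if `s` has exactly that shape.
fn unwrap_call<'a>(s: &'a str, wrapper: &str) -> Option<&'a str> {
    s.strip_prefix(wrapper)?.strip_prefix('(')?.strip_suffix(')')
}

/// Parses a type name as it appears in the header into a `ColumnType`.
fn parse_type(s: &str) -> Result<ColumnType, String> {
    let s = s.trim();
    // Wrapper types are handled recursively.
    if let Some(inner) = unwrap_call(s, "Nullable") {
        return Ok(ColumnType::Nullable(Box::new(parse_type(inner)?)));
    }
    if let Some(inner) = unwrap_call(s, "Array") {
        return Ok(ColumnType::Array(Box::new(parse_type(inner)?)));
    }
    match s {
        "UInt8" => Ok(ColumnType::UInt8),
        "UInt32" => Ok(ColumnType::UInt32),
        "UInt64" => Ok(ColumnType::UInt64),
        "Int32" => Ok(ColumnType::Int32),
        "Int64" => Ok(ColumnType::Int64),
        "Float64" => Ok(ColumnType::Float64),
        "String" => Ok(ColumnType::Str),
        // Anything else is reported rather than guessed at.
        other => Err(format!("unsupported column type: {other}")),
    }
}
```

Returning an error for unknown names keeps the failure explicit instead of silently misreading the row bytes.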

loyd · 2024-01-27T15:52:01Z

Providing support for deserializing different rows into different types is a footgun; CH always returns specific data shapes without any variation.

Moving to RowBinaryWithNamesAndTypes is an expected improvement not only for supporting deserialize_any, but, first of all, for validation purposes to prevent a schema violation (e.g., #100).

It's not a problem to replace RowBinary with RowBinaryWithNamesAndTypes, but we need to decide in which cases conversion is allowed and when an error should be raised.

Support for SELECTs and INSERTs is different. During INSERT, the conversion is performed on the CH side. Moreover, the behavior is not specified precisely; the docs say only:

> If setting input_format_with_types_use_header is set to 1, the types from input data will be compared with the types of the corresponding columns from the table. Otherwise, the second row will be skipped.

So, I need to check the actual behavior (and maybe check other libraries).

For SELECTs, the conversion is performed on the client side, so we have more control over it.

Also, the behavior should remain the same if we move to TCP+Native.
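To illustrate the validation concern from the comment above (the #100 case), here is a small sketch. In RowBinary, a `Nullable` value is preceded by a flag byte, where 1 means NULL, so a NULL in a `Nullable(UInt8)` column is the single byte `0x01`, which an unchecked reader happily returns as the `u8` value 1. The strict exact-match policy in `read_u8_checked` is only one possible choice and an assumption here, since which conversions to allow is still undecided; both function names are hypothetical.

```rust
/// Plain RowBinary decoding of a `u8`: takes the next byte, whatever it is.
fn read_u8_unchecked(buf: &mut &[u8]) -> Result<u8, String> {
    let (&byte, rest) = buf.split_first().ok_or("not enough data")?;
    *buf = rest;
    Ok(byte)
}

/// Decoding with a type check against the header's type name.
/// Hypothetical strict policy: the column type must be exactly `UInt8`.
fn read_u8_checked(column_type: &str, buf: &mut &[u8]) -> Result<u8, String> {
    if column_type != "UInt8" {
        // Nothing is consumed from `buf` on a mismatch.
        return Err(format!(
            "schema mismatch: column is {column_type}, but the target is u8"
        ));
    }
    read_u8_unchecked(buf)
}
```

With the header available, a NULL from a `Nullable(UInt8)` column turns into an explicit error instead of a silent `1`.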

loyd added the enhancement label Jul 23, 2021

loyd mentioned this issue Sep 25, 2021

Fetching wrong columns #21

Closed

This was referenced Nov 8, 2022

Still in active development? #45

Closed

UUID does not match #26

Closed

loyd mentioned this issue Jan 4, 2023

Feature Support: Generic Query, How to get schema and data while query? #53

Open

loyd mentioned this issue Mar 21, 2023

CANNOT_READ_ALL_DATA on inserter commit/end #57

Closed

wspeirs added a commit to wspeirs/clickhouse.rs that referenced this issue Oct 18, 2023

Added the ability to fetch cell-by-cell

3e72638

**SUPER HACKY**
* Allows for fetching cell-by-cell, with a different type for each
* This is a bit of a hack, but can be a solution for ClickHouse#10

loyd mentioned this issue Jan 27, 2024

NULL successfully deserializes into u8, i8, and bool #100

Closed

loyd mentioned this issue Jul 25, 2024

Erroneous behaviour when selecting multiple booleans #112

Closed

loyd mentioned this issue Aug 18, 2024

Make serialize_into and deserialize_from public #127

Closed

pravic mentioned this issue Sep 17, 2024

Fetch rows in JSON format #152

Open

3 tasks
