[FEA] `read_json` should output all-nulls columns for the schema columns that do not exist in the input #17091

ttnghia · 2024-10-15T19:02:13Z

In order to fulfill some applications such as Spark, read_json needs to output all-nulls columns for the columns in the input schema that do not exist in the input data. This applies to all schema columns at any nested level.

If there is conflict of interest between applications, we can implement this as a reader option.

The text was updated successfully, but these errors were encountered:

ttnghia added cuIO cuIO issue feature request New feature or request Spark Functionality that helps Spark RAPIDS labels Oct 15, 2024

ttnghia mentioned this issue Oct 15, 2024

[FEA] Improve GpuJsonToStructs performance NVIDIA/spark-rapids#11560

Open

karthikeyann mentioned this issue Oct 21, 2024

JSON spark reader plan for 24.12 #17138

Open

karthikeyann mentioned this issue Nov 6, 2024

Add optional column_order in JSON reader #17029

Merged

3 tasks

rapids-bot bot closed this as completed in #17029 Nov 8, 2024

rapids-bot bot closed this as completed in b3b5ce9 Nov 8, 2024

ttnghia mentioned this issue Nov 15, 2024

[FEA] read_json should output all-nulls columns for the schema columns that do not match with the input JSON #17341

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] `read_json` should output all-nulls columns for the schema columns that do not exist in the input #17091

[FEA] `read_json` should output all-nulls columns for the schema columns that do not exist in the input #17091

ttnghia commented Oct 15, 2024

[FEA] read_json should output all-nulls columns for the schema columns that do not exist in the input #17091

[FEA] read_json should output all-nulls columns for the schema columns that do not exist in the input #17091

Comments

ttnghia commented Oct 15, 2024

[FEA] `read_json` should output all-nulls columns for the schema columns that do not exist in the input #17091

[FEA] `read_json` should output all-nulls columns for the schema columns that do not exist in the input #17091