Remove dict_id
from arrow_schema::field::Field
and make dictionary IDs an internal implementation detail of flight encoding/decoding
#5981
Labels
enhancement
Any new improvement worthy of a entry in the changelog
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Currently the
dict_id
field is only used for the purposes of arrow flight encoding/decoding so dictionaries can be mapped to there associated fields.This is annoying and error-prone as the user is left the responsibility of assigning these dictionary IDs and ensuring that they are unique.
#5971 adds the option to auto-assign dictionary IDs during arrow flight encoding. This can be enabled by setting the
preserve_dict_id
option inIpcWriteOptions
tofalse
(current default istrue
Describe the solution you'd like
This can be done in stages but ultimately would like to
preserve_dict_id
default tofalse
preserve_dict_id
option altogetherdict_id
field fromarrow_schema::schema::Field
entirely as it no longer has any purposeDescribe alternatives you've considered
We can leave this is as a configurable option and either only do 1 above or we can leave auto-assigning of dictionary IDs as an opt-in feature
Additional context
The text was updated successfully, but these errors were encountered: