-
Notifications
You must be signed in to change notification settings - Fork 483
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ORC-1098: [C++] Support specifying type ids or column names in cpp tools #1020
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you so much for the PR, @stiga-huang .
Could you review this please, @wgtmac ?
And, if possible, could you participate the on-going Apache ORC 1.6.13 vote, @stiga-huang and @wgtmac ?
Thank @dongjoon-hyun! Just voted. |
Thank you so much for your participation, @stiga-huang . |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, LGTM. Thank you, @stiga-huang .
Merged to main for Apache ORC 1.8.
@stiga-huang . After reviewing this PR once more and testing on |
…ols (#1020) ### What changes were proposed in this pull request? This is a follow-up task of #921. Currently we have options for the tools to work on specified top-level column fields. However, ACID ORC files usually have nested structure. We need the type ids to specify nested columns. As an extension, adding support for column names will also be helpful. So we don't need to manually convert column names to type ids. Also reports the valid values when an invalid column name is given. This PR extracts the option parsing codes into ToolsHelper. So similiar cpp tools can share the same option set. ### Why are the changes needed? It makes the tools more useful in practice. ### How was this patch tested? Added unit tests for the new options. (cherry picked from commit 89af2cb) Signed-off-by: Dongjoon Hyun <[email protected]>
…ols (apache#1020) ### What changes were proposed in this pull request? This is a follow-up task of apache#921. Currently we have options for the tools to work on specified top-level column fields. However, ACID ORC files usually have nested structure. We need the type ids to specify nested columns. As an extension, adding support for column names will also be helpful. So we don't need to manually convert column names to type ids. Also reports the valid values when an invalid column name is given. This PR extracts the option parsing codes into ToolsHelper. So similiar cpp tools can share the same option set. ### Why are the changes needed? It makes the tools more useful in practice. ### How was this patch tested? Added unit tests for the new options.
What changes were proposed in this pull request?
This is a follow-up task of #921. Currently we have options for the tools to work on specified top-level column fields. However, ACID ORC files usually have nested structure. We need the type ids to specify nested columns. As an extension, adding support for column names will also be helpful. So we don't need to manually convert column names to type ids. Also reports the valid values when an invalid column name is given.
This PR extracts the option parsing codes into ToolsHelper. So similiar cpp tools can share the same option set.
Why are the changes needed?
It makes the tools more useful in practice.
How was this patch tested?
Added unit tests for the new options.