-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Databricks loader: Support for generated columns #951
Milestone
Comments
istreeter
added a commit
that referenced
this issue
Jun 24, 2022
istreeter
added a commit
that referenced
this issue
Jun 25, 2022
This was referenced Jun 25, 2022
Closed
istreeter
added a commit
that referenced
this issue
Jun 25, 2022
istreeter
added a commit
that referenced
this issue
Jun 25, 2022
istreeter
added a commit
that referenced
this issue
Jun 25, 2022
istreeter
added a commit
that referenced
this issue
Jun 25, 2022
pondzix
pushed a commit
that referenced
this issue
Jun 28, 2022
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
For optimum table partitioning, we want to use a auto-generated column on the date of collector timestamp. I imagine a table definition something like this:
I have found that with generated columns we occasionally get exceptions with messages like:
I think it's something to do with how we use the
MERGESCHEMA
copy option, without explicitly setting the table schema, and because different batches can have different sets of entities. These seems to be inconsistent with generated columns.The solution I've found is to always specify every single column in the table in the
COPY INTO
statement. If the column is not in the parquet file then select it asNULL AS unstruct_event_com_acme_myevent_1
.The text was updated successfully, but these errors were encountered: