update of macro for postgres/redshift use of unique_key as a list #4858
Conversation
Thank you for your pull request! We could not find a changelog entry for this change. For details on how to document a change, see the contributing guide.
@McKnight-42 and I had a chance to pair this afternoon, and made good progress!
- Our "standard" tests (copied from plugins) are passing on Postgres here
- The exact same tests pass against Redshift in local testing
- And against Snowflake, using the `delete+insert` incremental strategy

Next steps look like:
- Rewriting these tests to use the new `pytest` framework
- Making them truly cross-database compatible (I think by turning `expected` models into seeds)
- Adding them to the inheritable adapter tests, and turning them on for Redshift + Snowflake + BigQuery + Spark. Delete the copy-pasted versions present in those repos :)
I'd really like to include this change ahead of cutting v1.1.0-b1 on Thursday, and I don't think those next steps need to block merging this PR. If we do merge with next steps still open, we should open issues for the remaining work.
from {{ source }}
);
{% endif %}
{% if unique_key %}
I was worried that this may be a breaking change for any users who have defined a unique key named `"False"`. Putting aside how confusing that would be — we tested it out, and this logic should still work.
also an empty string
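The reason both cases are safe is that Jinja's `{% if unique_key %}` follows Python truthiness: a column literally named `"False"` is still a non-empty string and passes the check, while an empty string or an unset key does not. A minimal Python sketch of the same truthiness rules (the helper name is invented for illustration):

```python
def should_apply_unique_key(unique_key):
    # Mirrors Jinja's `{% if unique_key %}`: plain truthiness on the value.
    return bool(unique_key)

# A column literally named "False" is a non-empty string, hence truthy.
print(should_apply_unique_key("False"))  # True
# An empty string or None means "no unique key configured".
print(should_apply_unique_key(""))       # False
print(should_apply_unique_key(None))     # False
# A non-empty list of key columns is also truthy.
print(should_apply_unique_key(["id", "updated_at"]))  # True
```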
)

self.assertEqual(status, RunStatus.Error)
self.assertTrue("thisisnotacolumn" in exc.lower())
This logic needs to be database-agnostic. The dbt-snowflake tests expect a highly specific error message, which is specific to Snowflake. The test xfails correctly on other databases, but then those databases use their own words to tell us that the column is missing. No good.
The move here is to:
- Only check for the missing column name (`thisisnotacolumn`) in the error message, since any database worth its salt should include the column name in its error message
- Lowercase the whole error message, since some databases (Snowflake) will return the column name in uppercase

Turns out, this is also what we need to get the tests passing on Snowflake with the `delete+insert` incremental strategy, since the error message is slightly different (different column alias)
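The two steps above reduce to a case-insensitive substring check on the raised error message. A minimal sketch of that database-agnostic assertion (the helper name and the sample messages are invented for illustration):

```python
def missing_column_reported(error_message: str, column_name: str) -> bool:
    """Check only that the missing column's name appears in the error,
    case-insensitively, since e.g. Snowflake uppercases identifiers."""
    return column_name.lower() in error_message.lower()

# Snowflake-style message: identifier comes back uppercased.
snowflake_err = "invalid identifier 'DBT_INTERNAL_SOURCE.THISISNOTACOLUMN'"
# Postgres-style message: identifier stays lowercase, different wording.
postgres_err = 'column "thisisnotacolumn" does not exist'

print(missing_column_reported(snowflake_err, "thisisnotacolumn"))  # True
print(missing_column_reported(postgres_err, "thisisnotacolumn"))   # True
```

The same check passes on both databases even though the surrounding wording and column alias differ.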
I have opened up a new issue to continue the conversation here #4926
Thanks @McKnight-42! I should have been clearer in my statement above: The logic in this PR is database-agnostic. It isn't in some of the other plugins (since we copy-pasted-edited the test cases). We'll want to sort that out when we go to solve #4882
One small question on your changes to the macro.
But I think the bigger thing is that the new tests you added should follow the new testing pattern instead of the old integration-tests pattern.
@McKnight-42 I just saw @jtcohen6's comment on converting the tests not being a blocker to getting this into the beta. That's fine but can you create an issue now to resolve those last steps so it's not lost?
…t into mcknight/unique_key_as_lists
…tests pass locally for postgres
Minor suggestions for readability but the logic looks solid.
Looks good now!
This LGTM with the exception of the cross-database concerns @jtcohen6 pointed out.
I wouldn't be opposed to merging as-is for now, since we still only run integration tests with Postgres, as long as we have a follow-on ticket to make it agnostic.
The feedback has been addressed, so I'm unblocking this PR.
Looks good!
#4738: update of macro `get_delete_insert_merge_sql` to check if the unique key being passed is a list or a string

Description

Updates `get_delete_insert_merge_sql` to handle a `unique_key` passed as either a single column name or a list of column names. To pair with changes made in #4618 to extend capabilities to the postgres and redshift adapters
Checklist
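The list-versus-string branching described above can be sketched in plain Python. This is a hypothetical illustration of the two predicate shapes, not the actual Jinja macro; the function name, default table names, and exact SQL wording are invented for the example:

```python
def delete_predicate(unique_key, source="__dbt_tmp", target="my_table"):
    """Build the DELETE statement used by a delete+insert strategy.

    Accepts unique_key as either a single column name (string) or a
    list of column names, mirroring the macro's new branching.
    """
    if isinstance(unique_key, str):
        # Single key: subquery membership check on one column.
        return (
            f"delete from {target} where {unique_key} in "
            f"(select {unique_key} from {source})"
        )
    # List of keys: every key column must match between target and source.
    conditions = " and ".join(
        f"{source}.{col} = {target}.{col}" for col in unique_key
    )
    return f"delete from {target} using {source} where {conditions}"

print(delete_predicate("id"))
print(delete_predicate(["id", "updated_at"]))
```

A string falls into the original single-column path, while a list produces one equality condition per key column, joined with `and`.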