Bug Hitting ThrottlingException on GetWorkGroup with threads turned up #595

Closed
nicor88 opened this issue Mar 12, 2024 Discussed in #591 · 5 comments
Labels
bug Something isn't working

Comments

@nicor88
Contributor

nicor88 commented Mar 12, 2024

Discussed in #591

Originally posted by dacreify March 6, 2024
Recently we started hitting ThrottlingException on GetWorkGroup calls at this spot in the code:

https://github.com/dbt-athena/dbt-athena/blob/a1b8db5de90b20557bcd5e0c51a30177bcddaa5f/dbt/adapters/athena/impl.py#L231

From CloudTrail it looks like dbt-athena winds up making a GetWorkGroup call for every model run. There's no documented quota for this call, but we're obviously hitting one. StartQueryExecution itself can go at 20 calls/second with bursts to 80/second, so the GetWorkGroup limit definitely seems to be below that in any case.

Anyone else hit this? Can we cache the results of GetWorkGroup to avoid it?
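For illustration, a minimal sketch of the kind of caching being proposed (the function name, region, and work group are placeholders, not the actual dbt-athena code):

```python
from functools import lru_cache

import boto3


@lru_cache(maxsize=None)
def get_work_group_cached(athena_client, work_group: str) -> dict:
    # boto3 clients don't define custom equality, so they hash by
    # identity: as long as the same client instance is reused, every
    # call after the first for a given work group is served from the
    # cache instead of hitting the GetWorkGroup API.
    return athena_client.get_work_group(WorkGroup=work_group)


# Hypothetical usage: one shared client for the whole dbt run.
athena = boto3.client("athena", region_name="eu-west-1")
work_group = get_work_group_cached(athena, "primary")
```

With something like this in place, a run of N models would issue one GetWorkGroup call instead of N, which should keep the call rate well under any undocumented quota.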

@nicor88 nicor88 added the bug Something isn't working label Mar 12, 2024
@nicor88 nicor88 mentioned this issue Mar 12, 2024
@nicor88
Contributor Author

nicor88 commented Mar 14, 2024

@dacreify @juliansteger-sc could you try the latest version from the main branch and verify that the fix solves your issue?

@dacreify
Contributor

@nicor88 I installed from main and cranked the threads back up on our project. I'm not able to reproduce the throttling now, but I guess it's hard to tell if this is the dbt-athena caching or AWS fixing their regression.

I noticed that the cache key includes the client object:

https://github.com/dbt-athena/dbt-athena/blob/3cdd7ee96f39b347179f729295d8d5c69f140d69/dbt/adapters/athena/impl.py#L219-L224

Given the module-level lock here I'm guessing there's only one instance of athena_client in use for the whole run?

https://github.com/dbt-athena/dbt-athena/blob/3cdd7ee96f39b347179f729295d8d5c69f140d69/dbt/adapters/athena/impl.py#L232-L240

Just confirming that I understand how the caching is working.
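For reference, here's a minimal sketch of the pattern I think I'm looking at, assuming a module-level lock guarding lazy creation of a single shared client (the names are illustrative, not the actual dbt-athena code):

```python
import threading

import boto3

_client_lock = threading.Lock()
_athena_client = None


def get_athena_client():
    global _athena_client
    # Double-checked locking: the fast path skips the lock once the
    # client exists; the lock ensures only one thread creates it.
    if _athena_client is None:
        with _client_lock:
            if _athena_client is None:
                _athena_client = boto3.client("athena")
    return _athena_client
```

If that's roughly what the linked code does, the lru_cache key (client, work group) stays stable across all threads for the whole run.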

@nicor88
Contributor Author

nicor88 commented Mar 16, 2024

@dacreify lru_cache caches the result when the same inputs are passed multiple times.
Given that the client and workgroup don't change, it should work as expected.
I'm not sure whether multiple clients are spawned per thread when running with more threads; I'd need to check that behaviour. In general, lru_cache is thread-safe.
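To illustrate the keying behaviour (a sketch with a fake client, not the dbt-athena code): lru_cache builds its key from the call arguments, so a second call is served from the cache only if it passes the same client object and work group name.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def describe_work_group(client, work_group: str) -> str:
    print("cache miss")  # printed once per distinct (client, work_group)
    return f"{id(client)}:{work_group}"


class FakeClient:
    """Stand-in for a boto3 Athena client (hashable by identity)."""


c1, c2 = FakeClient(), FakeClient()
describe_work_group(c1, "primary")  # cache miss
describe_work_group(c1, "primary")  # served from the cache
describe_work_group(c2, "primary")  # new client object -> cache miss
```

If each thread spawned its own client, the cache would fragment per thread, which is what I want to verify.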

@juliansteger-sc
Contributor

> I'm not able to reproduce the throttling now, but I guess it's hard to tell if this is the dbt-athena caching or AWS fixing their regression.

Same for us, but the fix looks reasonable. Thanks for investigating & fixing!

@nicor88
Contributor Author

nicor88 commented Mar 18, 2024

Let's close this issue for now, given that either AWS or the caching behaviour mitigated the problem.

@nicor88 nicor88 closed this as completed Mar 18, 2024