[CHORE] Add TPC-H questions 11-22 to benchmarks (currently skipped) #2299
Awesome! The only real downside is that our CI now takes 14 minutes, which is dangerously close to timing out...
Is it possible to cache the build and run each question in its own action?
Update from discussion: will look into questions 17, 20, and 22 to see if there is a simple way to improve their speed before merging.
benchmarking/tpch/answers.py
Outdated
res_1.where(col("C_ACCTBAL") > 0).agg(col("C_ACCTBAL").mean().alias("avg_acctbal")).with_column("lit", lit(1))
)

res_3 = orders.select("O_CUSTKEY").distinct()
Discussed offline, but we can likely remove these `distinct` calls, since we are doing a NOT EXISTS and don't care about duplicate values.
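To illustrate why the deduplication is likely redundant here, below is a minimal pure-Python sketch of NOT EXISTS (anti-join) semantics. It is not the code from `answers.py` and does not use the dataframe library's API; the `anti_join` helper and the sample rows are hypothetical, chosen only to show that the result does not depend on whether the right-hand keys contain duplicates.

```python
def anti_join(left_rows, right_keys, key):
    """Keep left rows whose key never appears in right_keys (NOT EXISTS semantics)."""
    # Only membership matters, so duplicate keys on the right collapse here.
    seen = set(right_keys)
    return [row for row in left_rows if row[key] not in seen]


# Hypothetical sample data: customers 1 and 3 have orders, customer 2 has none.
customers = [
    {"C_CUSTKEY": 1, "C_ACCTBAL": 100.0},
    {"C_CUSTKEY": 2, "C_ACCTBAL": 250.0},
    {"C_CUSTKEY": 3, "C_ACCTBAL": 75.0},
]
order_custkeys = [1, 1, 3, 3, 3]  # O_CUSTKEY values, with duplicates

# Deduplicating first (the equivalent of .distinct()) ...
with_distinct = anti_join(customers, sorted(set(order_custkeys)), "C_CUSTKEY")
# ... gives the same rows as passing the keys through unchanged.
without_distinct = anti_join(customers, order_custkeys, "C_CUSTKEY")
```

Whether dropping `distinct` actually speeds up the query depends on how the engine executes the anti-join, so this only shows that the two forms return the same rows.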
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2299      +/-   ##
==========================================
+ Coverage   78.47%   79.05%   +0.58%
==========================================
  Files         473      473
  Lines       56045    55150     -895
==========================================
- Hits        43981    43600     -381
+ Misses      12064    11550     -514
No description provided.