[CHORE] Add column range stats from read_sql #2015

Merged Mar 27, 2024 (2 commits)

@colin-ho colin-ho commented Mar 14, 2024

Closes #2045

Add min/max column range stats from read_sql when possible.

@github-actions github-actions bot added the chore label Mar 14, 2024
)

bounds = [pa_table.column(i)[0].as_py() for i in range(num_scan_tasks - 1)]
pydict = Table.from_arrow(pa_table).to_pydict()

I added this conversion here to align the dtype with the Daft schema dtype.
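
For readers following along, here is a minimal sketch of how interior partition bounds like the `bounds` list in the snippet above could be turned into per-scan-task min/max range stats. This is not the code from this PR or Daft's API: `range_stats_from_bounds` and `may_contain` are hypothetical helpers, and the sketch assumes the bounds are sorted, that there are `num_scan_tasks - 1` of them, and that the outermost partitions are left unbounded (`None`).

```python
from typing import List, Optional, Tuple

# Hypothetical type: (min, max) for the partition column; None means unknown.
RangeStat = Tuple[Optional[int], Optional[int]]


def range_stats_from_bounds(bounds: List[int], num_scan_tasks: int) -> List[RangeStat]:
    """Build one (min, max) pair per scan task from sorted interior bounds."""
    if len(bounds) != num_scan_tasks - 1:
        raise ValueError("expected num_scan_tasks - 1 interior bounds")
    # Pad with None so the first and last partitions are open-ended.
    edges: List[Optional[int]] = [None, *bounds, None]
    return [(edges[i], edges[i + 1]) for i in range(num_scan_tasks)]


def may_contain(stat: RangeStat, value: int) -> bool:
    """Return False only when the stats prove the value cannot be in the partition."""
    lo, hi = stat
    return (lo is None or value >= lo) and (hi is None or value <= hi)


stats = range_stats_from_bounds([10, 20, 30], num_scan_tasks=4)
# Indices of scan tasks that would still need to be read for a filter `col == 25`.
candidates = [i for i, s in enumerate(stats) if may_contain(s, 25)]
```

With stats like these attached to each scan task, a filter on the partition column can skip tasks whose range cannot match, which is the usual motivation for populating them.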

@colin-ho colin-ho requested a review from samster25 March 14, 2024 22:39

codecov bot commented Mar 14, 2024

Codecov Report

Attention: Patch coverage is 5.88235%; 16 lines in your changes are missing coverage. Please review.

Project coverage is 82.67%. Comparing base (578944f) to head (8788d12).

@@            Coverage Diff             @@
##             main    #2015      +/-   ##
==========================================
- Coverage   82.70%   82.67%   -0.04%     
==========================================
  Files          62       62              
  Lines        6615     6619       +4     
==========================================
+ Hits         5471     5472       +1     
- Misses       1144     1147       +3     
| Files | Coverage Δ |
|---|---|
| daft/sql/sql_scan.py | 30.00% <5.88%> (-0.19%) ⬇️ |

@colin-ho colin-ho merged commit f8b6610 into main Mar 27, 2024
30 of 31 checks passed
@colin-ho colin-ho deleted the colin/read_sql_stats branch March 27, 2024 15:16
Successfully merging this pull request may close these issues.

Populate column stats from read_sql