[FEAT] [New Executor] [2/N] daft-execution crate + proof-of-concept compute ops and partition reference + metadata model for new executor. #2340
Conversation
Force-pushed from 456ecfe to fc4cf7a
rayon = {workspace = true}
snafu = {workspace = true}
sysinfo = {workspace = true}
tokio = {workspace = true}
Some of these dependencies will be used in future PRs in the stack; I already double-checked that only those required by the last PR in the stack are included here.
Codecov Report — Attention: Patch coverage is

Additional details and impacted files:

@@ Coverage Diff @@
##           main    #2340   +/- ##
===================================
  Coverage      ?   78.47%
===================================
  Files         ?      487
  Lines         ?    56169
  Branches      ?        0
===================================
  Hits          ?    44081
  Misses        ?    12088
  Partials      ?        0
    input_meta: &[PartitionMetadata],
) -> ResourceRequest {
    self.resource_request
        .or_memory_bytes(input_meta.iter().map(|m| m.size_bytes).sum())
This should eventually take the max of the heap memory estimate for all fused ops in the chain, using max output size estimates from previous ops in the chain. This would require looping in the approximate stats estimate logic that's currently tied to the physical plan, which we previously talked about factoring out.
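The refinement described above could be sketched as follows. This is a minimal, hypothetical model (the `OpEstimate` type and its `inflation` multiplier are assumptions, not the daft-execution API): each op's estimated output size feeds the next op, and the chain's memory requirement is the max over per-op estimates rather than the sum of input sizes.

```rust
// Hedged sketch of max-over-chain memory estimation; hypothetical types,
// not the actual daft-execution API.
struct PartitionMetadata {
    size_bytes: usize,
}

// Assumed per-op stats model: estimated output size as a multiplier on input size.
struct OpEstimate {
    inflation: f64,
}

/// Estimate heap memory for a fused chain: thread each op's estimated output
/// size into the next op, and take the max across the chain.
fn chain_memory_estimate(input: &PartitionMetadata, chain: &[OpEstimate]) -> usize {
    let mut size = input.size_bytes as f64;
    let mut max_heap = size;
    for op in chain {
        size *= op.inflation; // estimated output size of this op
        max_heap = max_heap.max(size);
    }
    max_heap.ceil() as usize
}

fn main() {
    let input = PartitionMetadata { size_bytes: 1000 };
    // An op that doubles the data, then one that halves it: peak is mid-chain,
    // which a sum-of-inputs estimate would not capture.
    let chain = [OpEstimate { inflation: 2.0 }, OpEstimate { inflation: 0.5 }];
    println!("{}", chain_memory_estimate(&input, &chain)); // peak estimate: 2000
}
```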
#[derive(Debug)]
pub struct FusedOpBuilder<T> {
    // Task op at the front of the chain.
    source_op: Arc<dyn PartitionTaskOp<Input = T>>,
how does this look if we have some of these cases: BroadcastJoin(ScanOP, ScanOP) or Concat(ScanOp, Micropartition)?
Discussed offline: the current behavior in the status quo execution model is to materialize scans before BroadcastJoin and the like, so tabling this as a post-proof-of-concept optimization.
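The single-input fusion being discussed can be illustrated with a toy sketch (the `TaskOp` trait and concrete ops here are hypothetical stand-ins, not the real `PartitionTaskOp` API): a source op plus a chain of fused ops runs back-to-back on one partition, while multi-input ops like broadcast joins would instead materialize their inputs first, per the discussion above.

```rust
// Hedged sketch of single-input op fusion; hypothetical trait and ops.
trait TaskOp {
    fn execute(&self, input: Vec<i64>) -> Vec<i64>;
}

struct FilterPositive;
impl TaskOp for FilterPositive {
    fn execute(&self, input: Vec<i64>) -> Vec<i64> {
        input.into_iter().filter(|x| *x > 0).collect()
    }
}

struct Double;
impl TaskOp for Double {
    fn execute(&self, input: Vec<i64>) -> Vec<i64> {
        input.into_iter().map(|x| x * 2).collect()
    }
}

/// Fused chain: a source op followed by fused ops, executed back-to-back on
/// the same partition without materializing intermediates between tasks.
struct FusedOp {
    source_op: Box<dyn TaskOp>,
    fused_ops: Vec<Box<dyn TaskOp>>,
}

impl FusedOp {
    fn execute(&self, input: Vec<i64>) -> Vec<i64> {
        let mut out = self.source_op.execute(input);
        for op in &self.fused_ops {
            out = op.execute(out);
        }
        out
    }
}

fn main() {
    let fused = FusedOp {
        source_op: Box::new(FilterPositive),
        fused_ops: vec![Box::new(Double)],
    };
    println!("{:?}", fused.execute(vec![-1, 2, 3])); // filter then double
}
```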
    input_meta: &[PartitionMetadata],
) -> PartitionMetadata {
    assert_eq!(input_meta.len(), 1);
    let input_meta = &input_meta[0];
we should also be updating the number of bytes
For sure, I'll fix that! Note that this is currently unused - the partial metadata machinery is still a TODO for all ops and the surrounding execution model, since no exchange or sink ops have needed it yet.
Hmm, our current behavior is to set size_bytes to None, since we don't know the exact new size in bytes until we actually apply the limit. E.g. you could remove 99% of the rows, but 99% of the size in bytes could be in that remaining 1% of rows if those rows are particularly large.

Daft/daft/execution/execution_step.py, line 552 in 021b103:

size_bytes=None,
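The point above can be made concrete with a toy example (plain byte-vector rows, not Daft's MicroPartition types): after a limit, the row count is known exactly, but a proportional size_bytes estimate can be off by orders of magnitude when row sizes are skewed.

```rust
// Hedged illustration of why size_bytes is unknowable after a limit:
// byte size is not uniform per row, so scaling total bytes by the kept
// row fraction can be wildly wrong.
struct Row {
    payload: Vec<u8>,
}

fn total_size_bytes(rows: &[Row]) -> usize {
    rows.iter().map(|r| r.payload.len()).sum()
}

fn main() {
    // 99 tiny 1-byte rows plus one huge row: 10_000 bytes over 100 rows.
    let mut rows: Vec<Row> = (0..99).map(|_| Row { payload: vec![0; 1] }).collect();
    rows.push(Row { payload: vec![0; 9901] });
    let total = total_size_bytes(&rows);

    // limit(1) keeps 1% of the rows...
    let limited = &rows[..1];

    // ...but a proportional estimate (1% of total bytes) is 100x off,
    // because the kept row happens to be tiny.
    let proportional_estimate = total / 100;
    let actual = total_size_bytes(limited);
    println!("estimate: {}, actual: {}", proportional_estimate, actual);
}
```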
assert_eq!(inputs.len(), 1);
let input = inputs.into_iter().next().unwrap();
let out = input.add_monotonically_increasing_id(
    self.num_partitions.load(Ordering::SeqCst) as u64,
shouldn't we be using fetch_add() here instead of relying on with_input_metadata being called?
with_input_metadata() is guaranteed to be called within the scheduler right before submission, while execute() is called at task execution time, potentially on a different machine if using a distributed executor, which wouldn't update the partition counter for other to-be-executed tasks. For that reason, we should ensure such state is mutated within the scheduler before submission.
Force-pushed from 378ddd0 to 160b172
This PR adds the daft-execution subcrate, containing a set of proof-of-concept local compute ops and the partition reference + metadata model for the new executor. Partial metadata machinery isn't yet implemented, since no task scheduler or exchange op has required it yet.

TODOs