fix: extract miner sector state changes #1032

frrist · 2022-08-13T01:46:44Z

This PR fixes #1031

The previous implementation diffed previous and current partitions directly. This was incorrect since miners
may compact their partitions resulting in sectors moving to different partitions, which the previous code proceeded to record as invalid events. Instead, we now collect all sectors from all miner partitions and diff the sets directly.

- previous implementation diffed previous and current partitions, this was incorrect since miners may compact their partitions resulting in sectors moving to different partitions. Instead we collect all sectors from all miner partitions and diff the sets directly.

codecov-commenter · 2022-08-13T01:52:56Z

Codecov Report

Merging #1032 (5881258) into master (2a5df7c) will decrease coverage by 1.1%.
The diff coverage is 31.6%.

@@           Coverage Diff            @@
##           master   #1032     +/-   ##
========================================
- Coverage    35.6%   34.4%   -1.2%     
========================================
  Files          44      44             
  Lines        2881    2925     +44     
========================================
- Hits         1027    1008     -19     
- Misses       1750    1821     +71     
+ Partials      104      96      -8

placer14

This seems good to me.

A thought: It would probably be easy to write a query which proves the invariant isn't in our exports which we could periodically run over our exported data.

Can you outline/include how this data bug was detected originally? I can see how we can run tests like these in CI. (Ideas @kasteph?)

Edit: Would the check be as simple as taking all Termination events and making sure that these sectors never have additional events after the Termination occurs?

placer14 · 2022-08-15T13:27:01Z

tasks/actorstate/miner/sector_events.go

-	faulted := bitfield.New()
-	recovered := bitfield.New()
-	recovering := bitfield.New()
+// SectorStates contains a set of bitfields for active, live, fault, and recovering sectors.


Would be good to mention how this bitfield represents each type of sector represented here. (a packed binary list of sector IDs?)

placer14 · 2022-08-15T13:36:19Z

tasks/actorstate/miner/sector_events.go

-	return out, nil
+
+	// previous faulty sectors minus current active sectors are sectors recovered this epoch.
+	recovered, err := bitfield.IntersectBitField(previous.Faulty, current.Active)


The comment and code don't match. I assume we only want to see which previous faults overlap with current actives... this makes sense as being "Recovered". Let's update this comment.

// previous faulty sectors which match (intersect) active sectors are sectors recovered this epoch.

Whoops, bad copy paste

frrist · 2022-08-16T00:09:35Z

A thought: It would probably be easy to write a query which proves the invariant isn't in our exports which we could periodically run over our exported data.

@davidgasquez is working on this as a part of the dbt workflow.

Edit: Would the check be as simple as taking all Termination events and making sure that these sectors never have additional events after the Termination occurs?

That would be part of the check, but there are more cases, for example - sectors cannot recover before becoming faulted, sectors cannot terminate before being added, faulted sectors cannot become faulty, etc.

A sector lifecycle is a state machine, so we'll want to validate all sector events correspond to valid states.

placer14 · 2022-08-22T14:12:48Z

My approval is sticky. Good catch on accidental context sharing. 🙇

frrist self-assigned this Aug 13, 2022

frrist requested review from placer14, davidgasquez and kasteph August 13, 2022 01:47

frrist added 2 commits August 12, 2022 19:02

fixup: comment code

fb808ec

fixup: lint

c8f7f64

frrist mentioned this pull request Aug 13, 2022

Fault/recover event still happen after a termination event #1031

Closed

placer14 approved these changes Aug 15, 2022

View reviewed changes

fix: code comment

506e0cf

fix: use group context for parallel extraction

5881258

frrist merged commit 45f15be into master Aug 23, 2022

frrist deleted the frrist/fix-1031 branch August 23, 2022 00:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: extract miner sector state changes #1032

fix: extract miner sector state changes #1032

frrist commented Aug 13, 2022 •

edited

Loading

codecov-commenter commented Aug 13, 2022 •

edited

Loading

placer14 left a comment •

edited

Loading

placer14 Aug 15, 2022

placer14 Aug 15, 2022

frrist Aug 15, 2022

frrist commented Aug 16, 2022

placer14 commented Aug 22, 2022

fix: extract miner sector state changes #1032

fix: extract miner sector state changes #1032

Conversation

frrist commented Aug 13, 2022 • edited Loading

codecov-commenter commented Aug 13, 2022 • edited Loading

Codecov Report

placer14 left a comment • edited Loading

Choose a reason for hiding this comment

placer14 Aug 15, 2022

Choose a reason for hiding this comment

placer14 Aug 15, 2022

Choose a reason for hiding this comment

frrist Aug 15, 2022

Choose a reason for hiding this comment

frrist commented Aug 16, 2022

placer14 commented Aug 22, 2022

frrist commented Aug 13, 2022 •

edited

Loading

codecov-commenter commented Aug 13, 2022 •

edited

Loading

placer14 left a comment •

edited

Loading