Generalized parallel processing logic #148

Baliedge · 2023-07-27T14:35:39Z

Convert specialized parallel processing logic to generalized "translate" functions.
This is needed to prepare for significant datamodel changes supporting consistent ordering (#119) as the changes needed will refactor the same areas of code.
In several places, parallel processing has been implemented to improve performance. However, the logic is specialized and is prone to concurrency flaws without proper test coverage. Using a set of generalized functions will decrease the chance of introducing bugs when refactoring for consistent ordering. e.g. Want to be able to change the data type and container iteration logic without breaking channel logic.
Where possible, these translation functions will preserve input order.

Functions:

TranslateSliceParallel(): Iterates a slice, passes values through a translate function in parallel, then sends results sequentially in stable order to result function.
TranslateMapParallel(): Iterates a map, passes value pairs through a translate function in parallel, then sends results sequentially to result function. Ordering is nondeterministic due to nature of map built-in.
TranslatePipeline(): Reads an input channel, passes values through a translate function in parallel, then sends results sequentially to output channel in stable order.

daveshanley

This is a great design! It standardizes async operations and keeps the speed, and introduces reliable ordering.

daveshanley

I've reviewed all the code, and I can't find anything wrong with it. It's better code than existed before. It's standardized async processing now vs. my custom code for each object type. It's easier to understand and easier to debug if needed.

This is great.

codecov · 2023-08-02T12:37:48Z

Codecov Report

Patch coverage: 99.77% and no project coverage change.

Comparison is base (3c9415b) 99.80% compared to head (f95f6b4) 99.80%.

Additional details and impacted files

@@           Coverage Diff            @@
##             main     #148    +/-   ##
========================================
  Coverage   99.80%   99.80%            
========================================
  Files         148      149     +1     
  Lines       10673    10824   +151     
========================================
+ Hits        10652    10803   +151     
  Misses         21       21

Flag	Coverage Δ
unittests	`99.80% <99.77%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files Changed	Coverage Δ
datamodel/low/v2/path_item.go	`100.00% <ø> (ø)`
datamodel/low/v3/paths.go	`98.90% <98.21%> (+0.15%)`	⬆️
datamodel/high/v2/path_item.go	`100.00% <100.00%> (ø)`
datamodel/high/v2/paths.go	`100.00% <100.00%> (ø)`
datamodel/high/v3/components.go	`100.00% <100.00%> (ø)`
datamodel/high/v3/paths.go	`100.00% <100.00%> (ø)`
datamodel/high/v3/responses.go	`100.00% <100.00%> (ø)`
datamodel/low/v2/paths.go	`100.00% <100.00%> (ø)`
datamodel/low/v3/components.go	`100.00% <100.00%> (ø)`
datamodel/low/v3/path_item.go	`98.52% <100.00%> (-0.12%)`	⬇️
... and 1 more

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Baliedge · 2023-08-02T15:42:43Z

@daveshanley Ready for GHA re-run.

daveshanley · 2023-08-08T18:06:46Z

There seems to be a flaky bit of code causing the tests to crash. It's caused by: TestPaths_Build_FailRefDeadEnd

https://github.com/pb33f/libopenapi/blob/main/datamodel/low/v3/paths_test.go#L122

It's crashing because circular detection is not being picked up and the stack is overflowing.

https://github.com/pb33f/libopenapi/blob/main/datamodel/low/extraction_functions.go#L79

Suggestion to fix is find out why the loop is stacking up out of control / put in a hard limit on the depth extractions can go, just as a hard limit on a runaway mine train. Anything more than 100 levels deep is way out of control anyway.

Baliedge · 2023-08-24T20:52:28Z

@daveshanley FYI, been stuck on other projects lately. Maybe able to find time next week to resolve these test coverage issues.

daveshanley · 2023-08-25T11:38:32Z

@daveshanley FYI, been stuck on other projects lately. Maybe able to find time next week to resolve these test coverage issues.

I hear you man, no problem at all. Quality takes time.

daveshanley · 2023-08-28T16:47:01Z

Some changes in my latest PR #162 should fix some of the issues causing the random failures.

daveshanley · 2023-08-30T16:54:45Z

Just needs a tiny bit more coverage. Gotta keep it high.

daveshanley · 2023-09-16T15:38:40Z

I am OK with the coverage, this looks ready to merge. Are you ready?

daveshanley · 2023-09-19T14:17:22Z

@Baliedge is this complete?

Baliedge · 2023-09-21T16:02:32Z

@Baliedge is this complete?

Yes, it's functionally complete. The codecov isn't passing for small portions of uncovered error handlers. I haven't had the opportunity to address that yet.

Baliedge · 2023-09-22T19:40:40Z

@daveshanley Test coverage should now be passing. Please re-run the PR checks.

…ize parallel processing of slices and channels in stable order.

Fix goroutine resource leak in `datamodel/low/v3/path_item.go`.

…ator. Integrate `TranslateMapParallel()` into datamodel for `Paths` to replace specialized async logic.

Integrate `TranslatePipeline()` into datamodel for schema components to replace specialized async logic.

daveshanley · 2023-10-05T13:25:54Z

Which is less work to fix in a conflict, merging in #180 first, or this PR?

daveshanley · 2023-10-05T13:30:21Z

Merging this first.

Baliedge changed the title ~~Generalized parallel processing logic~~ WIP: Generalized parallel processing logic Jul 27, 2023

Baliedge mentioned this pull request Jul 17, 2023

WIP: Consistent ordering of documentation #138

Merged

Baliedge force-pushed the Baliedge/PIP-2552-consistent-ordering-3 branch from f727a94 to 9bff565 Compare July 27, 2023 14:43

daveshanley reviewed Jul 28, 2023

View reviewed changes

Baliedge marked this pull request as ready for review August 1, 2023 19:12

Baliedge changed the title ~~WIP: Generalized parallel processing logic~~ Generalized parallel processing logic Aug 1, 2023

Baliedge force-pushed the Baliedge/PIP-2552-consistent-ordering-3 branch 3 times, most recently from 53e3e31 to 8f710b2 Compare August 1, 2023 19:53

daveshanley approved these changes Aug 2, 2023

View reviewed changes

daveshanley mentioned this pull request Aug 6, 2023

Provide datastructures that retain the original order of items in the document #119

Closed

daveshanley mentioned this pull request Aug 21, 2023

Incorrectly Failing Validation for Circular Reference (Array) #130

Closed

daveshanley mentioned this pull request Aug 28, 2023

Data Race in Compare #156

Closed

Baliedge force-pushed the Baliedge/PIP-2552-consistent-ordering-3 branch from 4960be8 to df9f819 Compare August 29, 2023 18:13

Baliedge force-pushed the Baliedge/PIP-2552-consistent-ordering-3 branch from 21bf650 to 6aed734 Compare September 5, 2023 13:56

Baliedge added 4 commits September 22, 2023 17:49

Implement TranslateSliceParallel and TranslatePipeline to general…

3af1280

…ize parallel processing of slices and channels in stable order.

Refactor v3 Paths to parse YAML using TranslatePipeline.

5918b8a

Fix goroutine resource leak in `datamodel/low/v3/path_item.go`.

Implement TranslateMapParallel() as generalized concurrent map iter…

0c3137a

…ator. Integrate `TranslateMapParallel()` into datamodel for `Paths` to replace specialized async logic.

Fix lint errors.

906de14

Baliedge added 10 commits September 22, 2023 17:49

Fix unit test.

8c78208

Implement TranslatePipeline() as generalized concurrent map iterator.

a8cf9fd

Integrate `TranslatePipeline()` into datamodel for schema components to replace specialized async logic.

Refactor v2 Paths to parse YAML using TranslatePipeline.

f54d79d

Tidy code.

3512e70

Fix compatibility issue with Go 1.19.

2321d26

Tidy code.

a9a2875

Improve coverage. Simplify error handling.

8b6ea5a

Tidy code.

a004a17

Fix tests.

8bfa8c6

Improve test coverage.

f95f6b4

Baliedge force-pushed the Baliedge/PIP-2552-consistent-ordering-3 branch from edb5a35 to f95f6b4 Compare September 22, 2023 21:49

daveshanley mentioned this pull request Oct 5, 2023

fix!: fixed handling of additionalProperties to handle the bool/json-schema nature better #180

Merged

daveshanley merged commit 8b90e9a into pb33f:main Oct 5, 2023
2 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalized parallel processing logic #148

Generalized parallel processing logic #148

Baliedge commented Jul 27, 2023 •

edited

Loading

daveshanley left a comment

daveshanley left a comment

codecov bot commented Aug 2, 2023 •

edited

Loading

Baliedge commented Aug 2, 2023

daveshanley commented Aug 8, 2023

Baliedge commented Aug 24, 2023

daveshanley commented Aug 25, 2023

daveshanley commented Aug 28, 2023

daveshanley commented Aug 30, 2023

daveshanley commented Sep 16, 2023

daveshanley commented Sep 19, 2023

Baliedge commented Sep 21, 2023

Baliedge commented Sep 22, 2023

daveshanley commented Oct 5, 2023

daveshanley commented Oct 5, 2023

Generalized parallel processing logic #148

Generalized parallel processing logic #148

Conversation

Baliedge commented Jul 27, 2023 • edited Loading

daveshanley left a comment

Choose a reason for hiding this comment

daveshanley left a comment

Choose a reason for hiding this comment

codecov bot commented Aug 2, 2023 • edited Loading

Codecov Report

Baliedge commented Aug 2, 2023

daveshanley commented Aug 8, 2023

Baliedge commented Aug 24, 2023

daveshanley commented Aug 25, 2023

daveshanley commented Aug 28, 2023

daveshanley commented Aug 30, 2023

daveshanley commented Sep 16, 2023

daveshanley commented Sep 19, 2023

Baliedge commented Sep 21, 2023

Baliedge commented Sep 22, 2023

daveshanley commented Oct 5, 2023

daveshanley commented Oct 5, 2023

Baliedge commented Jul 27, 2023 •

edited

Loading

codecov bot commented Aug 2, 2023 •

edited

Loading