[test_runner] Unify wait routines #122

marcoguerri · 2020-06-16T09:21:10Z

The pipeline code was affected by some duplication in the shutdown
control path. This patch removes the duplication, and introduces
only two wait routines in the shutdown control path: waitTargets
and waitControl.

The logic of the test runner is not modular enough to accommodate the need for a cleanup pipeline. This revision refactors the test runner so that the concept of a pipeline can be reused. The eventual goal is to deploy two pipelines, one for testing and one for cleanup.

This revision further simplifies the routing logic and removes the pipeline's dependency on the list of targets. The latter is important for implementing pipelines whose number of incoming targets varies and cannot be predicted beforehand.

After extracting the pipeline logic, the routing logic was still part of the pipeline and very monolithic. This patch extracts the routing logic into a dedicated structure and splits it into routeIn logic and routeOut logic, which makes it unit-testable.
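A minimal sketch of what such a split could look like follows. The `router` struct, its channel fields, and the `feed` helper are hypothetical and simplified; the real routing block in this patch does more (for example, error and timeout handling). The point is only that each direction can be exercised on its own, and that neither loop needs to know the number of targets in advance.

```go
package main

import "fmt"

// Target is a hypothetical stand-in for a ConTest target.
type Target struct{ ID string }

// router is a hypothetical routing block wrapped around one step.
type router struct {
	fromPrev chan *Target // targets arriving from the previous block
	stepIn   chan *Target // input of the wrapped step
	stepOut  chan *Target // output of the wrapped step
	toNext   chan *Target // targets leaving for the next block
}

// routeIn forwards targets into the step until fromPrev is closed and
// returns how many targets it forwarded.
func (r *router) routeIn() int {
	defer close(r.stepIn)
	n := 0
	for t := range r.fromPrev {
		r.stepIn <- t
		n++
	}
	return n
}

// routeOut forwards targets out of the step until stepOut is closed
// and returns how many targets it forwarded.
func (r *router) routeOut() int {
	defer close(r.toNext)
	n := 0
	for t := range r.stepOut {
		r.toNext <- t
		n++
	}
	return n
}

// feed pushes one target per ID on ch and then closes it.
func feed(ch chan<- *Target, ids ...string) {
	defer close(ch)
	for _, id := range ids {
		ch <- &Target{ID: id}
	}
}

func main() {
	r := &router{
		fromPrev: make(chan *Target),
		// Buffered so routeIn can run without a step attached.
		stepIn: make(chan *Target, 2),
	}
	go feed(r.fromPrev, "host1", "host2")
	fmt.Println("routed in:", r.routeIn())
}
```

Because `routeIn` and `routeOut` only touch their own pair of channels, a unit test can drive one side with a fake producer and inspect the other side directly.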

Pipeline and routing logic are now split into different files. Unit tests for the routing logic have been added.

This patch adds support for a cleanup pipeline, which runs cleanup steps after the main test pipeline. One of the requirements of ConTest has always been that it should support a "cleanup" phase. The cleanup pipeline is described in the same way test steps have been described so far: an additional "cleanup" section in the test descriptor is deserialized to implement a pipeline which runs the required cleanup steps.
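As a rough illustration of a descriptor carrying a separate cleanup section, consider the sketch below. The JSON keys (`steps`, `cleanup`, `name`, `label`) and the types `StepDescriptor` and `TestDescriptor` are assumptions made for this example; the actual ConTest descriptor schema is richer and may use different names.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// StepDescriptor is a hypothetical, simplified step description.
type StepDescriptor struct {
	Name  string `json:"name"`
	Label string `json:"label"`
}

// TestDescriptor is hypothetical. Only the idea of a cleanup section
// that mirrors the test steps section comes from the description above.
type TestDescriptor struct {
	Steps   []StepDescriptor `json:"steps"`
	Cleanup []StepDescriptor `json:"cleanup"`
}

// parseDescriptor deserializes both sections from the same document.
func parseDescriptor(data []byte) (*TestDescriptor, error) {
	var d TestDescriptor
	if err := json.Unmarshal(data, &d); err != nil {
		return nil, fmt.Errorf("could not parse test descriptor: %w", err)
	}
	return &d, nil
}

func main() {
	raw := []byte(`{
		"steps":   [{"name": "echo", "label": "say hello"}],
		"cleanup": [{"name": "echo", "label": "tidy up"}]
	}`)
	d, err := parseDescriptor(raw)
	if err != nil {
		panic(err)
	}
	fmt.Printf("test steps: %d, cleanup steps: %d\n", len(d.Steps), len(d.Cleanup))
}
```

Since both sections share one step type, the same pipeline construction code can be applied to either list.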

The TestStep interface has so far dictated that targets be forwarded to two different channels, one for success and one for failure. This makes several parts of the ConTest logic relatively complicated. To simplify the framework, this breaking change reduces the number of output channels that a test step needs to handle to just one. The object returned by the step might include error information for targets which failed in the test step.

codecov · 2020-06-16T09:24:48Z

Codecov Report

Merging #122 (87f1507) into master (cc4a4dc) will increase coverage by 0.98%.
The diff coverage is 71.34%.

@@            Coverage Diff             @@
##           master     #122      +/-   ##
==========================================
+ Coverage   63.14%   64.12%   +0.98%     
==========================================
  Files          51       58       +7     
  Lines        2846     3097     +251     
==========================================
+ Hits         1797     1986     +189     
- Misses        813      865      +52     
- Partials      236      246      +10

Flag	Coverage Δ
integration	`62.95% <71.12%> (+1.09%)`	⬆️
unittests	`20.29% <16.38%> (+4.07%)`	⬆️

Flags with carried forward coverage won't be shown.

Impacted Files	Coverage Δ
pkg/cerrors/cerrors.go	`0.00% <0.00%> (-66.67%)`	⬇️
pkg/jobmanager/jobmanager.go	`66.95% <ø> (+6.14%)`	⬆️
pkg/jobmanager/status.go	`0.00% <0.00%> (ø)`
pkg/pluginregistry/errors.go	`0.00% <0.00%> (-100.00%)`	⬇️
pkg/target/target.go	`100.00% <ø> (ø)`
plugins/targetlocker/inmemory/inmemory.go	`66.32% <ø> (ø)`
plugins/targetlocker/noop/noop.go	`78.57% <ø> (ø)`
plugins/teststeps/echo/echo.go	`0.00% <0.00%> (ø)`
pkg/runner/job_runner.go	`48.12% <20.00%> (ø)`
pkg/job/job.go	`56.52% <23.07%> (-43.48%)`	⬇️
... and 33 more

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update cc4a4dc...c53e426.

xaionaro

I'll continue later.

xaionaro · 2020-07-30T14:20:23Z

cmds/contest/hosts02.csv

-storage2345,10.10.10.123
-machinelearning1234,192.168.123.231
-uselesshost10,10.0.0.10


out of curiosity:

Why have you removed these lines? :)

xaionaro · 2020-07-30T14:21:14Z

cmds/clients/contestcli-http/start.json

-        },
-        {
-            "TargetManagerName": "CSVFileTargetManager",
-            "TargetManagerAcquireParameters": {
-                "FileURI": "hosts02.csv",
-                "MinNumberDevices": 2,
-                "MaxNumberDevices": 4,
-                "HostPrefixes": [
-                ]
-            },
-            "TargetManagerReleaseParameters": {
-            },
-            "TestFetcherName": "URI",
-            "TestFetcherFetchParameters": {
-                "TestName": "RackProvisioning",
-                "URI": "test_samples/randecho.json"
-            }


out of curiosity:

Why have you removed these lines? :)

xaionaro · 2020-07-30T14:23:57Z

cmds/contest/test_samples/randecho.json

extreme nitpicking:

An extra space.

xaionaro · 2020-07-30T14:33:53Z

pkg/runner/test_runner_pipeline.go

-// error. Termination is signalled via terminate channel.
-func (p *pipeline) waitTargets(terminate <-chan struct{}, completedCh chan<- *target.Target) error {
+// emitStepEvent emits a failure event if a step fail
+func (p *pipeline) emitStepEvent(result *stepResult) error {


The name suggests something different from what the function does: I read it as emitting an event on either success or failure, while it actually emits an event only on failure. BTW, why do we not send successes? To reduce the number of rows in the DB?

xaionaro · 2020-07-30T14:35:07Z

pkg/runner/test_runner_pipeline.go

-		completedTargetError error
-	)
+// waitControl reads results coming from result channels (for steps and routing blocks)
+// until a timeout occurrs. The error handling is different depending on whether


nitpicking:

A typo: occurrs → occurs.

marcoguerri · 2021-01-04T16:11:32Z

This was supposed to be closed for the reason mentioned in #207

marcoguerri added 7 commits June 16, 2020 09:08

[test_runner] Split pipeline and router in different files

b37ced4

Pipeline and routing logic are now split into different files. Unit tests for the routing logic have been added.

[test_runner] Prepare support for cleanup steps

2db4e31

facebook-github-bot added the CLA Signed label Jun 16, 2020

[test_runner] Unify wait routines

87f1507

The pipeline code was affected by some duplication in the shutdown control path. This patch removes the duplication, and introduces only two wait routines in the shutdown control path: waitTargets and waitControl.

marcoguerri mentioned this pull request Jun 17, 2020

[test_runner] Simplify routing logic, reduce pipeline's dependencies #107

Merged

marcoguerri mentioned this pull request Jul 28, 2020

Remove AddField logic added in #107, leverage logrus logic #127

Closed

xaionaro reviewed Jul 30, 2020

[test_runner] Make routing blocks responsible for waiting for targets

c53e426

Further simplification in the test runner. Routing blocks can become responsible for waiting for targets directly, so that the pipeline itself can just wait on routing blocks and their results.

marcoguerri mentioned this pull request Dec 17, 2020

Minor refactor stepRouter routeIin #207

Closed

marcoguerri closed this Jan 4, 2021
