[service/extensions] extension lifecycle order #8733

atoulme · 2023-10-24T03:53:56Z

Description:
Enforce order of start and shutdown of extensions according to configuration

Link to tracking Issue:
Fixes issue #8732

Testing:
Unit tests

yurishkuro

Thanks!

…s according to configuration

codecov · 2023-10-24T04:52:03Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Files	Coverage Δ
service/extensions/extensions.go	`83.89% <100.00%> (+1.17%)`	⬆️

... and 6 files with indirect coverage changes

📢 Thoughts on this report? Let us know!.

dmitryax

Just to hold off until discussion in #8732 is resolved

atoulme · 2023-10-25T05:52:32Z

I received a comment that I should stop extensions in reverse order that they are started, I'll push that. After that, we can discuss the merits of this PR.

yurishkuro · 2023-10-27T17:28:00Z

Another implementation option discussed on the issue is to build support for extension dependencies in the graph/ module. Then an extension can programmatically declare a dependency on another extension and the graph would guarantee correct execution order. This would be a larger change, but preferable over explicit config order where users need to worry about listing extensions in the right order.

dmitryax · 2023-10-27T21:00:06Z

Another implementation option discussed on the issue is to build support for extension dependencies in the graph/ module. Then an extension can programmatically declare a dependency on another extension and the graph would guarantee correct execution order. This would be a larger change, but preferable over explicit config order where users need to worry about listing extensions in the right order.

Makes sense, but it would require another predefined way to specify dependencies on the extensions in the configuration interface. The current dependencies topology is constructed directly from the pipeline YAML definition. Introducing an additional source for building the graph seems like an excessive complication of the user interface. And some extensions naturally do not provide any capabilities to other components, only to the end-user. I think it's ok to keep it simple like implemented in this PR going forward.

yurishkuro · 2023-10-27T21:31:43Z

Introducing an additional source for building the graph seems like an excessive complication of the user interface.

Don't get me wrong, I am happy with this PR. But when we talk about complicating user interface, I think it's important to remember which users are exposed to such interface. With dependencies expressed programmatically, a very small number of users who write their own extensions are affected. With dependencies expressed via config order, all users of the collector are exposed to the notion that there are dependencies between extensions, even though there's very little they can actually do about it. E.g. jaeger_query extension simply will never work if it's placed ahead of jaeger_storage in the configuration. So in my documentation I will have to mention "always put it last", even though users didn't really need to be exposed to this accidental complexity.

dmitryax · 2023-10-27T22:09:46Z

@yurishkuro, how do you envision a programmatic way to define the dependencies? Something like an optional interface for extensions to define a dependency on another extension interface? I believe that doesn't require integration with the pipeline graph. We would still start all the extensions before the pipelines.

yurishkuro · 2023-10-27T22:19:27Z

how do you envision a programmatic way to define the dependencies?

I'd add an optional argument dependsOn []component.ID here:

opentelemetry-collector/extension/extension.go

Lines 103 to 107 in 6405e15

    
           func NewFactory( 
        
           	cfgType component.Type, 
        
           	createDefaultConfig component.CreateDefaultConfigFunc, 
        
           	createServiceExtension CreateFunc, 
        
           	sl component.StabilityLevel) Factory {

I'm not sure why these constructor APIs are currently not made extensible by taking args as a struct instead of fixed positional args.

dmitryax · 2023-10-27T22:22:53Z

I'd add an optional argument dependsOn []component.ID here

In that case, you can only define a hardcoded dependency on a particular component. There should be a way to define a dependency on a class of extensions like storage or encoding, which is defined by an interface an extension implements.

yurishkuro · 2023-10-27T22:40:20Z

In that case, you can only define a hardcoded dependency on a particular component. There should be a way to define a dependency on a class of extensions like storage or encoding, which is defined by an interface an extension implements.

I personally would avoid duck-typing dependencies, it's better to have explicit ones by ID. There is only one pool of extensions, so what if I have some components depending on one kind of encoding extension and other components depending on another? When they refer to those by ID there is no ambiguity, but duck-typing only works when there is exactly one instance defined for a given interface.

dmitryax · 2023-10-28T00:32:01Z

Extensions should provide the functionality to components via interfaces like storage. Components dependent on that interface can choose any storage extension even if it's implemented outside of OTel core/contrib repos. Choosing a particular storage implementation is exposed to the user with a config option. Having a hardcoded set of IDs means that custom extensions of particular classes cannot be implemented outside, and every extension dependent on a particular class of extension has to maintain a set of all instances, e.g., each time a new storage/encoding extension is added, all dependent components have to be updated.

There is only one pool of extensions, so what if I have some components depending on one kind of encoding extension and other components depending on another?

They should be two classes of extensions that implement two different (or additional) interfaces.

yurishkuro · 2023-10-28T01:56:05Z

@dmitryax I didn't mean that the dependency has to be hardcoded: for some extensions it may be sufficient, others may need to express their dependencies after reading their own config. That means the dependencies should not be exposed via NewFactory, but as part of CreateExtension constructor. You're right, in this case a runtime interface check is a good solution - here's a prototype #8768 (to be combined with this PR for the actual ordered start/shutdown)

yurishkuro · 2023-10-28T02:00:28Z

service/extensions/extensions.go

@@ -77,7 +82,8 @@ func (bes *Extensions) NotifyPipelineReady() error {
 func (bes *Extensions) NotifyPipelineNotReady() error {
 	// Notify extensions in reverse order.


is this accurate?

Looks like it's not. @atoulme can you please address this?

How would you like me to address this? It feels like this comment needs to be removed.

dmitryax · 2023-10-29T22:14:00Z

@yurishkuro, I think we can merge this one and rebase #8768 on top of it since it won't cause any breaking changes, right?

yurishkuro · 2023-10-30T01:16:43Z

I think we can merge this one and rebase #8768 on top of it since it won't cause any breaking changes, right?

@dmitryax my PR is already based on this one (it includes the original commits). I don't have a strong objection to merging each one separately, but it means the 1st PR will introduce one way of ordering (based purely on config order) and 2nd will override that in favor of the order defined by the DependentExtension interface (with other order still being random). In theory it's possible to support both at the same time: by default start with config-based order as documentation implies, but if DependentExtension interfaces are detected in some extensions move them to the end and apply topological sort (seems a bit complicated). I think just the 2nd approach is cleaner - no order guarantees unless explicitly requested via interface.

dmitryax · 2023-11-01T04:49:23Z

I think just the 2nd approach is cleaner - no order guarantees unless explicitly requested via interface.

Ok, makes sense to me. Especially if that simplifies the implementation

**Description**: Enforce order of start and shutdown of extensions according to their internally declared dependencies **Link to tracking Issue**: Resolves #8732 **Motivation**: This is an alternative approach to #8733 which uses declaration order in the config to start extensions. That approach (a) enforces order when it's not always necessary to enforce, and (b) exposes unnecessary complexity to the user by making them responsible for the order. This PR instead derives the desired order of extensions based on the dependencies they declare by implementing a `DependentExtension` interface. That means that extensions that must depend on others can expose this interface and be guaranteed to start after their dependencies, while other extensions can be started in arbitrary order (same as happens today because of iterating over a map). The extensions that have dependencies have two options to expose them: 1. if the dependency is always static (e.g. `jaeger_query` extension depending on `jaeger_storage` as in the OP), the extension can express this statically as well, by returning a predefined ID of the dependent extension 2. in cases where dependencies are dynamic, the extension can read the names of the dependencies from its configuration. The 2nd scenario is illustrated by the following configuration. Here each complex extension knows that it needs dependencies that implement `storage` and `encoding` interfaces (both existing APIs in collector & contrib), but does not know statically which instances of those, the actual names are supplied by the user in the configuration. ```yaml extensions: complex_extension_1: storage: filestorage encoding: otlpencoding complex_extension_2: storage: dbstorage encoding: jsonencoding filestorage: ... dbstorage: ... otlpencoding: jsonencoding: ``` **Changes**: * Introduce `DependentExtension` optional interface * Change `Extensions` constructor to derive the required order using a directed graph (similar to pipelines) * Inherited from #8733 - use new ordered list of IDs to start/stop/notify extensions in the desired order (previously a map was used to iterate over, which resulted in random order). * Tests **Testing**: Unit tests --------- Signed-off-by: Yuri Shkuro <[email protected]> Co-authored-by: Antoine Toulme <[email protected]>

dmitryax · 2023-11-02T04:09:51Z

Superseded by #8768

atoulme requested review from a team and bogdandrutu October 24, 2023 03:53

yurishkuro approved these changes Oct 24, 2023

View reviewed changes

atoulme force-pushed the extension_lifecycle_order branch from a82289a to 312876d Compare October 24, 2023 04:46

[service/extensions] enforce order of start and shutdown of extension…

24dfe76

…s according to configuration

atoulme force-pushed the extension_lifecycle_order branch from 312876d to 24dfe76 Compare October 24, 2023 04:46

dmitryax requested changes Oct 25, 2023

View reviewed changes

shutdown in reverse order

3d17c0d

dmitryax approved these changes Oct 27, 2023

View reviewed changes

yurishkuro mentioned this pull request Oct 28, 2023

[service/extensions] extension lifecycle order #8768

Merged

yurishkuro reviewed Oct 28, 2023

View reviewed changes

remove comment regarding order of notifications

84826a3

dmitryax closed this Nov 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[service/extensions] extension lifecycle order #8733

[service/extensions] extension lifecycle order #8733

atoulme commented Oct 24, 2023

yurishkuro left a comment

codecov bot commented Oct 24, 2023 •

edited

Loading

dmitryax left a comment

atoulme commented Oct 25, 2023

yurishkuro commented Oct 27, 2023

dmitryax commented Oct 27, 2023

yurishkuro commented Oct 27, 2023

dmitryax commented Oct 27, 2023

yurishkuro commented Oct 27, 2023

dmitryax commented Oct 27, 2023

yurishkuro commented Oct 27, 2023

dmitryax commented Oct 28, 2023

yurishkuro commented Oct 28, 2023

yurishkuro Oct 28, 2023

dmitryax Oct 29, 2023

atoulme Oct 31, 2023

dmitryax commented Oct 29, 2023

yurishkuro commented Oct 30, 2023

dmitryax commented Nov 1, 2023 •

edited

Loading

dmitryax commented Nov 2, 2023

		@@ -77,7 +82,8 @@ func (bes *Extensions) NotifyPipelineReady() error {
		func (bes *Extensions) NotifyPipelineNotReady() error {
		// Notify extensions in reverse order.

[service/extensions] extension lifecycle order #8733

[service/extensions] extension lifecycle order #8733

Conversation

atoulme commented Oct 24, 2023

yurishkuro left a comment

Choose a reason for hiding this comment

codecov bot commented Oct 24, 2023 • edited Loading

Codecov Report

dmitryax left a comment

Choose a reason for hiding this comment

atoulme commented Oct 25, 2023

yurishkuro commented Oct 27, 2023

dmitryax commented Oct 27, 2023

yurishkuro commented Oct 27, 2023

dmitryax commented Oct 27, 2023

yurishkuro commented Oct 27, 2023

dmitryax commented Oct 27, 2023

yurishkuro commented Oct 27, 2023

dmitryax commented Oct 28, 2023

yurishkuro commented Oct 28, 2023

yurishkuro Oct 28, 2023

Choose a reason for hiding this comment

dmitryax Oct 29, 2023

Choose a reason for hiding this comment

atoulme Oct 31, 2023

Choose a reason for hiding this comment

dmitryax commented Oct 29, 2023

yurishkuro commented Oct 30, 2023

dmitryax commented Nov 1, 2023 • edited Loading

dmitryax commented Nov 2, 2023

codecov bot commented Oct 24, 2023 •

edited

Loading

dmitryax commented Nov 1, 2023 •

edited

Loading