[RFC] Offline Background Tasks #12361

linuxpi · 2024-02-19T05:54:40Z

[Detailed Design Proposal] #13554

Introduction

Opensearch process running with data role, has responsibilities to execute various Background Tasks apart from indexing & search, some of these are:

Segment Merges
Force Merges
Re-indexing
Remote Garbage Collection
Shard Split/Shrink
Snapshots etc.

These tasks are a crucial part of an Opensearch Cluster. For example, Segment Merges ensure indices are in an optimal state. As an index grows and data is constantly added or updated, these segments need to be periodically merged to maintain efficient search performance and minimum storage footprint.

This is even more important for indices where data ingestion is sparse over time leading to high number of small-small segments. Segments Merges combines these segments into larger ones ensuring better overall index performance.

Similarly, each background task has its own importance.

Is your feature request related to a problem? Please describe

While being a crucial part of Opensearch, these Tasks consume some resources, taking a toll on the process which is supposed to deliver predictable and consistent indexing and search throughput. For ex: Segment merges is an important, frequent and heavy operation which demands a good chunk of available resources. Force Merging to lesser no of segment is an even heavier toll.

Apart from that, the configured resources on the node might not be sufficient to perform these operations along with incoming traffic, in a expected timeframe, which leads to timeouts/failures, eventually delays to background operations.
Apart from that any failures/bugs in these background operations tampers with core operations.

Describe the solution you'd like

Allow users the ability to segregate such operations to separate/dedicated node(s), it helps them scale indexing/search performance predictably without having to compete for resources with background tasks. Similarly background tasks won't be impacted by any surge in core operations traffic.

With introduction of Remote Store, offloading background operations makes even more sense as data is separated out in Remote Store and efficient to interact with, from a separate/dedicated node.

Proposal is to introduce a separate fleet of Nodes(Offline Fleet) to execute all background tasks. This ensures full segregation from core operations and allows users to independently scale this fleet based on the pending background tasks.

To begin with, we can target Segment Merges or Force Merges and allow Remote Store Clusters the ability to separate out merges. Later we can extend it to other background tasks and even think about how to extend the functionality for non Remote Store clusters.

Here is high level view of how the flow looks like with Offline Fleet for a Cluster.

The Added Cost

Not all the users would want to spin up separate nodes for background operations, so however we choose to implement/execute this, we would ensure status quo is maintained.

There is obviously an added cost of the Offline Fleet, which would be directly dependent on the no of nodes provisioned in the Offline Fleet.

Apart from that, with Offline fleet, there would be 2 additional downloads. Consider Segment Merges:

Offline Fleet Node would have to download the Segments to be Merged, today since the segments are already present in local, there is no download needed.
Once the Merged Segments are uploaded to Remote Store, the data node with corresponding Shard would download those merged segments

In future, we could also support a hybrid model where light weight Tasks could be run locally on Data Nodes while others could be offloaded to Background Fleet.

As we progress, I plan on adding more details to the individual components involved and how they interact with each other and existing component.

Related component

Storage

Describe alternatives you've considered

Apart from the approach mentioned above, another option would be isolation of resources on the data node itself for core(indexing/search) operations and other adhoc operations like merges and snapshots. This would have less friction from users in adoption as they don’t have to provision a separate fleet. But it has some caveats which doesn't make it much appealing:

We wouldn’t be able to independently scale resources for merges without affecting core operations.
Reserving resources for adhoc operations on the data node might not be optimal as all the nodes will not have merges to be performed all the time. Instead pooling all the merge operations from all nodes together into dedicated nodes would give better utilization of dedicated resources.
Complete Isolation of resources on the same node is not be as trivial to solve.

Additional context

No response

peternied · 2024-02-21T16:34:11Z

[Triage - attendees 1 2 3 4 5]
@linuxpi Thanks for filing, looking forward to seeing how this progresses

linuxpi · 2024-03-21T13:30:12Z

Phases

Phase

Goal - Have basic framework ready to run Segment Merges(including ForceMerge) on Dedicated background tasks nodes while maintaining status quo

Meta - #12725

To achieve the goal mentioned above, we need to explore concrete solutions for the following items, which in upcoming Phases, could be extended to various other Background Tasks like Snapshot etc.

Separate out Merge Functionality to an independent Component

Most of the codebase for Opensearch today exists as a Monolith in :server hosting code related to various background tasks, including Merge. It would be an anti-pattern to build entire :sever jar and host on Offline Node, which is just responsible for performing Merges. We need a way to separate out individual components like “Merge” and be able to run separately on Offline Fleet.

Build a Task Coordination Framework to manage task lifecycle.

With Offline fleet, data nodes and Offline Fleet nodes itself can submit background tasks to Offline Fleet. At any point, the no of Tasks submitted might be too much for available nodes in the Offline Fleet to distribute amongst themselves.

Even if we do try to assign a task to a particular node right after its submitted, the node may or may not have resources at that time to start the task and would need to put it into a “Queue”. Apart from that, if the node goes down, and this Queue is not persisted in Remote, all those tasks in Queue are lost.

Phase #2

Goal - Onboard more usecases like Remote GC, Snapshots

dblock · 2024-05-22T16:35:21Z

Late to this game coming from another PR. I think the name "offline" is confusing, would call these "worker" nodes.

linuxpi added enhancement Enhancement or improvement to existing feature or request untriaged labels Feb 19, 2024

github-actions bot added the Storage Issues and PRs relating to data and metadata storage label Feb 19, 2024

linuxpi self-assigned this Feb 19, 2024

linuxpi changed the title ~~[RFC] Offline Merge~~ [RFC] Offline Background Tasks Feb 19, 2024

peternied added the RFC Issues requesting major changes label Feb 21, 2024

peternied removed the untriaged label Feb 21, 2024

This was referenced Mar 18, 2024

[META] [Phase #1] Offline Background Tasks #12725

Open

[Feature Request] Separation of Merges #12726

Open

[Feature Request] Background Tasks #12727

Open

shwetathareja added the merges label Mar 19, 2024

ankitkala mentioned this issue Mar 21, 2024

[RFC] Support for writable warm indices on Opensearch #12809

Open

linuxpi mentioned this issue May 6, 2024

[Design Proposal] Offline Background Tasks #13554

Open

andrross added the Roadmap:Cost/Performance/Scale Project-wide roadmap label label May 14, 2024

sohami mentioned this issue May 24, 2024

[RFC] Search performance on warm index #13806

Open

gbbafna mentioned this issue Aug 8, 2024

Add Varun Bansal as maintainer #15163

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] Offline Background Tasks #12361

[RFC] Offline Background Tasks #12361

linuxpi commented Feb 19, 2024 •

edited

Loading

peternied commented Feb 21, 2024

linuxpi commented Mar 21, 2024 •

edited

Loading

dblock commented May 22, 2024

[RFC] Offline Background Tasks #12361

[RFC] Offline Background Tasks #12361

Comments

linuxpi commented Feb 19, 2024 • edited Loading

Introduction

Is your feature request related to a problem? Please describe

Describe the solution you'd like

The Added Cost

Related component

Describe alternatives you've considered

Additional context

peternied commented Feb 21, 2024

linuxpi commented Mar 21, 2024 • edited Loading

Phases

Phase

dblock commented May 22, 2024

linuxpi commented Feb 19, 2024 •

edited

Loading

linuxpi commented Mar 21, 2024 •

edited

Loading