Scalable route configuration #6602

htuch · 2019-04-16T17:55:44Z

The existing Envoy RouteConfiguration must be evaluated linearly and contains match criteria with hard to predict execution time (regular expressions). For scale-up Envoy instances where very large route configurations for a host may be present (let's say O(1m)) or multi-tenant Envoy instances, this becomes problematic.

There are many ways this could be solved:

Envoy could optimize based on presented configuration, for example a trie could be recovered from repeated patterns or a specific match order. This is challenging if the existing linear semantics are to be preserved and involves Envoy complexity.
We could have variants of regular expressions, e.g. prefix/suffix/glob or RE2 that are cheaper and more predictable to evaluate.
We could have a parallel RouteConfiguration with trie-like semantics, where evaluation order is designed for efficiency.

Some combination of (2) and (3) seems likely to yield a clean solution IMHO, but I'm keen to hear from others.

@jmarantz @mattklein123 @dmitri-d @yanavlasov

The text was updated successfully, but these errors were encountered:

mattklein123 · 2019-04-16T18:04:22Z

I've thought about this for a while and IMO we need to do (3) which is to have a parallel configuration which supports a much more limited set of matching options designed for sub-linear execution. Optimally, the configuration would allow embedding a "legacy" route configuration inside of it so that once a sub-linear match is done, the user can further do a linear match if desired for extra features.

dmitri-d · 2019-04-18T01:12:29Z

I think a combination of (2) and (3) would work. Not sure re: full-on regex implementation in (2), but some form of simple pattern (a subset of bash pattern-matching perhaps?)/prefix and/or suffix matching might be sufficient for our purposes.

jmarantz · 2019-04-18T01:32:57Z

I've been messing around with RE2 vs std::regex benchmarking on my Mac on the plane, and there's a pretty significant speedup. This includes dramatic improvements executing regular expressions that are thought of as pathologically slow. I will publish a bit later.

However, I think sublinear is the only way to scale, one way or another we need to get to at trie.

I was also thinking it might be possible in some cases to compile a restricted subset of existing Envoy config into a trie, but I haven't sorted out the details. There's some value in being able to simply make the existing configurations run faster. This is because there are layers (e.g. Istio) that already mirror the existing Envoy configs.

Going with (3) would be easier though (less of a science project).

htuch · 2019-04-18T21:07:31Z

@louiscryan

rnd4222 · 2019-05-06T19:59:44Z

Just for the record: Intel Hyperscan is specifically designed to match strings against a set of regular expressions; and (I believe) it uses vector instructions, so constants should be good too.

cmluciano · 2019-05-29T14:09:18Z

@htuch @jmarantz Would you consider this a blocker before implementing nginx style rewrite rules via rds.RedirectAction ?

The final comment on a PR addressing 2092 was closed with the author suggesting that a prefix trie should probably be preferred over std::regex .

mattklein123 · 2019-05-29T14:11:30Z

@cmluciano no, I think that feature can be independently added.

htuch · 2019-08-27T02:53:55Z

FWIW, I have an initial attempt at a design for this at https://docs.google.com/document/d/1orxTIL9FXtgmyl5TtPRBqGjgp1ekL7oOKDd95wxeCRY/edit#.

rojkov · 2021-01-14T12:31:43Z

Re. implementation of regex matching... I think it's possible to speed up even a linear list of matchers if to scan an input string against all the regexes used in the list at once. Here we loose the ability to craft manually an optimized list by scanning the regexes lazily, but enable HW optimizations.

RE2 has got the RE2::Set interface that can be used to build a single NFA out of many regexes. Then one scan is enough to mark all the predicates in the list either true or false. There is a nuance though: RE2 builds DFA lazily for explored states at scan time and caches the result to reuse for future scans. This make memory consumption unpredictable and may trigger error thresholds.

Hyperscan is designed to perform matches against many patterns: it builds a single NFA, decomposes to subcomponents, reduces redundancies, translates the subcomponents to various implementations of DFA, NFA and literal matchers depending on what suites better. The literal matchers are SIMD accelerated. DFA/NFA subcomponents are SIMD accelerated when possible. The problem is this compilation takes more time than in case of RE2. But memory consumption is less and guaranteed to be constant at scan time. Also for SIMD acceleration at least SSSE3 is required.

htuch · 2021-01-15T00:22:15Z

I think with MatcherTrees, it's possible to have a map from regex to next nodes. In that situation, you could build a single NFA, subject to the limits you describe. CC @snowp

manlinl · 2021-11-24T20:33:53Z

Hi Envoy team, what is the current status of MatcherTree feature and any timeline for GA?

A little background, we have 1k+ microservices and plan to build a global service mesh. the O(N) routing match algorithm isn't scalabe.

manlinl · 2021-11-30T02:30:07Z

Just another quick question, Does scoped routing configuration match key to route with O(1) or O(n) algorithm? Thanks!

rgs1 · 2021-11-30T23:46:51Z

Hi Envoy team, what is the current status of MatcherTree feature and any timeline for GA?

A little background, we have 1k+ microservices and plan to build a global service mesh. the O(N) routing match algorithm isn't scalabe.

FWIW, in our Thrift mesh we worked around this via #8994. cc: @fishcakez

htuch · 2022-11-24T02:05:27Z

@snowp @mattklein123 should we consider this work closed by the generic matchers?

euroelessar · 2022-11-24T02:50:09Z

@snowp @mattklein123 should we consider this work closed by the generic matchers?

Does it allow making sub-linear route matching for action selection? (e.g. we are interested in an ability to select cluster & add some per-route metadata for stats/access logs)

euroelessar · 2022-11-24T03:52:44Z

Actually, scratch my comment, I've just noticed that envoy has generic matchers support for routing, which covers my use case.
Should marking it as stable be considered as a pre-requisite for closing the issue though?

mattklein123 · 2022-11-26T18:31:58Z

Actually, scratch my comment, I've just noticed that envoy has generic matchers support for routing, which covers my use case.
Should marking it as stable be considered as a pre-requisite for closing the issue though?

Yeah we should just go ahead and mark this stable IMO. It's been out for long enough. cc @snowp @kyessenov

euroelessar · 2022-11-28T01:33:51Z

Looking more into the implementation, it looks like current matcher api in route config has no support for runtime fractions (as well as other existing matcher flavors). It's somehow complicated by the fact that it does not support embedding existing routing config (or a list of routes) as a final leaf as well.

Should a new issue (issues) be created to track it?

mattklein123 · 2022-11-28T15:45:55Z

It's somehow complicated by the fact that it does not support embedding existing routing config (or a list of routes) as a final leaf as well.

I don't know where we ended up, but I know that the original intention was for the original route implementation to be usable as a leaf node. I thought this was already implemented but maybe not. cc @snowp @kyessenov

sanjaypujare · 2024-06-25T05:57:22Z

This bug is still open. Does it mean there are plans to address it? Specifically for the following mentioned in the first comment.

We could have a parallel RouteConfiguration with trie-like semantics, where evaluation order is designed for efficiency.

This is something we would be really interested in (unless we can achieve this through the generic matcher API?)

htuch added the design proposal Needs design doc/proposal before implementation label Apr 16, 2019

mattklein123 added the help wanted Needs help! label Apr 27, 2019

mattklein123 mentioned this issue May 6, 2019

O(log n) matching for HTTP routes - Feature Request #6823

Closed

euroelessar mentioned this issue Jul 25, 2019

http: Add ability to merge slashes #7621

Merged

mattklein123 added the area/http label Dec 13, 2019

mattklein123 mentioned this issue Jul 24, 2020

rbac: principals evaluation performance #12285

Open

mattklein123 self-assigned this Sep 17, 2020

mattklein123 assigned snowp and unassigned mattklein123 Nov 19, 2020

asraa mentioned this issue Nov 29, 2021

Envoy routing scalability & performance question #19067

Closed

jmarantz mentioned this issue Mar 1, 2022

High CPU usage for matching routes (re2::DFA::InlinedSearchLoop) #20147

Closed

zhxie mentioned this issue Mar 3, 2022

matcher: support prefix match map for prefix matching #20198

Closed

jmarantz mentioned this issue Jul 28, 2022

contrib: add Hyperscan regex engine #21973

Merged

euroelessar mentioned this issue Nov 29, 2022

Matcher-based route config should support runtime_fraction #24239

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scalable route configuration #6602

Scalable route configuration #6602

htuch commented Apr 16, 2019

mattklein123 commented Apr 16, 2019

dmitri-d commented Apr 18, 2019 •

edited

Loading

jmarantz commented Apr 18, 2019

htuch commented Apr 18, 2019

rnd4222 commented May 6, 2019

cmluciano commented May 29, 2019

mattklein123 commented May 29, 2019

htuch commented Aug 27, 2019

rojkov commented Jan 14, 2021

htuch commented Jan 15, 2021 •

edited

Loading

manlinl commented Nov 24, 2021

manlinl commented Nov 30, 2021

rgs1 commented Nov 30, 2021

htuch commented Nov 24, 2022

euroelessar commented Nov 24, 2022

euroelessar commented Nov 24, 2022

mattklein123 commented Nov 26, 2022

euroelessar commented Nov 28, 2022

mattklein123 commented Nov 28, 2022

sanjaypujare commented Jun 25, 2024

Scalable route configuration #6602

Scalable route configuration #6602

Comments

htuch commented Apr 16, 2019

mattklein123 commented Apr 16, 2019

dmitri-d commented Apr 18, 2019 • edited Loading

jmarantz commented Apr 18, 2019

htuch commented Apr 18, 2019

rnd4222 commented May 6, 2019

cmluciano commented May 29, 2019

mattklein123 commented May 29, 2019

htuch commented Aug 27, 2019

rojkov commented Jan 14, 2021

htuch commented Jan 15, 2021 • edited Loading

manlinl commented Nov 24, 2021

manlinl commented Nov 30, 2021

rgs1 commented Nov 30, 2021

htuch commented Nov 24, 2022

euroelessar commented Nov 24, 2022

euroelessar commented Nov 24, 2022

mattklein123 commented Nov 26, 2022

euroelessar commented Nov 28, 2022

mattklein123 commented Nov 28, 2022

sanjaypujare commented Jun 25, 2024

dmitri-d commented Apr 18, 2019 •

edited

Loading

htuch commented Jan 15, 2021 •

edited

Loading