UDP proxying support #492
There is currently no idle timer in the tcp_proxy filter. It would be a useful feature to add in either case, and we would want it for UDP.
Agreed, almost all of the code can be shared. The name of the "tcp_proxy" filter is unfortunate. I don't know if I would bother renaming it right away. We can do that in a dedicated change if we want.
This is related to what @jamessynge was asking about in terms of why the Address interface also includes socket stuff. I mainly did this for simplicity. Ultimately, for UDP upstreams, we would probably like the ability to specify the upstream as udp://1.2.3.4:80 in the cluster definitions, CDS, etc. Given the current code, the simplest way to do this would be to have the Address interface also hold the socket type (as you mentioned in Gitter), and remove this parameter from the various socket-related functions. The alternative would be to split the Address interface and have an Address and a SocketAddress, where a SocketAddress contains an Address. I could really go either way on this; I don't think it's a huge deal.
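For illustration, a rough sketch of the two alternatives being weighed (hypothetical types, not Envoy's actual interfaces):

    #include <cstdint>
    #include <string>

    enum class SocketType { Stream, Datagram };

    // Alternative 1: the Address interface also holds the socket type, so
    // udp://1.2.3.4:80 parses to {"1.2.3.4", 80, SocketType::Datagram} and
    // the socket-related functions no longer need a type parameter.
    struct Address {
      std::string ip;
      uint16_t port;
      SocketType type;
    };

    // Alternative 2: split the interface; a SocketAddress contains an Address.
    struct PlainAddress {
      std::string ip;
      uint16_t port;
    };

    struct SocketAddress {
      PlainAddress address;
      SocketType type;
    };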
Per above, I don't think the filter needs to do anything different from what it does today. The code can be pretty much identical, along with an idle timer to destroy things. Where you have to deal with UDP is probably inside ConnectionImpl: you are going to need to know that it is UDP and deal with MTU there. Doing anything else will be too complicated, I think.
I don't think you need to worry about this. We will need to have UDP listeners, which bind, and have a filter stack. All of the normal rules then apply for where to forward. Along these lines, we are going to need to make the listener configuration more extensible. Right now we just support "port". I would like to extend this to be something along the lines of:
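For illustration only (the concrete schema was still under discussion; the field names here are hypothetical), the idea is that each listener entry would carry a full address rather than a bare port, e.g.:

    listeners: [
      {
        "address": "udp://0.0.0.0:80",
        ...
      }
    ]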
Doing the above will make it easy for schemas, allow us to have pipe listeners, do IPv6, etc. In general I would appreciate it if you could sync up with @jamessynge on all of this, as I think it's related to the IPv6 stuff, as well as future work I know probably needs to happen around QUIC, etc.
@moderation ^^^ Can you provide any info on the specifics of what scenario you need to support in terms of MTU handling, etc.? Want to make sure we are hitting a specific use case. @rshriram if we do this, I would like to do it in several different changes. For example, we could start with adding UDP listeners, which proxy to TCP. That is a pretty straightforward change and is independent.
I agree with splitting this into multiple PRs. There are small changes to different subsystems, and we should do this piecemeal to make sure we can triage issues easily. I am unfamiliar with the requirements of QUIC. With regard to the bind_config, it looks very structured, but a couple of questions: why do we need to key off the address as well, when the port is what matters? (Is this related to the issue that @kyessenov posted?) Secondly, will this config be backward compatible with existing configs? It seems to break the config format. Here is an alternate format (I am okay with either one, frankly):

listeners: [
  {
    "port": 80,
    "port_type": "udp|tcp"  [tcp is the default]
    ...
  }
]
Has there been any progress made on this effort? Is it in active development or open to contribution?
No one is working on this that I know of. This did come up today in the context of something that would be good to work on. This is actually a fairly complicated feature and needs some thinking. @shalako can you provide more color on what you actually need here? Do you just need UDP -> UDP? UDP -> TCP? Should datagram boundaries be preserved? Etc.
@shalako can keep me honest here, but I suspect our expectations are:
I’m interested in moving this forward. Are there more pertinent discussions that I should take a look at before starting some of the changes mentioned above?
@cmluciano I don't think anyone is actively working on this. This one is probably best served by a short design doc (1-2 pages, nothing fancy). Do you want to browse the code and then maybe we can collaborate on the doc contents? Would love to get this being worked on. FYI, there is also some interest from Cisco in helping out with this, but it's unclear when they would have time. I think we can get started if you have cycles.
FWIW, I'm interested in this work for the purpose of proxying SIP traffic and RTP media streams in and out of a k8s cluster. When appropriate, I'll be happy to assist with setting up some services and doing some testing, if that's helpful to you @cmluciano & @mattklein123.
@mattklein123 Sounds good to me. I will take a look through the codebase and let you know when I'm ready for the doc. @jevonearth Thanks! I will ping you when ready.
Let me ask a few questions here about functional behavior that I don't see in the issue yet. I presume we want to be able to specify:

udp://${proxyIp}:${proxyPort} -> udp://${proxiedToIp}:${proxiedToPort}

Correct? So a packet's headers would be transformed like this:

dstIp = ${proxyIp} -> ${proxiedToIp}

and going the other way:

dstIp = ${proxiedToIp} -> ${clientIp}

Is that the desired behavior?
@hagbard5235 ^ is my assumption, but part of the reason that I think we need a design doc on this one is that it's honestly not clear to me exactly what the behavior should be. For example, it's easy enough to fit UDP into Envoy filter chain semantics by raising onData() for each datagram, but what if the user tries to send a datagram that is too large for the target MTU? (Either because the path MTU does not match, or because we are doing TCP -> UDP.) Also, there are some thorny issues around listening for UDP datagrams and the Envoy threading/filter model that need to be thought through.
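To make the per-datagram onData() idea concrete, here is a minimal standalone sketch using plain POSIX sockets (not Envoy's actual APIs; the callback and constant names are illustrative):

    #include <netinet/in.h>
    #include <sys/socket.h>
    #include <cstddef>
    #include <cstdio>
    #include <functional>

    constexpr size_t kAssumedPathMtu = 1400;  // placeholder; real path MTU varies

    // Raise one callback per datagram, preserving datagram boundaries;
    // the analogue of calling onData() once per received packet.
    void readLoop(int fd, const std::function<void(const char*, ssize_t)>& on_data) {
      char buf[65536];  // maximum possible UDP payload
      for (;;) {
        sockaddr_in peer{};
        socklen_t len = sizeof(peer);
        ssize_t n = recvfrom(fd, buf, sizeof(buf), 0,
                             reinterpret_cast<sockaddr*>(&peer), &len);
        if (n < 0) break;
        on_data(buf, n);
      }
    }

    // Writing toward a UDP upstream: a datagram larger than the path MTU
    // either fragments or is dropped, so the proxy must pick a policy.
    bool writeDatagram(int fd, const char* data, size_t size) {
      if (size > kAssumedPathMtu) {
        fprintf(stderr, "datagram of %zu bytes exceeds assumed MTU\n", size);
        return false;  // reject, fragment, or just document the limit
      }
      return send(fd, data, size, 0) == static_cast<ssize_t>(size);
    }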
Oh good... so I wasn't the only one not seeing clarity then ;) Sounds like there's a desire to do TCP -> UDP and UDP -> TCP proxying as well. Does anyone have an example use case for those transitions? I'm curious how we anticipate them being used :)
Is it ok to start with UDP -> UDP and to allow packets to fragment on the way out, if there's an MTU issue?
No idea if this is needed or not; I just want to make sure we consider it in the design and exclude it with appropriate thinking. Either way, the MTU mismatch issue and threading issues will need to be dealt with.
Fragging may not be supported in the environment. In v1 we can likely ignore MTU issues and just document them. This leaves threading. Anyway, just want to capture all of this in the design. :)
@mattklein123 I'm cool capturing TCP -> UDP and UDP -> TCP in the design :) I was asking because having more concrete examples available often helps in the design process :) Question: are we doing a pure UDP proxy (i.e., we make our decisions on ip proto=UDP and port, and purely mutate ip:port fields), or are we looking into the datagrams to make proxy decisions? #socraticdesign ;)
My thinking here was to go for a full L4 proxy. Basically something like: Using this model, the "tcp_proxy" I think should mostly "just work" modulo some minor changes. For QUIC, in the future, we will need to do some pure L4 proxying, but IMO we should try to actually fit this within the existing filter model as much as possible. The main issue that we have to solve (that I don't know the answer to off the top of my head) is how to route the UDP packets between multiple threads. Basically, a connection today is bound to a thread along with its filters. This breaks down for incoming UDP packets that are not part of a connection. E.g., do we have all workers listen for packets and somehow forward? Only initially support UDP with 1 worker? Etc.
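One possible answer to the thread-routing question, sketched under the assumption that hashing the datagram's source tuple can pin all packets from one peer to a single worker (speculation, not a settled Envoy design):

    #include <cstddef>
    #include <cstdint>
    #include <functional>

    // Pick a worker for an incoming datagram by hashing its source (ip, port).
    // Every datagram from a given peer lands on the same worker, so per-peer
    // session state never needs cross-thread synchronization.
    size_t pickWorker(uint32_t src_ip, uint16_t src_port, size_t num_workers) {
      const uint64_t key = (static_cast<uint64_t>(src_ip) << 16) | src_port;
      return std::hash<uint64_t>{}(key) % num_workers;
    }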
I am not sure how threading becomes an issue. For a first cut, it's basically a dumb datagram proxy.

A more fundamental issue is the semantics. Are we going to load balance per datagram? Seems strange. We might need to reuse the Ketama hash or the IP hash and send packets to the same destination host. Any other load balancing algorithm seems unintuitive IMO.

The cluster will have to change as well. It's wedded to stream semantics in terms of circuit breakers. The notion of failure of a host is not going to work given that it's datagrams we are sending (fire and forget). So things like outliers, panic thresholds, etc. are out the window.

A straw-man impl would just take a watered-down version of tcp_proxy and hardwire it to an IP-hash-based cluster where everything related to reliability is turned off.

@grosenhouse would this be a sufficient first cut for CF?
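As a concrete reading of that straw man, a minimal sketch in which the "cluster" is nothing but a source-IP hash over a fixed host list, with all reliability machinery (outliers, panic thresholds, circuit breakers) deliberately absent; the names are illustrative, not Envoy's cluster API:

    #include <cstdint>
    #include <functional>
    #include <string>
    #include <vector>

    struct Host {
      std::string address;
      uint16_t port;
    };

    // Straw-man host selection: pure IP hash, fire and forget. No health
    // checks or outlier detection, since datagram loss is invisible here.
    const Host& pickUpstream(const std::vector<Host>& hosts, uint32_t src_ip) {
      return hosts[std::hash<uint32_t>{}(src_ip) % hosts.size()];
    }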
This is the first commit in a series to support UDP proxying. There are quite a few TODOs in the code before this feature can be considered an MVP. Part of #492
Exciting to see the udp_proxy scaffolding get merged! 🎉 What's next? Is the roadmap at the top of the issue directionally accurate?
Another bunch of work towards #492. The remaining work is proper wiring up of upstream cluster management, host health, etc. and documentation. This will be done in the next PR. Signed-off-by: Matt Klein <[email protected]>
Fixes #492 Signed-off-by: Matt Klein <[email protected]>
MVP complete pending code reviews here #492 if anyone wants to kick the tires.
Edit: (@alyssawilk on behalf of @cmluciano)
Design doc:
https://docs.google.com/document/d/1G9IVq7F7Onwinsl6EYzGsdzAGvVbo2FGfcPt35ItIx8
Roadmap
Original top level comment (@rshriram)
Just like TCP proxying, it would be great if Envoy had support for UDP proxying as well.
The current code for TCP proxying is pretty generic for the most part. The flow is something like this:
on_connection_received_callback()
  --> pick upstream and connect to it
on_data_received_callback(data)
  --> write_to_upstream(data)
on_stream_reset_callback()  [downstream reset or upstream reset?]
  --> cleanups
Based on a cursory scan through the code, there is also a timer that cleans up connections beyond a certain period of inactivity (@mattklein123 please confirm).
In terms of UDP support, much of the code in filters above can be repurposed or renamed to be generic to TCP/UDP where possible.
The ClientConnectionImpl class hardcodes the socket type to be Stream. This needs to be changed.
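At the BSD socket level the change is tiny; a sketch with plain POSIX calls (not the actual ClientConnectionImpl code) of making the hardcoded stream type a parameter:

    #include <sys/socket.h>

    // SOCK_STREAM (TCP) is what ClientConnectionImpl effectively hardcodes
    // today; supporting UDP means choosing SOCK_DGRAM at creation time.
    int makeSocket(bool datagram) {
      return socket(AF_INET, datagram ? SOCK_DGRAM : SOCK_STREAM, 0);
    }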
UDP packets with source port 0 should be dropped (?).
Instead of creating/destroying UDP connection objects per packet, the process can be optimized by having a keepalive-style timer that deletes the connection objects after the timer expires. As a first-order approximation, the UDP datagram size can be fixed to one MTU or less, which should suffice (RFC 791, RFC 2460). We do not need to buffer up data and send it out. WDYT?
In terms of session affinity, packets from the same (src ip, src port) would go to the same (dst ip, dst port).
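A minimal sketch of the keepalive-timer-plus-affinity idea above, assuming sessions keyed by the downstream (source IP, source port) with a periodic idle sweep (illustrative names, not Envoy's implementation):

    #include <chrono>
    #include <cstdint>
    #include <map>
    #include <utility>

    using Clock = std::chrono::steady_clock;

    struct UdpSession {
      int upstream_fd;            // connected UDP socket toward the chosen host
      Clock::time_point last_rx;  // refreshed on every datagram
    };

    // Key sessions by downstream (src ip, src port) so every packet from the
    // same peer reuses the same "connection" object and upstream destination.
    std::map<std::pair<uint32_t, uint16_t>, UdpSession> sessions;

    // Keepalive-style cleanup: rather than destroying state per packet,
    // delete a session only after it has been idle past the timeout.
    void expireIdle(Clock::duration idle_timeout) {
      const auto now = Clock::now();
      for (auto it = sessions.begin(); it != sessions.end();) {
        if (now - it->second.last_rx > idle_timeout) {
          it = sessions.erase(it);  // closing upstream_fd omitted for brevity
        } else {
          ++it;
        }
      }
    }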