-
Notifications
You must be signed in to change notification settings - Fork 370
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HA Egress for AWS EKS - Include functionality to assign Egress IPs as secondary IP addresses to the appropriate node instances #5210
Comments
@lukasmrtvy based on discussions we had on Slack, it seems that you are not really asking for Egress HA support across multiple AZs (see #4385), but you are first asking for Egress (with ExternalIPPool) to work out-of-the box for a single AZ? cc @tnqn |
yes, HA = for multiple nodes, not a single one, and yes, for a single AZ |
@antoninbas - Are there any plans to have an Egress assigned to more than one node for H/A purposes? Current limitation is that each time we rotate workers, or a node crashes there will be an intermittent blip whilst the controller assigns the Egress to a healthy node. |
@robbo10 sorry for the delay, I just came back from vacation today I think a "blip" is unavoidable. Even if we had an "active-active" implementation, in case of an Egress Node failure, we would still need to fail over connections to the remaining Node. Additionally, it's not clear how we would handle return traffic, if the Egress IP (which is the destination IP for return traffic) is "assigned" to multiple Egress Nodes. While some earlier Antrea versions (most notably, v1.11.0 and v1.12.0) had a bug causing longer than normal failover times for Egress IPs, the bug was patched in Antrea v1.13, v1.12.1 and v1.11.3 (look for the following in release notes: Ensure the Egress IP is always correctly advertised to the network, including when the userspace ARP responder is not running or when the Egress IP is temporarily claimed by multiple Nodes.). In the absence of the bug, my experience is that failover is very fast (under 1s). Adding @tnqn in case he has further comments. |
@antoninbas - no problem at all this makes sense. I had a conversation with @tnqn who confirmed the same. We were able to verify that the 1.11.3 release greatly improves Egress failover performance :) Thanks for all the work in releasing this. |
@tnqn has created a documentation PR to describe this limitation |
https://github.com/antrea-io/antrea/blob/main/docs/egress.md#egress-on-cloud describes how Egress works on cloud platform today and what's missing to make it work on AWS. |
This issue is stale because it has been open 90 days with no activity. Remove stale label or comment, or this will be closed in 90 days |
Describe the problem/challenge you have
HA Egress does not work for AWS EKS
Describe the solution you'd like
Include functionality to assign Egress IPs as secondary IP addresses to the appropriate node instances
Anything else you would like to add?
The text was updated successfully, but these errors were encountered: