-
Notifications
You must be signed in to change notification settings - Fork 831
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
intermittent UNIMPLEMENTED error from ambassador #473
Comments
Seems to especially affect first request so could be related to emissary-ingress/emissary#504 Looking at grpc/grpc#16515 (comment) perhaps the way forward for this is https://pypi.org/project/retrying/ |
Seem to be able to resolve this for tests using 656ac5b#diff-3119fc203a07a67876b8d74d342faceeR44 - have only applied that in a particular branch for now |
I think upgrading to latest ambassador (through #480) also resolves this for the notebooks. I was seeing this intermittently in notebooks and haven't seen it in that branch with the latest ambassador |
Turns out the issue is present in 0.53.1 but can be worked around by setting ambassador to run as root. Will check again in 0.60.0 - emissary-ingress/emissary#504 |
Actually issue still present even when running as root. I managed to recreate the ambassador grpc problem from the helm_examples.ipynb. This time though I didn't uninstall and reinstall ambassador. I just followed the notebook and for the AB test scenario (at which point ambassador has been there for a while) I saw was the grpc calls reporting as failed for a good minute while REST requests worked. It seems the issue actually affects the initial period when a new instance comes online, specifically for A/B tests. Oddly, ambassador logs were reporting that the requests were going through successfully and there were no errors in other logs but the response was clearly a failure: Eventually grpc requests start working but it can take a whole minute. The failures always seems to be when using ambassador with single namespace scope. We can work around this by downgrading to 0.50.0, even though that means putting references to the ambassador API back down to v0. |
Is this issue still present in Ambassador v73.0? |
Its not been confirmed or closed by Ambassador : see emissary-ingress/emissary#1587 |
+1 |
We are using ambassador 0.73.0 (quay.io/datawire/ambassador:0.73.0), and we are having this gRPC issue. May I have an update on how to resolving this? |
We need to determine with Ambassador community if this is a Seldon issue or Ambasador issue. If Ambassador we need to evaluate if we continue to support Ambassador long term. |
Issues go stale after 30d of inactivity. |
This issue should be fixed for grpc. There is still the issue of envoy updates not always happening immediately in an Ambassador update which means e2e tests can be flaky. |
Spotted this in the helm_examples notebook but have also seen in the E2E tests. Was seeing:
Might especially affect the A/B test scenario but not sure. Was encountered when testing #445 but it isn't related to that change as has been seen separately.
The text was updated successfully, but these errors were encountered: