Support `grpc_read_timeout` and `grpc_send_timeout` in annotations #11250

Anddd7 · 2024-04-11T01:57:55Z

What do you want to happen?

We want to configure the grpc_read_timeout and grpc_send_timeout with annotations on ingress.

Since the server-snippet has some potential risk, we have to migrate all snippets to common annotations.
https://github.com/search?q=repo%3Akubernetes%2Fingress-nginx+path%3A%2F%5Einternal%5C%2Fingress%5C%2Fannotations%5C%2F%2F+parser.AnnotationRiskCritical&type=code

Is there currently another issue associated with this?

#2475, but it's inactive and closed.

Does it require a particular kubernetes version?

No

Solutions

1. Add new annotations, e.g.

...
annotations:
  nginx.ingress.kubernetes.io/grpc_read_timeout: 3600
  nginx.ingress.kubernetes.io/grpc_send_timeout: 3600
...

2. Set `grpc_read_timeout` and `grpc_send_timeout` with the same value in `proxy_read_timeout`

Similar like nginx-ingress-controller, just set grpc with proxy settings by default

The text was updated successfully, but these errors were encountered:

Anddd7 · 2024-04-11T01:58:32Z

i'd like to contribute if it's ready to implement

longwuyuan · 2024-04-11T05:48:04Z

/triage accepted

I think a PR will be reviewed because it will offset using snippets
But need to improve he accuracy of reference because that link is a nginx webserver and reverseproxy link and this is a go+lua implementation of ingress API. But yeah it is obviously originating in nginx upstream
Do you want to submit a PR
There is even a video on how to create new annotation in the docs on the developer's help page

longwuyuan · 2024-04-13T02:07:46Z

/assign

Anddd7 · 2024-04-13T12:41:52Z

@longwuyuan yes, i have read the docs and read to create a PR.

I refer to exsiting proxy settings to create 3 annotations for grpc timeout, please take a look, #11258

I'll refine the documentation later.

the upstream nginx is already support this, https://nginx.org/en/docs/http/ngx_http_grpc_module.html#grpc_connect_timeout.

strongjz · 2024-04-25T15:44:55Z

/priority backlog

github-actions · 2024-05-26T01:52:37Z

This is stale, but we won't close it automatically, just bare in mind the maintainers may be busy with other tasks and will reach your issue ASAP. If you have any question or request to prioritize this, please reach #ingress-nginx-dev on Kubernetes Slack.

flphvlck · 2024-08-19T12:19:00Z

Hi @strongjz , maybe this should be noted as breaking change in release notes for 1.11
Because we are using:

      nginx.ingress.kubernetes.io/configuration-snippet: |
        grpc_next_upstream_tries X;
        grpc_read_timeout Xs;
        grpc_send_timeout Xs;

and after upgrade from 1.10 to 1.11 we ended up with these errors:

Error: exit status 1
2024/08/19 12:10:49 [emerg] 60#60: "grpc_read_timeout" directive is duplicate in /tmp/nginx/nginx-cfg658058193:2367
nginx: [emerg] "grpc_read_timeout" directive is duplicate in /tmp/nginx/nginx-cfg658058193:2367
nginx: configuration file /tmp/nginx/nginx-cfg658058193 test failed

In the /tmp/nginx/nginx-cfg658058193 there really are duplicated directives for grpc (default values from annotations and our values from configuration-snippet

			# Grpc settings
			grpc_connect_timeout                    5s;
			grpc_send_timeout                       60s;
			grpc_read_timeout                       60s;

			grpc_next_upstream error timeout http_502 http_503 http_504 non_idempotent;
			grpc_next_upstream_tries X;
			grpc_read_timeout Xs;
			grpc_send_timeout Xs;

We will switch from snippet to annotations to fix this, but I think it's better to know about this from release notes.

Anddd7 · 2024-08-21T10:16:34Z

So we may need a 'merge' mechanism to ensure the annotations and snippets can work together, if there is a duplicate config. Then, we can migrate the snippets to annotations. (if we create more annotations)

flphvlck · 2024-08-21T10:26:44Z

Yeah, that's another option how to deal with it (except "breaking change" note)
I'm in the beggining of the upgrade process in our clusters and I will investigate how big problem is this for us. But it will deffinitely make the whole process harder

Possible merge mechanism - just don't apply default values from annotations when GRPC is used?

Anddd7 · 2024-08-22T01:28:30Z

Yeah, that's another option how to deal with it (except "breaking change" note) I'm in the beggining of the upgrade process in our clusters and I will investigate how big problem is this for us. But it will deffinitely make the whole process harder

Possible merge mechanism - just don't apply default values from annotations when GRPC is used?

ok, the solution 1 would be safer -- add new annotations, it doesn't break the existing "configurations", you can add new annotations and remove snippets at the same time.

how do you think? @tao12345666333 @longwuyuan

sepich · 2024-08-26T08:50:42Z

Sorry guys, but that is a really breaking change not mentioned in any changelogs.
Could you please at least add this to Release and https://github.com/kubernetes/ingress-nginx/tree/main/changelog ?
What happens on upgrade now is that Ingress completely stops serving any sites, as it has some typo in some single config.
Is it possible to have nginx_ingress_controller_config_last_reload_successful as a readinessCheck, so that new version with broken config cannot even be rolled out?

flphvlck · 2024-08-26T10:24:35Z

@sepich it's because of duplicities in nginx.conf - in case you set grpc timeouts in nginx.ingress.kubernetes.io/configuration-snippet

sepich · 2024-08-26T10:37:59Z

Thank you, I know why it happend.
Question is how should I know it before the upgrade?
No any mentions to "check all your snippets or everything would be down" in changelogs.

flphvlck · 2024-08-26T10:48:04Z

I raised issue #11866

flphvlck · 2024-08-26T11:54:06Z

ok, the solution 1 would be safer -- add new annotations, it doesn't break the existing "configurations", you can add new annotations and remove snippets at the same time.

@Anddd7 that would be absolutely marvellous

jon-rei · 2024-09-26T14:11:45Z

We are in the process of upgrading our clusters to the latest version and have encountered this problem. For each cluster we have about 50 ingress resources that have grpc_read_timeout configured as a snippet.

Is there a recommended strategy for handling the upgrade to 1.11 for a large number of ingress resources?

flphvlck · 2024-09-26T20:33:51Z

@jon-rei we are also in the process of upgrading
I have to change settings for the 3 affected options manually in each ingress resource which uses GRPC

worse thing is that the default for grpc_connect_timeout was changed from 60s to 5s - as I understood this change
it's not unusual that you don't even know that you rely on default value

Anddd7 added the kind/feature Categorizes issue or PR as related to a new feature. label Apr 11, 2024

k8s-ci-robot added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority labels Apr 11, 2024

k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Apr 11, 2024

k8s-ci-robot assigned longwuyuan Apr 13, 2024

Anddd7 mentioned this issue Apr 13, 2024

feat: Add grpc timeouts annotations #11258

Merged

6 tasks

k8s-ci-robot added priority/backlog Higher priority than priority/awaiting-more-evidence. and removed needs-priority labels Apr 25, 2024

github-actions bot added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label May 26, 2024

k8s-ci-robot closed this as completed in #11258 May 26, 2024

flphvlck mentioned this issue Aug 26, 2024

Unmentioned breaking change in 1.11 for GRPC #11866

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support `grpc_read_timeout` and `grpc_send_timeout` in annotations #11250

Support `grpc_read_timeout` and `grpc_send_timeout` in annotations #11250

Anddd7 commented Apr 11, 2024

Anddd7 commented Apr 11, 2024

longwuyuan commented Apr 11, 2024

longwuyuan commented Apr 13, 2024

Anddd7 commented Apr 13, 2024 •

edited

Loading

strongjz commented Apr 25, 2024

github-actions bot commented May 26, 2024

flphvlck commented Aug 19, 2024 •

edited

Loading

Anddd7 commented Aug 21, 2024 •

edited

Loading

flphvlck commented Aug 21, 2024 •

edited

Loading

Anddd7 commented Aug 22, 2024

sepich commented Aug 26, 2024

flphvlck commented Aug 26, 2024

sepich commented Aug 26, 2024

flphvlck commented Aug 26, 2024

flphvlck commented Aug 26, 2024

jon-rei commented Sep 26, 2024

flphvlck commented Sep 26, 2024

Support grpc_read_timeout and grpc_send_timeout in annotations #11250

Support grpc_read_timeout and grpc_send_timeout in annotations #11250

Comments

Anddd7 commented Apr 11, 2024

What do you want to happen?

Is there currently another issue associated with this?

Does it require a particular kubernetes version?

Solutions

1. Add new annotations, e.g.

2. Set grpc_read_timeout and grpc_send_timeout with the same value in proxy_read_timeout

Anddd7 commented Apr 11, 2024

longwuyuan commented Apr 11, 2024

longwuyuan commented Apr 13, 2024

Anddd7 commented Apr 13, 2024 • edited Loading

strongjz commented Apr 25, 2024

github-actions bot commented May 26, 2024

flphvlck commented Aug 19, 2024 • edited Loading

Anddd7 commented Aug 21, 2024 • edited Loading

flphvlck commented Aug 21, 2024 • edited Loading

Anddd7 commented Aug 22, 2024

sepich commented Aug 26, 2024

flphvlck commented Aug 26, 2024

sepich commented Aug 26, 2024

flphvlck commented Aug 26, 2024

flphvlck commented Aug 26, 2024

jon-rei commented Sep 26, 2024

flphvlck commented Sep 26, 2024

Support `grpc_read_timeout` and `grpc_send_timeout` in annotations #11250

Support `grpc_read_timeout` and `grpc_send_timeout` in annotations #11250

2. Set `grpc_read_timeout` and `grpc_send_timeout` with the same value in `proxy_read_timeout`

Anddd7 commented Apr 13, 2024 •

edited

Loading

flphvlck commented Aug 19, 2024 •

edited

Loading

Anddd7 commented Aug 21, 2024 •

edited

Loading

flphvlck commented Aug 21, 2024 •

edited

Loading