-
Notifications
You must be signed in to change notification settings - Fork 223
[cetic/nifi] Readiness Probe Fails #47
Comments
Same issue version: 0.4.2 |
same issue for me. |
Is any updates for this issue? |
Hi, any updates for this issue? We are also looking for something with cluster monitoring |
@RabbidDog @appunni-dishq @fzalila @webdevops-admin @twinklekumarp I don't think this is a NiFi or Helm chart issue. Whenever you see something like When this happens on our production Kubernetes clusters I have to reach out to our Kubernetes cluster administrators to find and fix the problem manually. For more background, see https://blog.mayadata.io/recover-from-volume-multi-attach-error-in-on-prem-kubernetes-clusters |
Describe the bug
Nifi readiness probe doesn't succeed.
During out cluster installation we deploy and delete and redeploy Nifi multiple times. And we use persistent storage for the nifi. Randomly, after a redeployment, Nifi will stop working.
We are not doing anything different in our redeployment.
What happened:
The readiness probe fails with the following message
Readiness probe failed: Node not found with CONNECTED state. Full cluster state: % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0* Trying 10.233.105.161... * TCP_NODELAY set * Connected to nifi-0.nifi-headless.dap-core.svc.cluster.local (10.233.105.161) port 8080 (#0) > GET /nifi-api/controller/cluster HTTP/1.1 > Host: nifi-0.nifi-headless.dap-core.svc.cluster.local:8080 > User-Agent: curl/7.52.1 > Accept: */* > < HTTP/1.1 404 Not Found < Date: Fri, 28 Feb 2020 13:56:06 GMT < X-Frame-Options: SAMEORIGIN < Content-Security-Policy: frame-ancestors 'self' < X-XSS-Protection: 1; mode=block < Content-Type: application/json < Vary: Accept-Encoding < Content-Length: 59 < Server: Jetty(9.4.19.v20190610) < { [59 bytes data] * Curl_http_done: called premature == 0 100 59 100 59 0 0 6072 0 --:--:-- --:--:-- --:--:-- 6555 * Connection #0 to host nifi-0.nifi-headless.dap-core.svc.cluster.local left intact parse error: Invalid numeric literal at line 1, column 5 parse error: Invalid numeric literal at line 1, column 5
We also see these kind of errors
Multi-Attach error for volume "pvc-366c853a-2e59-4443-a1bd-0f8a7d9c2d51" Volume is already exclusively attached to one node and can't be attached to another
What you expected to happen:
The Readiness probe must pass
How to reproduce it (as minimally and precisely as possible):
Hard to say because we don't know what causes it. But best bet to reproduce would be to deploy nifi with persistent volumes. Then delete deployment and redeploy again. The might show up .
Anything else we need to know:
The text was updated successfully, but these errors were encountered: