-
Notifications
You must be signed in to change notification settings - Fork 450
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
After Rebooting all my Control Plane Nodes at the same time: ovn-central details":"inconsistent data","error":"ovsdb error" #3919
Comments
It appears this is only happening on 1 pod which is on headnode-01
|
please see the doc: https://kubeovn.github.io/docs/v1.13.x/ops/recover-db/ |
@bobz965 Great! Kick From cluster worked perfectly. Thank You :) |
After kicking heanode-01 out of the cluster: nbctl show is empty. "_" I think the database is gone? [ use1 ] root@headnode-02: |
Alright, something might be strange with the plugin. seems k ko nbctl isn't returning results, but running the ovn-nbctl show command inside 1 of the ovn-central pods works as expected. |
After coming back, I went to test EIP..... Tried deleting a eip and fip..... not found.
|
Restarting all 3 of the ovn-central pods allowed the kubectl ko command to function as expected. I assume its ttempting to execute this command on an old leader or? |
After bouncing them..... kube-ovn-controller logs are flooded with
|
Bouncing the kube-ovn-controllers and ran into
|
I was able the controller to get passed init by kubectl deleting all the pods, and ips on those subnets which required me to patch the finalizer null. These aren't in production, but I imagine that subnet init function may have a bug in it. |
i think you should kick the bad one, and then clean its nb db and sb db data, and add it back. |
should be fixed by #3928 |
How do we recover from the following condition?
PROBE_INTERVAL is set to 180000
OVN_LEADER_PROBE_INTERVAL is set to 5
OVN_NORTHD_N_THREADS is set to 1
ENABLE_COMPACT is set to false
172.16.0.1
ovsdb-tool: /etc/ovn/ovnnb_db.db: record 175 attempts to truncate log from 1873 to 1839 entries, but commit index is already 1872
backup /etc/ovn/ovnnb_db.db to /etc/ovn/ovnnb_db.db.backup-1713236899-d612f5
detected database corruption for file /etc/ovn/ovnnb_db.db, try to fix it.
ovsdb-tool: /etc/ovn/ovnnb_db.db: record 175 attempts to truncate log from 1873 to 1839 entries, but commit index is already 1872
[{"uuid":["uuid","41001f66-a798-438c-b29f-d308c8b4f853"]},{"uuid":["uuid","d6d7e922-1438-40f8-ab60-6b96b09bd538"]}]
[{"uuid":["uuid","58250fbc-e394-43b0-8d66-be020227f6ce"]},{"uuid":["uuid","7775de3b-d03c-45ae-b9da-74a39b50ea0b"]}]
2024-04-16T03:08:19Z|00001|stream_ssl|ERR|Private key must be configured to use SSL
2024-04-16T03:08:19Z|00002|stream_ssl|ERR|Certificate must be configured to use SSL
2024-04-16T03:08:19Z|00003|stream_ssl|ERR|CA certificate must be configured to use SSL
ovsdb-client: failed to connect to "ssl:[172.16.0.1]:6641" (Protocol not available)
2024-04-16T03:08:19Z|00001|stream_ssl|ERR|Private key must be configured to use SSL
2024-04-16T03:08:19Z|00002|stream_ssl|ERR|Certificate must be configured to use SSL
2024-04-16T03:08:19Z|00003|stream_ssl|ERR|CA certificate must be configured to use SSL
ovsdb-client: failed to connect to "ssl:[172.16.0.2]:6641" (Protocol not available)
2024-04-16T03:08:19Z|00001|stream_ssl|ERR|Private key must be configured to use SSL
2024-04-16T03:08:19Z|00002|stream_ssl|ERR|Certificate must be configured to use SSL
2024-04-16T03:08:19Z|00003|stream_ssl|ERR|CA certificate must be configured to use SSL
ovsdb-client: failed to connect to "ssl:[172.16.0.3]:6641" (Protocol not available)
ovn-nbctl: unix:/var/run/ovn/ovnnb_db.sock: database connection failed ()
2024-04-16T03:08:19Z|00001|reconnect|INFO|unix:/var/run/ovn/ovnnb_db.sock: connecting...
2024-04-16T03:08:19Z|00002|reconnect|INFO|unix:/var/run/ovn/ovnnb_db.sock: connected
ovn-sbctl: unix:/var/run/ovn/ovnsb_db.sock: database connection failed ()
2024-04-16T03:08:20Z|00001|reconnect|INFO|unix:/var/run/ovn/ovnsb_db.sock: connecting...
2024-04-16T03:08:20Z|00002|reconnect|INFO|unix:/var/run/ovn/ovnsb_db.sock: connected
2024-04-16T03:08:21Z|00002|ovsdb_idl|WARN|transaction error: {"details":"inconsistent data","error":"ovsdb error"}
ovn-nbctl: transaction error: {"details":"inconsistent data","error":"ovsdb error"}
The text was updated successfully, but these errors were encountered: