This repository has been archived by the owner on Jun 20, 2024. It is now read-only.
What you expected to happen?
Weave should work on all nodes in the cluster, and every node should have working internet access.
What happened?
I deployed Weave in a Kubernetes cluster with 4 nodes. Only 1 of the 4 gets a working internet connection.
How to reproduce it?
Install Docker: export VERSION=18.06 && curl -sSL get.docker.com | sh
Install Kubernetes: sudo apt-get install -qy kubelet=1.9.7-00 kubectl=1.9.7-00 kubeadm=1.9.7-00
Install Weave: kubectl apply -f https://git.io/weave-kube-1.6
The installation above completes without problems and nodes can be added easily. Once the DaemonSet is deployed, only 1 node works; the rest show errors (see the logs below).
Versions:
weave version: 1.6
docker version:
Client:
 Version: 18.06.1-ce
 API version: 1.38
 Go version: go1.10.3
 Git commit: e68fc7a
 Built: Tue Aug 21 17:30:52 2018
 OS/Arch: linux/arm
 Experimental: false
Server:
 Engine:
  Version: 18.06.1-ce
  API version: 1.38 (minimum version 1.12)
  Go version: go1.10.3
  Git commit: e68fc7a
  Built: Tue Aug 21 17:26:37 2018
  OS/Arch: linux/arm
  Experimental: false
uname -a
Linux TX-G1-209 4.14.82-v7+ #1165 SMP Wed Nov 21 16:42:28 GMT 2018 armv7l GNU/Linux
Logs:
$ kubectl logs -n kube-system weave
DEBU: 2018/11/22 10:20:42.456763 [kube-peers] Checking peer "d6:48:62:21:0e:77" against list &{[{7a:99:6e:57:00:20 tx-g1-209} {d6:48:62:21:0e:77 tx-g1-204}]}
INFO: 2018/11/22 10:20:43.310052 Command line options: map[expect-npc:true nickname:tx-g1-204 docker-api: name:d6:48:62:21:0e:77 port:6783 ipalloc-range:10.32.0.0/12 metrics-addr:0.0.0.0:6782 no-dns:true datapath:datapath db-prefix:/weavedb/weave-net host-root:/host ipalloc-init:consensus=9 conn-limit:100 http-addr:127.0.0.1:6784]
INFO: 2018/11/22 10:20:43.310252 weave 2.5.0
INFO: 2018/11/22 10:20:43.861719 Bridge type is bridged_fastdp
INFO: 2018/11/22 10:20:43.861818 Communication between peers is unencrypted.
INFO: 2018/11/22 10:20:43.894230 Our name is d6:48:62:21:0e:77(tx-g1-204)
INFO: 2018/11/22 10:20:43.894659 Launch detected - using supplied peer list: [192.168.88.172 192.168.88.189 192.168.88.173 192.168.88.80 192.168.88.109 192.168.88.180 192.168.88.73 192.168.88.133 192.168.88.123]
INFO: 2018/11/22 10:20:44.073218 Checking for pre-existing addresses on weave bridge
INFO: 2018/11/22 10:20:44.088128 [allocator d6:48:62:21:0e:77] Initialising with persisted data
INFO: 2018/11/22 10:20:44.109920 Sniffing traffic on datapath (via ODP)
INFO: 2018/11/22 10:20:44.110600 ->[192.168.88.80:6783] attempting connection
INFO: 2018/11/22 10:20:44.111148 ->[192.168.88.180:6783] attempting connection
INFO: 2018/11/22 10:20:44.111490 ->[192.168.88.73:6783] attempting connection
INFO: 2018/11/22 10:20:44.112137 ->[192.168.88.73:39501] connection accepted
INFO: 2018/11/22 10:20:44.112373 ->[192.168.88.109:6783] attempting connection
INFO: 2018/11/22 10:20:44.112739 ->[192.168.88.173:6783] attempting connection
INFO: 2018/11/22 10:20:44.113041 ->[192.168.88.133:6783] attempting connection
INFO: 2018/11/22 10:20:44.113323 ->[192.168.88.123:6783] attempting connection
INFO: 2018/11/22 10:20:44.113707 ->[192.168.88.189:6783] attempting connection
INFO: 2018/11/22 10:20:44.113976 ->[192.168.88.172:6783] attempting connection
INFO: 2018/11/22 10:20:44.115007 ->[192.168.88.73:39501|d6:48:62:21:0e:77(tx-g1-204)]: connection shutting down due to error: cannot connect to ourself
INFO: 2018/11/22 10:20:44.115954 ->[192.168.88.73:6783|d6:48:62:21:0e:77(tx-g1-204)]: connection shutting down due to error: cannot connect to ourself
INFO: 2018/11/22 10:20:44.158061 Listening for HTTP control messages on 127.0.0.1:6784
INFO: 2018/11/22 10:20:44.159223 Listening for metrics requests on 0.0.0.0:6782
INFO: 2018/11/22 10:20:44.244145 ->[192.168.88.80:6783|d6:48:62:21:0e:77(tx-g1-204)]: connection shutting down due to error: local "d6:48:62:21:0e:77(tx-g1-204)" and remote "d6:48:62:21:0e:77(tx-g1-192)" peer names collision
INFO: 2018/11/22 10:20:44.251446 ->[192.168.88.173:6783|d6:48:62:21:0e:77(tx-g1-204)]: connection shutting down due to error: local "d6:48:62:21:0e:77(tx-g1-204)" and remote "d6:48:62:21:0e:77(tx-g1-187)" peer names collision
INFO: 2018/11/22 10:20:44.259880 ->[192.168.88.172:6783|d6:48:62:21:0e:77(tx-g1-204)]: connection shutting down due to error: local "d6:48:62:21:0e:77(tx-g1-204)" and remote "d6:48:62:21:0e:77(tx-g1-169)" peer names collision
INFO: 2018/11/22 10:20:44.261005 ->[192.168.88.189:6783|d6:48:62:21:0e:77(tx-g1-204)]: connection shutting down due to error: local "d6:48:62:21:0e:77(tx-g1-204)" and remote "d6:48:62:21:0e:77(tx-g1-172)" peer names collision
INFO: 2018/11/22 10:20:44.262175 ->[192.168.88.133:6783|d6:48:62:21:0e:77(tx-g1-204)]: connection shutting down due to error: local "d6:48:62:21:0e:77(tx-g1-204)" and remote "d6:48:62:21:0e:77(tx-g1-206)" peer names collision
INFO: 2018/11/22 10:20:44.339492 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection ready; using protocol version 2
INFO: 2018/11/22 10:20:44.339853 overlay_switch ->[7a:99:6e:57:00:20(tx-g1-209)] using fastdp
INFO: 2018/11/22 10:20:44.340027 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection added (new peer)
INFO: 2018/11/22 10:20:44.345293 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection shutting down due to error: read tcp4 192.168.88.73:41475->192.168.88.123:6783: read: connection reset by peer
INFO: 2018/11/22 10:20:44.345560 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection deleted
INFO: 2018/11/22 10:20:44.345670 Removed unreachable peer 7a:99:6e:57:00:20(tx-g1-209)
INFO: 2018/11/22 10:20:44.993330 [kube-peers] Added myself to peer list &{[{7a:99:6e:57:00:20 tx-g1-209} {d6:48:62:21:0e:77 tx-g1-204}]}
DEBU: 2018/11/22 10:20:45.064310 [kube-peers] Nodes that have disappeared: map[]
10.36.0.0
192.168.88.172
192.168.88.189
192.168.88.173
192.168.88.80
192.168.88.109
192.168.88.180
192.168.88.73
192.168.88.133
192.168.88.123
DEBU: 2018/11/22 10:20:45.935336 registering for updates for node delete events
INFO: 2018/11/22 10:20:46.400945 ->[192.168.88.123:6783] attempting connection
INFO: 2018/11/22 10:20:46.421560 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection ready; using protocol version 2
INFO: 2018/11/22 10:20:46.421857 overlay_switch ->[7a:99:6e:57:00:20(tx-g1-209)] using fastdp
INFO: 2018/11/22 10:20:46.421991 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection added (new peer)
INFO: 2018/11/22 10:20:46.424036 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection shutting down due to error: write tcp4 192.168.88.73:57443->192.168.88.123:6783: write: connection reset by peer
INFO: 2018/11/22 10:20:46.424382 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection deleted
INFO: 2018/11/22 10:20:46.424513 Removed unreachable peer 7a:99:6e:57:00:20(tx-g1-209)
INFO: 2018/11/22 10:20:47.261932 ->[192.168.88.180:6783] error during connection attempt: dial tcp4 :0->192.168.88.180:6783: connect: no route to host
INFO: 2018/11/22 10:20:48.354767 ->[192.168.88.180:6783] attempting connection
INFO: 2018/11/22 10:20:48.982527 ->[192.168.88.123:6783] attempting connection
INFO: 2018/11/22 10:20:48.992886 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection ready; using protocol version 2
INFO: 2018/11/22 10:20:48.993134 overlay_switch ->[7a:99:6e:57:00:20(tx-g1-209)] using fastdp
INFO: 2018/11/22 10:20:48.993354 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection added (new peer)
INFO: 2018/11/22 10:20:48.997918 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection shutting down due to error: read tcp4 192.168.88.73:59151->192.168.88.123:6783: read: connection reset by peer
INFO: 2018/11/22 10:20:48.998266 ->[192.168.88.123:6783|7a:99:6e:57:00:20(tx-g1-209)]: connection deleted
INFO: 2018/11/22 10:20:48.998439 Removed unreachable peer 7a:99:6e:57:00:20(tx-g1-209)
INFO: 2018/11/22 10:20:50.372397 ->[192.168.88.180:6783] error during connection attempt: dial tcp4 :0->192.168.88.180:6783: connect: no route to host
INFO: 2018/11/
Anything else we need to know?
Using Raspberry Pi 3B+. While searching for a solution, I found #3310 (comment), which suggests deleting the file /var/lib/weave/weave-netdata.db. That did resolve the issue after a reboot, but the same issue comes back after some time. Following a few comments in #3314, I also upgraded the Raspberry Pi kernel and then installed Kubernetes and Weave, but the issue remains the same. The same thread also discusses disabling fastdp, but I have not been able to disable it. I would also prefer not to, as I have read that running without it is somewhat slower and uses more resources.
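The "peer names collision" lines in the log show several different hostnames announcing the same peer name (d6:48:62:21:0e:77). A minimal sketch for pulling that out of a saved log is below. The function name `find_collisions` and the `SAMPLE` constant are illustrative and not part of Weave; the regular expression assumes the message format seen in the two log lines copied from above.

```python
import re
from collections import defaultdict

# Matches the Weave router message seen in the log above:
#   local "<name>(<nickname>)" and remote "<name>(<nickname>)" peer names collision
# The pattern is an assumption based on those lines only.
COLLISION = re.compile(
    r'local "(?P<lname>[0-9a-f:]+)\((?P<lnick>[^)]+)\)" and '
    r'remote "(?P<rname>[0-9a-f:]+)\((?P<rnick>[^)]+)\)" peer names collision'
)


def find_collisions(log_text):
    """Map each colliding peer name to the sorted nicknames that claim it."""
    claims = defaultdict(set)
    for line in log_text.splitlines():
        match = COLLISION.search(line)
        if match:
            claims[match["lname"]].add(match["lnick"])
            claims[match["rname"]].add(match["rnick"])
    return {name: sorted(nicks) for name, nicks in claims.items()}


# Two lines copied verbatim from the log in this report.
SAMPLE = (
    'INFO: 2018/11/22 10:20:44.244145 ->[192.168.88.80:6783|d6:48:62:21:0e:77(tx-g1-204)]: '
    'connection shutting down due to error: local "d6:48:62:21:0e:77(tx-g1-204)" and '
    'remote "d6:48:62:21:0e:77(tx-g1-192)" peer names collision\n'
    'INFO: 2018/11/22 10:20:44.251446 ->[192.168.88.173:6783|d6:48:62:21:0e:77(tx-g1-204)]: '
    'connection shutting down due to error: local "d6:48:62:21:0e:77(tx-g1-204)" and '
    'remote "d6:48:62:21:0e:77(tx-g1-187)" peer names collision\n'
)

if __name__ == "__main__":
    for name, nicks in find_collisions(SAMPLE).items():
        print(name, "claimed by", ", ".join(nicks))
```

Run against the full log, this would list every hostname that shares a peer name with the local node, which makes it easier to see which machines need a closer look.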