Report the volume of etcd writes via a diagnostic #14604

smarterclayton · 2017-06-12T20:52:23Z

New EtcdWriteVolume diagnostic measures the number of writes in a time
period to determine where significant write volume is going.

[test]

@derekwaynecarr @eparis

Will make debugging this easier next time:

$ ETCD_WRITE_VOLUME_DURATION=10s oadm diagnostics EtcdWriteVolume --master-config=openshift.local.config/master/       master-config.yaml
[Note] Determining if client configuration exists for client/cluster diagnostics
debug: Reading client config at /Users/clayton/projects/origin/src/github.com/openshift/origin/openshift.local.        config/master/admin.kubeconfig
Info:  Successfully read a client config file at '/Users/clayton/projects/origin/src/github.com/openshift/origin/      openshift.local.config/master/admin.kubeconfig'

[Note] Running diagnostic: EtcdWriteVolume
       Description: Check the volume of writes against etcd and classify them by operation and key for 10s

Info:  Measured 0.2 writes/sec
       /                                                                          2 100.0%
       /v3:PUT                                                                    2 100.0%
       /v3:PUT/kubernetes.io                                                      2 100.0%
       /v3:PUT/kubernetes.io/events                                               1  50.0%
       /v3:PUT/kubernetes.io/events/default                                       1  50.0%
       /v3:PUT/kubernetes.io/events/default/datadir-mysql-0.14c770c1577b8f64      1  50.0%
       /v3:PUT/kubernetes.io/masterleases                                         1  50.0%
       /v3:PUT/kubernetes.io/masterleases/10.192.209.221                          1  50.0%

[Note] Summary of diagnostics execution (version v3.6.0-alpha.2+021fabc-135-dirty):
[Note] Completed with no errors or warnings seen.

smarterclayton · 2017-06-12T20:57:14Z

@sosiouxme i added the "default skip" behavior I asked about.

eparis · 2017-06-13T02:58:05Z

My version worked just fine, geez!
timeout 5m etcdctl --endpoints=https://ip-1-2-3-4.ec2.internal:2379,https://ip-1-2-3-4.ec2.internal:2379,https://ip-1-2-3-5.ec2.internal:2379 --ca-file=/etc/origin/master/master.etcd-ca.crt --cert-file=/etc/origin/master/master.etcd-client.crt --key-file=/etc/origin/master/master.etcd-client.key watch -r -f / | grep '.' | grep -v -i '"kind":"event"' | sort | uniq -c | sort -n

What makes yours so great?

smarterclayton · 2017-06-13T03:29:29Z

It's Eric in a box and I don't have to explain it

eparis · 2017-06-13T12:49:06Z

I do actually dislike inscrutable ENV vars. It doesn't look easy to make the timer a flag, but if you can find a way, it would make it a WHOLE lot more discoverable.

smarterclayton · 2017-06-13T15:50:03Z

Unfortunately we don't have any infrastructure for that today in diagnostics. I went back and forth - the reason i went with env var is that 90% of the time the default is enough. However, if you wanted to get a shorter run (test cases) or longer run (less bursty environments) then you'd have to recompile. So env var is more for the skilled user in that case. I agree that it's not ideal.

smarterclayton · 2017-06-14T13:39:26Z

Any other comments? I would like to have this tool available, and agree adding args in the future to this would be good.

eparis · 2017-06-14T13:59:14Z

[merge]
[test]

openshift-bot · 2017-06-14T14:22:08Z

continuous-integration/openshift-jenkins/merge Waiting: You are in the build queue at position: 18

openshift-bot · 2017-06-14T14:22:09Z

Evaluated for origin merge up to f74b38b

smarterclayton · 2017-06-15T01:35:15Z

[severity:bug]

New EtcdWriteVolume diagnostic measures the number of writes in a time period to determine where significant write volume is going.

openshift-bot · 2017-06-15T03:30:08Z

Evaluated for origin test up to 68e1ede

openshift-bot · 2017-06-15T05:43:15Z

continuous-integration/openshift-jenkins/test SUCCESS (https://ci.openshift.redhat.com/jenkins/job/test_pull_request_origin/2249/) (Base Commit: 76c0850)

smarterclayton added this to the 3.6.0 milestone Jun 12, 2017

smarterclayton mentioned this pull request Jun 14, 2017

Support optional args in diagnostics #14640

Closed

smarterclayton closed this Jun 15, 2017

smarterclayton reopened this Jun 15, 2017

Report the volume of etcd writes via a diagnostic

68e1ede

New EtcdWriteVolume diagnostic measures the number of writes in a time period to determine where significant write volume is going.

smarterclayton force-pushed the etcd_watch branch from f74b38b to 68e1ede Compare June 15, 2017 03:23

smarterclayton merged commit 66adcf8 into openshift:master Jun 15, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Report the volume of etcd writes via a diagnostic #14604

Report the volume of etcd writes via a diagnostic #14604

smarterclayton commented Jun 12, 2017 •

edited

Loading

smarterclayton commented Jun 12, 2017

eparis commented Jun 13, 2017

smarterclayton commented Jun 13, 2017 via email

eparis commented Jun 13, 2017

smarterclayton commented Jun 13, 2017

smarterclayton commented Jun 14, 2017

eparis commented Jun 14, 2017 •

edited

Loading

openshift-bot commented Jun 14, 2017 •

edited

Loading

openshift-bot commented Jun 14, 2017

smarterclayton commented Jun 15, 2017

openshift-bot commented Jun 15, 2017

openshift-bot commented Jun 15, 2017

Report the volume of etcd writes via a diagnostic #14604

Report the volume of etcd writes via a diagnostic #14604

Conversation

smarterclayton commented Jun 12, 2017 • edited Loading

smarterclayton commented Jun 12, 2017

eparis commented Jun 13, 2017

smarterclayton commented Jun 13, 2017 via email

eparis commented Jun 13, 2017

smarterclayton commented Jun 13, 2017

smarterclayton commented Jun 14, 2017

eparis commented Jun 14, 2017 • edited Loading

openshift-bot commented Jun 14, 2017 • edited Loading

openshift-bot commented Jun 14, 2017

smarterclayton commented Jun 15, 2017

openshift-bot commented Jun 15, 2017

openshift-bot commented Jun 15, 2017

smarterclayton commented Jun 12, 2017 •

edited

Loading

eparis commented Jun 14, 2017 •

edited

Loading

openshift-bot commented Jun 14, 2017 •

edited

Loading