-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
df stats #2
Comments
@fredtriplefred I'm sure it wouldn't be too difficult. We have traditional nagios monitoring which has covered that aspect for us so it never came up. This is definetly something that should be included. I'll try to dig into that soon. |
@fredtriplefred I have released v1.8.0 with support for node_filesystem_X metrics. Unfortunately we are in a change freeze at work so I can't test it out too much, but what I have tested seems to be file. I would appreciate it if you could try to deploy this new version and see if all the numbers add up. |
Thanks Thors it works perfectly 👍 I have integrated it to Grafana and alert system Another request if possible. Thanks and regards |
@fredtriplefred No, sorry, I don’t have access to anything below 7.1. If you install the packages in the readme you should be able to build it your self. It would actually be an interesting experiment to see if there are any issues. I have some graphs I could share soon. Take the info with a grain of salt, I may have misinterpreted some of the metrics. The cpu stats are especially iffy. But, they do reflect the trends :) |
@fredtriplefred I uploaded the dashboard that I use most frequently. It just went through some changes so I hope all the calculations match up. Give it a whirl and create issues if you find any. This was exported from grafana v6.5.1. |
Hi thors !
|
It seems Serial Number for this AIX bad formatted and be the source of error : HELP node_load1 1m load average.TYPE node_load1 gaugenode_load1{machine_serial="/?? ^B#0 ^DZp",lpar="saveprod",group_id="32773"} 2.99446 HELP node_load5 5m load average.TYPE node_load5 gaugenode_load5{machine_serial="/?? ^B#0 ^DZp",lpar="saveprod",group_id="32773"} 4.5938 Which is the command used to extract it ? Regards |
This is coming from the libperfstat library. I've has issues if the system tools are not being used, for example if /opt/freeware/bin is ahead of /use/bin. See if you can run with bog-standard PATH and LIBPATH. |
yes surely in relation with context environment but it works (with just an error on diskadapter) if executed manually and not as a service : so may be rather in context around the service ? Good week-end |
@fredtriplefred Could you give v1.10.0 a go? I'm trying to set PATH and LIBPATH to some sane values on startup to see if that helps. It works correctly if I try to start it up using a PATH string that had issues previously, so hopefully it just works now. |
Hello Thorhs |
@fredtriplefred Hmmm... that is odd. Unfortunately I don't have any control over the libperfstat, and how it finds the machine serial number. By running a trace on the process, it seems like this is command is being executed by the libperfstat library to get the machine serial number:
I have set the path to system only directories, and emptied the LIBPATH so there should be no outside influence. What i find most peculiar is that the command works on the command line, but fails in SRC. If you run the above command, what is the output? On my end, I get:
What version of AIX is this LPAR running? I could add a flag to manually set the machine_serial, if that would be an acceptable solution, or even read it from a file in /etc/sysconfig. |
yes same results as you, it works in command line : Which cpu_pool_id references ? Regards |
Ok. The cpupool_id is what is returned from the perfstat_partition_config, I have not fully investigated it this should be the shared processor pool the LPAR is in. If you are not using shared processor pools, this is probably 0 for all LPARs. I'm hoping I can use this to graph up the total CPU used per pool, as well as the free capacity. |
there is only information about volume group and not capacity filesystem :
TYPE aix_disk_free gauge
aix_disk_free{disk="hdisk1",vgname="ngamsoft",machine_serial="21475DV",lpar="GAMAY",group_id="32772"} 49664
aix_disk_free{disk="hdisk0",vgname="rootvg",machine_serial="21475DV",lpar="GAMAY",group_id="32772"} 61824
Is it possible to add "df" information or other to follow capacity of space disk ?
Regards
Frederic
Originally posted by @fredtriplefred in #1 (comment)
The text was updated successfully, but these errors were encountered: