-
Notifications
You must be signed in to change notification settings - Fork 1
gpcdr kernel module
Cray's gpcdr kernel module is the source of the aries_linkstatus sampler metrics. It is also the source of Cray Gemini traffic and stall metrics in the cray_gemini_r_sampler. While it can be the source of the Cray Aries traffic and stall metrics in the cray_aries_r_sampler, it is recommended that you turn off these metrics in that sampler and instead get the traffic and stall metrics from the aries_nic_mmr and aries_rtr_mmr_samplers instead, which do not use gpcdr, but rather use the gpcd ioctl interface to read the counter data more efficiently (there is no way to get the linkstatus metrics via the gpcd interface, so gpcdr must be used for those metrics).
Install the following rpms (as of this writing, they may be in the smw rpms, but really need to be installed on the computes):
cray-gni-gpcdr-utils-6.0.25-6.0.5.0_3.28__gd019b74.ari.x86_64.rpm cray-gni-gpcdr-utils-man-6.0.25-6.0.5.0_3.28__gd019b74.ari.x86_64.rpm
One time only, you will need to start the service:
etc/init.d/gpcdr start
This will install the kernel module gpcdr_ari and set up the variable directories. After this, upon reboot the kernel module will automatically be installed and you will not need to run start again.
At this point, you can verify on a compute node that the link metrics are being properly exposed:
nid00028:/sys/devices/virtual/gni/gpcdr0/metricsets # more linksendstatus/metrics timestamp 1541537913199 ms sendlinkstatus:000 3 lanes sendlinkstatus:001 0 lanes sendlinkstatus:002 0 lanes sendlinkstatus:003 0 lanes sendlinkstatus:004 0 lanes sendlinkstatus:005 0 lanes sendlinkstatus:006 0 lanes ... # nid00028:/sys/devices/virtual/gni/gpcdr0/metricsets # more linkrecvstatus/metrics timestamp 1541537920797 ms recvlinkstatus:000 3 lanes recvlinkstatus:001 0 lanes recvlinkstatus:002 0 lanes recvlinkstatus:003 0 lanes recvlinkstatus:004 0 lanes recvlinkstatus:005 0 lanes recvlinkstatus:006 0 lanes ...
- Home
- Search
- Feature Overview
- LDMS Data Facilitates Analysis
- Contributing patches
- User Group Meeting Notes - BiWeekly!
- Publications
- News - now in Discussions
- Mailing Lists
- Help
Tutorials are available at the conference websites
- Coming soon!
- Testing Overview
- Test Plans & Documentation: ldms-test
- Man pages currently not posted, but they are available in the source and build
V3 has been deprecated and will be removed soon
- Configuring
- Configuration Considerations
- Running