-
Notifications
You must be signed in to change notification settings - Fork 1
2022.05.09
CUGS presentation
Fallout: A Monitoring Infrastructure Supporting Informed System Acceptance
Putting up an infrastructure that will make the monitoring of the stand-up process easier and minimize the time spent during the testing and stand-up phase during factory and on-the-floor acceptance testing
Used to find outlier components: network links, processor thermals, memory utilization, CPU utilization, or any other components you have samplers for.
Constraints of standup and factory testing: Limited access to repo satisfying external software dependencies Limited ability to modify boot images Limited/missing access to shared/remote storage Vendor wanting minimal external influences
Utilizing Google graphs and Grafana to identify outlier behaviors and rule-violating behaviors. Uses similar aggregator and sampler setup as is usual with LDMS.
Link to presentation:
- Home
- Search
- Feature Overview
- LDMS Data Facilitates Analysis
- Contributing patches
- User Group Meeting Notes - BiWeekly!
- Publications
- News - now in Discussions
- Mailing Lists
- Help
Tutorials are available at the conference websites
- Coming soon!
- Testing Overview
- Test Plans & Documentation: ldms-test
- Man pages currently not posted, but they are available in the source and build
V3 has been deprecated and will be removed soon
- Configuring
- Configuration Considerations
- Running