Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

XXX Log to perfdb.dcol1.delphix.com by default #93

Draft
wants to merge 48 commits into
base: develop
Choose a base branch
from

Conversation

prakashsurya
Copy link
Contributor

No description provided.

brad-lewis and others added 30 commits November 19, 2019 15:08
* Fix backend io collector post 5.x kernel upgrade

* Check for kprobe existence before attaching

* Fix lint 1
port txg dtrace script to ebpf
estat: remove mandatory final positional duration argument
estat: too many arguments output when there are too few
* Fix usage message for zil program

* Fix arg parsing error message
…pends on python3-minimal instead of python-minimal (#67)
…text is missing documentation for arc_prefetch (#70)
…t needs to be fixed so it reports data correctly (#71)
brad-lewis and others added 18 commits December 21, 2021 13:41
…dependency to the performance-diagnostic package (#82)" (#84)

This reverts commit 8cd3f7a.
Initial addition of configuration and control files to enable
performance metric collection using the Telegraf agent.
See also IDEA-2835 : Improving Support Bundle Performance Metrics

Includes:
- Service definition and startup script for "delphix-telegraf"
- Modified version of "estat" adding JSON output via a "-j" option
- A "perf_playbook" wrapper script to enable/disable enhanced collection
- Configuration file sections (combined on startup)
- Simple wrappers to facilitate parsing of "nfs_threads", "zpool iostat -o",
and "zcache stats -a" outputs

The service starts with a "base" set of metrics, but will include Object Storage
metrics when it is detected, and will include Performance Playbook commands
if that has been enabled (manually). The config is reassembled each startup.

File paths intended:

/opt/delphix/server/bin/delphix-telegraf-service
/lib/systemd/system/delphix-telegraf.service
/etc/telegraf/nfs-threads.sh
/opt/delphix/server/bin/perf_playbook
/etc/telegraf/telegraf.base
/etc/telegraf/telegraf.inputs.dose
/etc/telegraf/telegraf.inputs.playbook
/etc/telegraf/zcache-stats.sh
/etc/telegraf/zpool-iostat-o.sh

This configuration records 4 output files (rotated on size) for main metrics,
aggregate statistics (min,max,mean,stddev) and Playbook outputs to enable
independent retention periods.
CP-8403 Adding Telegraf-based metric collection.
GHM-743  transition more platform repository to pre-gitflow branching model
DLPX-82298 Telegraf needs a restart delay for external programs
Added a 30s restart delay to external commads invoked via the "execd" plugin
…lection (#91)

DLPX-82457 Telegraf: add zio-queue and metaslab-alloc to playbook collection
Added entries for these two commands
@prakashsurya prakashsurya force-pushed the dlpx/pr/prakashsurya/aa0cbaa8-0bf0-465b-afb4-9cb388a22c8d branch from 802acf2 to 07c6709 Compare March 28, 2023 19:17
@prakashsurya
Copy link
Contributor Author

git-ab-pre-push -v internal-buildserver -p aws --no-tests is here

@grodr grodr changed the base branch from 6.0/stage to develop April 17, 2023 18:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

9 participants