Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Custom dataset in the DW for capacity reporting #10

Open
brandubh opened this issue Jun 4, 2015 · 0 comments
Open

Custom dataset in the DW for capacity reporting #10

brandubh opened this issue Jun 4, 2015 · 0 comments

Comments

@brandubh
Copy link
Collaborator

brandubh commented Jun 4, 2015

Custom dataset in the DW for capacity reporting (tough one and time consuming)
[RB] what do you mean by custom dataset?
[DG] this is a long story :). When data is written into the data warehouse it either is added to a stanadrad dataset (alert, events, state, performance, alerts) or to a custom defined dataset (we have a few of them for Exchange, DPM and so forth). A dataset defines the data schema, aggregation and grooming model. Basically you need them when the data schema doesn’t fit the standard ones, or when you need a different aggregation model. Let’s say for example that we collect CPU usage for chargeback, we would need a sum of all the usage for a VM in a given period of time. Standard aggregations compute only avg, min, max, stddev while we need a sum for the period. Or let’s say we need some percentile for a measure, same story we need a custom dataset. This will take days to develop and tune and before that we should plan which measures we need and how we want them computed. Tough one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant