-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[graph] visualization to indicate where a dataflow is stuck #14
Comments
Any more thoughts here? |
The closest is an open issue in the timely repo for logging progress computation, which (ideally) would allow one to track where there remains outstanding work (usually that incriminates some operators). It is languishing a bit for lack of requirements (it was formed in support of a research project that wanted lots of information, but should we actually aim at minimizing the information to e.g. the frontier of available work?). |
When I think about the points in time that I'd want to use a feature like this, there are two groups of situations that come to mind: 1. a computation that does complete, or at least makes progress, but is behaving unexpectedly(too fast, or too slow) for the current data set. 2. computation that like the author mentions, just gets 'stuck' unexpectedly, which usually is when a larger input set is used vs the development data set. For the first group of cases, it's generally been on a computation that has been through many development iterations, and is in need of a refactor or a wholistic review. Being able to visualize the computational flow and some info about the relative memory and cpu resources for each stage would be both useful and actionable. In my experiences it's often been when an incorrect variable is being used, and the variable names are too close to each other to make it obvious in a code review. For the second case, I'm not sure what would be most helpful. Being able to see something about the available work seems it would be actionable in tracking down the issue, but at the same time being able to see how much of the computation remains(if there are 10 stages, and only the operators for stage 2 and 3 are available, that would be helpful to narrow down). I get the sense that seeing some approximation for what resources would be needed for each of the operators, based on some sample(or profiled) data set, would point out clearly where there is an unexpected order(s) of magnitude difference. Not sure if the above are possible. |
@frankmcsherry:
The text was updated successfully, but these errors were encountered: