Intermediate result sharing among data streams for aggregation #104

argahsuknesib · 2023-04-20T08:26:57Z

Pitch

This challenge is an extension of the challenge on query containment. Upon the completion of the query containment challenge, we will have an algorithm to determine if the registered query is contained in an already registered query. To improve the scalability of the solid stream aggregator, it is crucial to share resources between the streams. The queries are similar but different queries over the data.

Desired solution

The desired solution is an algorithm / approach to use the similarities in the queries over multiple streams. In streaming scenarios, the data stream is chopped up into a particular window for processing over. Therefore, the common data over which the queries differs on the size of the window over the two queries. The approach of sharing should be able to share resources over the following scenarios:

Window	Queries
Same	Different
Different	Same
Different	Different

Acceptance criteria

The acceptance criteria for this challenge is to implement the sharing of resources between the streams in the solid stream aggregator and show the improvement in query execution time when comparing the execution time of the queries with and without sharing of resources. The data set used for the evaluation is the DAHCC dataset.

Pointers

As the topic of aggregation is still a novel research topic, a number of assumptions were taken:

Long term server-side authenticated sessions have been solved and therefore the authentication part of this challenge is not taken into account.

Scenarios

The challenge is part of a larger scenario on Aggregated view on sensitive personal health data streams. The scenario is described in issue 16

pheyvaer · 2023-04-20T08:43:06Z

@pbonte Once you are doing with the review of the challenge, can you assign it to me? Thanks!

argahsuknesib added challenge technical problem applied to a use case proposal: pending ❓ labels Apr 20, 2023

argahsuknesib assigned RubenVerborgh and pbonte Apr 20, 2023

pheyvaer assigned pbonte and pheyvaer and unassigned RubenVerborgh, pbonte and pheyvaer Apr 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Intermediate result sharing among data streams for aggregation #104

Intermediate result sharing among data streams for aggregation #104

argahsuknesib commented Apr 20, 2023 •

edited

Loading

pheyvaer commented Apr 20, 2023

Intermediate result sharing among data streams for aggregation #104

Intermediate result sharing among data streams for aggregation #104

Comments

argahsuknesib commented Apr 20, 2023 • edited Loading

Pitch

Desired solution

Acceptance criteria

Pointers

Scenarios

pheyvaer commented Apr 20, 2023

argahsuknesib commented Apr 20, 2023 •

edited

Loading