-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Streaming Aggregators to realise data summarisation on data streams stored in a Solid environment #84
Comments
A few papers that I came across might be relevant to sliding windows aggregations:
The author, Jonas Traub, developed the aggregate stream slicing in that specific order |
Please provide a status update about this challenge. Every ongoing challenge needs at least one status update every 2 weeks. Thanks! |
Aggregation of the data streams generated from a single LDES in LDP solid pod works. Working towards aggregation for multiple different pods. |
Please provide a status update about this challenge. Every ongoing challenge needs at least one status update every 2 weeks. Thanks! |
Currently defining an ontology and using websockets to publish the results from the aggregation. |
Please provide a status update about this challenge. Every ongoing challenge needs at least one status update every 2 weeks. Thanks! |
A repository for the demo of the aggregator is avaible at https://github.com/argahsuknesib/ssa-demo/ |
Not all acceptance criteria are met at the moment. |
Please provide a status update about this challenge. Every ongoing challenge needs at least one status update every 2 weeks. Thanks! |
Currently, the testing of the aggregator system is not done. In the next step, testing / benchmarking will be done. The challenge will be closed, with the results from those benchmarks. |
Please provide a status update about this challenge. Every ongoing challenge needs at least one status update every 2 weeks. Thanks! |
Currently preparing a testplan / test environment for the aggregator's testing. |
Please provide a status update about this challenge. Every ongoing challenge needs at least one status update every 2 weeks. Thanks! |
Preparing the test setup and writing scripts for testing. The evaluation repository (which will be updated, later with finished results) is available here https://github.com/argahsuknesib/solid-stream-aggregator-evaluation |
Please provide a status update about this challenge. Every ongoing challenge needs at least one status update every 2 weeks. Thanks! |
Pitch
Data streams are becoming omnipresent, however, storing and analysing real-time data streams in a decentralised fashion using solid is still hard to achieve. This is mainly due to the high frequency of changes in the answers to the issued queries on these streams and the temporal validity of the answers.
A first prototype of streaming aggregators is necessary to prepare the answers of a continuous query over streaming data for a client and keep the query results up to date. This eliminated the need for the client to process the whole stream while the aggregator allow the client to retrieve the results instantaneously.
In patient monitoring system, data streams produced by personal vitality sensors and activity trackers are semantically annotated and stored in the data pods. Healthcare providers are interested in summaries of the activity of a single patient our summaries across multiple patients.
Streaming aggregators are required to realise an improved data summarisation and instantaneous results as the data to be analysed in a pull-based fashion is extremely large due to the continuous dimension of the data streams.
The DAHCC Dataset will be used as the data stored for each patient in a solid pod to realise the aggregators.
Desired Solution
A first proof of concept streaming aggregator which runs as a service, with whom a client application can interact. The client application can specify the query that needs to be continuously evaluated on one or multiple data streams stored in Solid Pods. The solution is required to,
Use Case
The dataset has sensor values from multiple patients. To monitor the patient's location, we use the sensors which detects the presence of the person in the house. The person detection sensor is employed in the 3 halls, kitchen and the bedroom in the DAHCC dataset. We will aggregate each patient's location in a particular window, as well as the location of all the patients. This allows to compute a summary of the activity of each patient, which is a useful insight for healthcare providers.
Acceptance Criteria
A demo resulting from the solution should be able to,
Assumptions
Compared to (#24), we focus on the streamming and windowing aspect for aggregation of data.
Scenarios
This is part of a larger scenario
The text was updated successfully, but these errors were encountered: