INetCEP is a research project that enables in-network processing of CEP operators on the data plane of an Information-centric networking (ICN) architecture. Particularly, this project provides a means to process CEP queries inside the network (ICN routers) for delivering the complex events in an efficient manner. By processing events in the data plane, we can fulfill the demands of latency-critical applications. In our publication [2], we show that CEP queries can be processed in the order of a few microseconds using the power of programmable networks, which is significantly faster than the SOA systems such as Flink. Specifically, this project contributes the following:
- Unified communication model to enable continuous processing of data streams in Information-centric networking architecture.
- support both push- and pull-based traffic
- handling on the new packets in ICN architecture
- A meta query language to express CEP queries in the ICN data plane.
- support for standard CEP operators such as windows, joins, filters and aggregators
- operator abstraction to make it highly extensible
- CEP Query Engine that parses, places and executes the queries in the data plane of ICN.
- concurrent execution of queries and query reuse
- centralized and decentralized operator placement algorithms
To run INetCCEP simply do bash publishRemotely.sh all <query> 1 Centralized scala <time_in_seconds> 1200
Here, 1200
indicates the time in seconds for emulation, in this example 20 minutes.
- Copy your public key to the remote machine using
copyKeys.sh
script - Install dependencies and copy the binaries by
bash publishRemotely.sh install
- Set environment variables using
setEnvironment.sh
script
There are two options to start INetCEP system. Basically Option 1 automates the entire process of option 2. Note: Follow only one of the below options.
To start INetCEP on the cluster of resources (see VMS.cfg), please follow the steps below:
Setup on VM compute resources or use CORE emulation as described here
- create your personalized VM configiration file
VMS.cfg
(refer VMS_MAKI1.cfg and VMS_MAKI2.cfg) for your user and IP addresses. - auto generate node data for the respective machines with the following topology:
(3) -- (7)
|
(1) -- (2) -- (5)
| |
(4) (6)
using python generate_node_info.py
. Note: this script uses VMS.cfg as input for IP address information. Ports used are 9001, 9001,..,9001+n. (n: number of nodes).
- generate the Rspec file to request resources on GENI using
python generate_rspec.py <number of nodes> <out dir>
- upload the rspec, select site and reserve resources.
- download the manifest rspec file with IP address and port information.
- auto generate VMS.cfg file (refer VMS_GENI.cfg) and node data using
python manifest_to_config.py <manifest.xml>
After the above setup execute the publishRemotely.sh script that publishes the application on either of the above resources based on the VMS.cfg file. Refer publish_scripts/publishRemotely.sh
script. Usage:
bash publish_scripts/publishRemotely.sh all "QueryPlacement" 20
Here QueryPlacement is an element of set {"QueryCentralFixed", "QueryCentralLocalNS", "QueryCentralRemNS", "QueryDecentral", "QueryDecentralFixed", "QueryRandom", "QueryRandomLocalNS", "QueryRandomRemNS"} that are currently available placement services
Starting up nodes requires the following:
- Startup of the ccn-lite relay
- Linking a nfn-scala compute server with that relay
- Starting the node state update service that manages the node state and the query service
-
Starting ccn-lite relay on a network node (X) requires us to execute the startup script for that specific node. Let's take nodeID 28 as our sample node for the rest of this tutorial.
cd INetCEP/VM-Startup-Scripts/VM28/ ./startNodes.sh
Inside the VM28 folder, the startNodes script resides. This script initializes a node, sets up the required faces and adds those faces to the node. For each node in the network, the face configuration has been done to reflect the network topology presented in the report.
-
Starting the Compute Server is similar to a node. Make sure that you are in the same /VM28 folder and then execute the following script.
cd INetCEP/VM-Startup-Scripts/VM28/ ./startCS.sh
-
Starting the network discovery and query server is handled by one script that manages all sub-system tasks. These tasks include:
- Running the query server, that manages incoming consumer queries (stored in the query store) and re-evaluates them after a pre-defined time interval.
- Updating node state on local nodes. The node state is sent to the ccn-lite node that then sends it to the compute server. The compute server than adds this node state in its local content store.
To start decentralized query processing in the network, we have to pass the Query Type (Centralized, Random, Decentralized) and the interval of re-evaluation for the query store.
cd INetCEP/VM-Startup-Scripts/VM28/28
./startRemoteAny.sh QueryDecentral 20
The startRemoteAny.sh script takes the query type (QueryDecentral -> basically the placement algorithm) that the system has to manage and the interval (20) that the query service has to consider for re-evaluations.
In order to carry out query execution, we can access any node in the network and issue the following query. Here, any node in the network can act as a placement coordinator. Therefore, the query can be issued from any node to any node in the network.
$CCNL_HOME/bin/ccn-lite-peek -s ndn2013 -u 10.2.1.28/9001 -w 20 "" "call 8 /node/nodeA/nfn_service_QueryDecentral '1' 'Source' '9001' 'Client1' 'JOIN([name],[FILTER(name,WINDOW(victims,22:18:36.800,22:18:44.001),3=M&4>30,name)],[FILTER(name,WINDOW(survivors,22:18:35.800,11:11:45.000),3=F&4>20,name)],[NULL])' 'Region1' '22:10:11.200'/NFN" | $CCNL_HOME/bin/ccn-lite-pktdump -f 2
The above query issues a ccn-lite-peek interest to nodeID 28 for a decentralized query processing. The complex query is a join that works on two filters. The filters additionally evaluate the window operators on the streams after which the window data is filtered based on column schema represented by (1,2,3,4....n. Here 3=F means check whether the 3rd column contains F, if yes, select it). The result of each operator is fed to its parent operator, which once complete, is returned back to the consumer node.
This project is based on the CCN-lite project forked from here and NFN-scala. Major enhancements are enlisted here
[1] Manisha Luthra, Boris Koldehofe, Jonas Höchst, Patrick Lampe, A.H. Rizvi, Ralf Kundel, and Bernd Freisleben. “INetCEP: In-Network Complex Event Processing for Information-Centric Networking.” In: Proceedings of the 15th ACM/IEEE Symposium on Architectures for Networking and Communications Systems (ANCS). 2019, pp. 1–13. 10.1109/ANCS.2019.8901877 URL (extended version): arxiv.org/pdf/2012.05239.pdf
[2] Manisha Luthra, Johannes Pfannmüller, Boris Koldehofe, Jonas Höchst, Artur Sterz, Rhaban Hark, and Bernd Freisleben. “Efficient Complex Event Processing in Information-centric Networking at the Edge (under submission).” In: (2021), pp. 1–17. URL: arxiv.org/pdf/2012.05070.pdf
This work has been co-funded by the German Research Foundation (DFG) within the Collaborative Research Center (CRC) 1053 -- MAKI
Feel free to contact Manisha Luthra or Boris Koldehofe for any questions.