These reference examples show how RDF instances of the IDS Information Model can be validated with SHACL either via CLI or using existing programming libraries.
The cli subdirectory demonstrates the SHACL validation using CLI tools. Note that this validation approach, with the tools presented here, can only be done after performing the steps described in Preparing the SHACL Shapes and Preparing the RDF instance either manually or using other tools beforehand.
The java subdirectory contains a code example of a small SHACL validator implementation, based on the Apache Jena library for Java. The example program provides programmatic functionalities to do most of the steps described in Validating RDF instances using SHACL automatically.
Below, the basic functionality of SHACL as well as its intended usage is described.
The IDS Information Model is a domain-agnostic RDF/OWL ontology and an essential agreement shared by the participants and components of the International Data Spaces (IDS). Participants of the IDS use the RDF vocabulary provided by the Information Model as their common language within the IDS. To ensure the correct usage and understanding of the vocabulary, validation structures are provided in the W3C Shapes Constraint Language (SHACL). These so-called SHACL shape graphs can be used to validate self-generated RDF statements against the Information Model and check if, e.g.,:
- a Connector's self-description is valid
- an
ids:Resource
is described using the right metadata terms - an HTTP multipart message exchanged between IDS components provides the necessary information
The SHACL shapes can be found in the testing subdirectory of the IDS Information Model.
The tools and methods provided here demonstrate the validation of JSON-LD RDF statements since JSON-LD is currently the most widely used RDF serialization in the IDS.
SHACL shapes can formulate constraints on IDS Information Model classes and properties. For example, the following Connector SHACL shape states that a Connector must have exactly one property called ids:securityProfile
, which additionally refers to an instance of the class ids:SecurityProfile
. If this constraint is not satisfied, it is treated as a violation. Using the tools and frameworks described below, one can validate an RDF instance of the ids:Connector
class – in other words: a Connector's self-description – against this shape. The W3C documentation of SHACL provides additional information on the functionality of SHACL.
shapes:ConnectorShape
a sh:NodeShape ;
sh:targetClass ids:Connector ;
sh:property [
a sh:PropertyShape ;
sh:path ids:securityProfile ;
sh:class ids:SecurityProfile ;
sh:minCount 1 ;
sh:maxCount 1 ;
sh:severity sh:Violation ;
] .
Before an RDF instance (also called data graph) can be validated against SHACL shapes, two main conditions must be fulfilled.
To fully validate a data graph against SHACL, one needs to get all relevant SHACL shapes for that particular instance. The provided SHACL shape files are class-specific, therefore each class in the IDS Information Model has a corresponding file with SHACL shapes. Class-specific SHACL files leads to a better overall structure and enables easy expansion in the future.
To validate, e.g., a Connector self-description, one creates the corresponding SHACL shapes for that RDF instance. In this example, one would have to retrieve the SHACL shapes for the ids:Connector
class, its superclasses as well as all classes that are in relation to the Connector class via RDF properties.
To solve this issue, we recommend to concatenate all SHACL shapes in the testing subdirectory of the respository into a single shapes graph or file, respectively.
In addition to the preparation of the SHACL shapes graph, the RDF instance may need some additional semantic informationen such as rdfs:subClassOf
class inheritance informations. For example, the ids:Connector
class is a subclass of the ids:ManagedEntity
class. To enable SHACL to validate those properties, which the ids:Connector
RDF instance inherits from its ids:ManagedEntity
superclass, the correct axioms must be added to the data graph. This is due to the fact that SHACL shapes do not cover these kind of semantics themselves.
A simple solution is to append the whole IDS Information Model ontology to the RDF instance. This ensures that the data graph contains all necessary relational information. In practice, this approach has been sufficiently efficient so far.
After performing the above-mentioned steps, the RDF instance can be validated against the SHACL shape using a SHACL processor such as those mentioned initially.