Skip to content

Latest commit

 

History

History
 
 

rdf-validation-using-shacl

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

SHACL Validation

Scenario

These reference examples show how RDF instances of the IDS Information Model can be validated with SHACL either via CLI or using existing programming libraries.

Contents

The cli subdirectory demonstrates the SHACL validation using CLI tools. Note that this validation approach, with the tools presented here, can only be done after performing the steps described in Preparing the SHACL Shapes and Preparing the RDF instance either manually or using other tools beforehand.

The java subdirectory contains a code example of a small SHACL validator implementation, based on the Apache Jena library for Java. The example program provides programmatic functionalities to do most of the steps described in Validating RDF instances using SHACL automatically.

Below, the basic functionality of SHACL as well as its intended usage is described.


Why SHACL ?

The IDS Information Model is a domain-agnostic RDF/OWL ontology and an essential agreement shared by the participants and components of the International Data Spaces (IDS). Participants of the IDS use the RDF vocabulary provided by the Information Model as their common language within the IDS. To ensure the correct usage and understanding of the vocabulary, validation structures are provided in the W3C Shapes Constraint Language (SHACL). These so-called SHACL shape graphs can be used to validate self-generated RDF statements against the Information Model and check if, e.g.,:

  • a Connector's self-description is valid
  • an ids:Resource is described using the right metadata terms
  • an HTTP multipart message exchanged between IDS components provides the necessary information

The SHACL shapes can be found in the testing subdirectory of the IDS Information Model.

The tools and methods provided here demonstrate the validation of JSON-LD RDF statements since JSON-LD is currently the most widely used RDF serialization in the IDS.

Basic Functionality of SHACL

SHACL shapes can formulate constraints on IDS Information Model classes and properties. For example, the following Connector SHACL shape states that a Connector must have exactly one property called ids:securityProfile, which additionally refers to an instance of the class ids:SecurityProfile. If this constraint is not satisfied, it is treated as a violation. Using the tools and frameworks described below, one can validate an RDF instance of the ids:Connector class – in other words: a Connector's self-description – against this shape. The W3C documentation of SHACL provides additional information on the functionality of SHACL.

shapes:ConnectorShape
  a sh:NodeShape ;
  sh:targetClass ids:Connector ;
  sh:property [
    a sh:PropertyShape ;
    sh:path ids:securityProfile ;
    sh:class ids:SecurityProfile ;
    sh:minCount 1 ;
    sh:maxCount 1 ;
    sh:severity sh:Violation ;
  ] .

Validating RDF instances using SHACL

Before an RDF instance (also called data graph) can be validated against SHACL shapes, two main conditions must be fulfilled.

1. Preparing the SHACL Shapes

To fully validate a data graph against SHACL, one needs to get all relevant SHACL shapes for that particular instance. The provided SHACL shape files are class-specific, therefore each class in the IDS Information Model has a corresponding file with SHACL shapes. Class-specific SHACL files leads to a better overall structure and enables easy expansion in the future.

To validate, e.g., a Connector self-description, one creates the corresponding SHACL shapes for that RDF instance. In this example, one would have to retrieve the SHACL shapes for the ids:Connector class, its superclasses as well as all classes that are in relation to the Connector class via RDF properties.

To solve this issue, we recommend to concatenate all SHACL shapes in the testing subdirectory of the respository into a single shapes graph or file, respectively.

2. Preparing the RDF instance

In addition to the preparation of the SHACL shapes graph, the RDF instance may need some additional semantic informationen such as rdfs:subClassOf class inheritance informations. For example, the ids:Connector class is a subclass of the ids:ManagedEntity class. To enable SHACL to validate those properties, which the ids:Connector RDF instance inherits from its ids:ManagedEntity superclass, the correct axioms must be added to the data graph. This is due to the fact that SHACL shapes do not cover these kind of semantics themselves.

A simple solution is to append the whole IDS Information Model ontology to the RDF instance. This ensures that the data graph contains all necessary relational information. In practice, this approach has been sufficiently efficient so far.

3. Run validation

After performing the above-mentioned steps, the RDF instance can be validated against the SHACL shape using a SHACL processor such as those mentioned initially.