Skip to content

Commit

Permalink
improve requirements and english issues
Browse files Browse the repository at this point in the history
  • Loading branch information
egekorkan committed Apr 3, 2024
1 parent 6ec1faa commit 746a8aa
Show file tree
Hide file tree
Showing 2 changed files with 60 additions and 42 deletions.
56 changes: 37 additions & 19 deletions toolchain/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,8 @@ The template style inputs are written mostly once in the beginning and are gener

## Detailed Current Process

The current W3C WoT specification generation process toolchain is depicted as a Business Process Modeling Notation (BPMN) diagram in the figure below. This is a result of years of work that have resulted in different tools that require high manual effort to keep everything synced. The future goal is to switch to an easier-to-maintain toolchain that is usable by people without a deep understanding of the technologies involved.
The current W3C WoT specification generation process toolchain is depicted as a Business Process Modeling Notation (BPMN) diagram in the figure below. This is a result of years of work that have resulted in different tools that require high manual effort to keep everything synced.
The future goal is to switch to an easier-to-maintain toolchain that is usable by people without a deep understanding of the technologies involved.

![Toolchain BPMN](./wot-toolchain-bpmn.png)

Expand All @@ -21,36 +22,53 @@ There are 4 actors involved:
3. **_SPARQL Template Transformation Language Engine_**
4. **_Parser Creation_**

The Semantic Web Expert has the main responsibility to iteratively define the schema of different TD model specifications such as TD core, TM, WoTsec, HCTL or any extensions required. Also, the JSON Schema definition is defined for syntactical validation of JSON-LD data and Shapes Constraint Language (SHACL) shapes for validating RDF graphs. These different tasks are performed in an adhoc manner until agreement is achieved among the experts. In addition, the experts define a set of transformations for the specification generation according to the SPARQL Template Transformation Language (STTL). The templates expressed in STTL describe different rules for transforming a source RDF graph into a result text.

The WoT Toolchain is a shell script (render.sh), which manages the interaction between different Node.js scripts developed to achieve different objectives. It receives the different JSON-LD context files for each vocabulary in the TD model and aggregates them into one. The resulting JSON-LD context is then converted into RDF Turtle representation format. The artifacts provided by the Semantic Web Experts and the RDF graphs are provided as input to the STTL Engine.

The STTL engine takes different RDF graphs: TD model ontologies, SHACL shapes (validation/td-validation.ttl), merged context (context/td-context.ttl), a transformation (a set of templates), and a specific starting node that determines the starting HTML node pointing to the WoT Things Descriptions as a set of command-line arguments. The TD model ontologies and SHACL shapes are loaded and stored in an RDF store for semantic processing and querying. Then, the STTL templates are parsed according to the parser developed by the Parser Creation actor. This is followed by matching the input HTML node with one of the transformation names specified in the template. A template in a transformation may apply other templates or include different utility functions or SPARQL functions. Therefore, the Apply Template sub-process identifies these different cases and applies the rules which includes the following activities: evaluate patterns, variables and expressions in a template, construct the query to execute using a JSON format, and executing the query on the RDF graph. The output is generated HTML textual output format corresponding to the index.html file of the TD specification and the different ontology specifications.

The TD specification include diagrams that are also generated from the same SHACL and merged context sources. The STTL Engine is run once more to generated diagrams in textual form (DOT text files) and then turned into graphics using Graphviz tool. The output is the SVG figures of the TD specification, which is then manually converted into PNG using a desired tool of choice. These PNG files are used to feed the HTML specification.

The Parser Creation process is mainly performed by a developer using a 2-step process. The grammar is developed manually using to context-free grammar according to the semantics of STTL language. Then, Jison tool takes the grammar as input and outputs a JavaScript file capable of parsing the language described by that grammar. The generated script is then used to parse visualization template inputs in the STTL Engine.
The Semantic Web Expert has the main responsibility to iteratively define the schema of different TD model specifications such as TD core, TM, WoTsec, HCTL, or any extensions required.
Also, the JSON Schema definition is defined for syntactical validation of JSON-LD data and Shapes Constraint Language (SHACL) shapes for validating RDF graphs.
These different tasks are performed in an ad-hoc manner until agreement is achieved among the experts.
In addition, the experts define a set of transformations for the specification generation according to the SPARQL Template Transformation Language (STTL).
The templates expressed in STTL describe different rules for transforming a source RDF graph into a result text.

The WoT Toolchain is a shell script (render.sh), which manages the interaction between different Node.js scripts developed to achieve different objectives.
It receives the different JSON-LD context files for each vocabulary in the TD model and aggregates them into one.
The resulting JSON-LD context is then converted into RDF Turtle representation format.
The artifacts provided by the Semantic Web Experts and the RDF graphs are provided as input to the STTL Engine.

The STTL engine takes different RDF graphs: TD model ontologies, SHACL shapes (validation/td-validation.ttl), merged context (context/td-context.ttl), a transformation (a set of templates), and a specific starting node that determines the starting HTML node pointing to the WoT Things Descriptions as a set of command-line arguments.
The TD model ontologies and SHACL shapes are loaded and stored in an RDF store for semantic processing and querying. Then, the STTL templates are parsed according to the parser developed by the Parser Creation actor.
This is followed by matching the input HTML node with one of the transformation names specified in the template.
A template in a transformation may apply other templates or include different utility functions or SPARQL functions. Therefore, the Apply Template sub-process identifies these different cases and applies the rules which include the following activities: evaluate patterns, variables, and expressions in a template, construct the query to execute using a JSON format, and execute the query on the RDF graph.
The output is generated in HTML textual output format corresponding to the index.html file of the TD specification and the different ontology specifications.

The TD specification includes diagrams that are also generated from the same SHACL and merged context sources.
The STTL Engine is run once more to generate diagrams in textual form (DOT text files) and then turned into graphics using Graphviz tool.
The output is the SVG figures of the TD specification, which is then manually converted into PNG using a desired tool of choice.
These PNG files are then fed into the HTML specification.

The Parser Creation process is mainly performed by a developer using a 2-step process.
The grammar is developed manually using context-free grammar according to the semantics of STTL language.
Then, Jison tool takes the grammar as input and outputs a JavaScript file capable of parsing the language described by that grammar.
The generated script is then used to parse visualization template inputs in the STTL Engine.

## Requirements for Toolchain

Before working on the restructuring of the toolchain to make it simpler, we can write the requirements of the toolchain
Before working on the restructuring of the toolchain to make it simpler, we can write the requirements of the toolchain
by listing what kind of inputs we want to work on and what kind of outputs we want to provide to the users (reader, implementer, spec writers).

**Overall Goal:** Change a small number of files (source of truth) but generate as many resources as needed by users.

**Compromise:** Reduce the amount of cross-checking between files by a human expert. In this case, everything is not automatically generated.

## General Requirements for Toolchain
### General Requirements for Toolchain

1. Someone without expertise in our tooling should be able to make contributions. This is especially relevant for binding contributions from experts of the protocol who are not experts of the semantic web or the type of tooling we have. Thus, the learning curve should not be steep at all.
1. Relying on well-known tools would make it easier. Templating engines like [Handlebars](https://handlebarsjs.com/) are well-known. STTL is not well-known and doesn't support all the features we need.
2. More documentation about the entire repository configuration and tooling (Actions, hooks, npm scripts to run at some point etc.)
1. Someone without expertise in our tooling should be able to make contributions. This is especially relevant for binding contributions from experts of the protocol who are not experts on the semantic web or the type of tooling we have. Thus, the learning curve should not be steep at all.
1. Relying on well-known tools would make it easier. Templating engines like [Handlebars](https://handlebarsjs.com/) are well-known. STTL is not well-known and doesn't support all the features we need.
2. More documentation about the entire repository configuration and tooling (Actions, hooks, npm scripts to run at some point etc.)
2. It should be easy to debug/observe the process.
3. If multiple tools are "chained together", we should ensure that the inputs and outputs match. An example would be if the generated JSON Schema is badly structured, it would result in cryptic TS definitions.
4. Not mixing up multiple languages in one resource file (currently, we have HTML snippets in SHACL shapes).
5. (Optional) The tooling should be easy to install and understand the inner workings.

## Output Resources
### Output Resources

Output resources are given to the users of the specification for various usages.

Expand All @@ -63,7 +81,7 @@ The following resources are updated each time the "source of truth" in the input
- JSON-LD Context
- JSON Schemas: For TD and TM
- SHACL Shapes
- Abstraction or Interfaces for Programming Languages:
- Abstraction or Interfaces for Programming Languages:
- TypeScript definitions: Currently done via Scripting API TF
- We need to clarify it for other languages
- Test cases: Each feature has its test case(s). Currently in Eclipse Thingweb but will be moved here
Expand All @@ -75,9 +93,9 @@ The following resources are updated less frequently, generally in publication ph
- UML Class Diagrams: Beautified versions of the "always synced" diagrams
- Users: Adopters of the standard (to give an overview)

## Input Resources
### Input Resources

Input resources, referred to as source of truth, are what the TD Task Force writes and maintains to generate the output resources for the users.
Input resources, referred to as the source of truth, are what the TD Task Force writes and maintains to generate the output resources for the users.

The following resources are updated for each change:

Expand Down
46 changes: 23 additions & 23 deletions toolchain/tool-analysis.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,23 +5,23 @@ The current W3C WoT specification generation tool relies on various WoT artifact
Therefore, the aim is to identify a set of potential tools that can simplify the generation of the WoT artifacts. An analysis of existing tools has been conducted based on different requirements, which are determined by the modeling features crucial for the TD information model.
The table below provides a summary of the analyzed tools according to the defined requirements.

| Requirement \| Tool | LinkML | TreeLDR | ESMF | A.ML | Schema Salad | SOML | WIDOCO |
|-------------------------|----------|----------|----------|----------|--------------|----------|----------|
| Language | Python | Rust | Java | Scala | Python | NG | Java |
| Object/Dict Support | | | | | | | |
| Condition Check | | | | | | | |
| Array Support | O | O | O | O | O | O | X |
| One of | O | O | X | O | O | X | X |
| Type/Type[] | X | X | X | X | X | X | X |
| Inheritance | O | O | O | O | O | O | X |
| Unknown object keys | O | X | X | O | X | X | X |
| Pattern Matching | O | O | O | O | X | O | X |
| JSON Schema Generation | O | O | O | X | X | X | X |
| SHACL Shapes Generation | O | X | X | X | X | X | X |
| Term Documentation | O | X | X | X | O | X | O |
| Diagram Generation | O | X | O | X | O | X | O |
| Prog. Lang. Gen. | | | | | | | |
| Extensibility | | | | | | | |
| Requirement \| Tool | LinkML | TreeLDR | ESMF | A.ML | Schema Salad | SOML | WIDOCO |
| ----------------------- | ------ | ------- | ---- | ----- | ------------ | ---- | ------ |
| Language | Python | Rust | Java | Scala | Python | NG | Java |
| Object/Dict Support | O | | | | | | |
| Condition Check | | | | | | | |
| Array Support | O | O | O | O | O | O | X |
| One of | O | O | X | O | O | X | X |
| Type/Type[] | X | X | X | X | X | X | X |
| Inheritance | O | O | O | O | O | O | X |
| Unknown object keys | O | X | X | O | X | X | X |
| Pattern Matching | O | O | O | O | X | O | X |
| JSON Schema Generation | O | O | O | X | X | X | X |
| SHACL Shapes Generation | O | X | X | X | X | X | X |
| Term Documentation | O | X | X | X | O | X | O |
| Diagram Generation | O | X | O | X | O | X | O |
| Prog. Lang. Gen. | O | | | | | | |
| Extensibility | O | | | | | | |

Explanations:

Expand All @@ -33,9 +33,9 @@ Explanations:
- Inheritance: A class inheriting another one and thus copying all its properties. E.g. property affordance inheriting data schema
- Unknown object keys: An object whose property keys are not defined by the model. E.g. all affordance names, securityDefinition keys.
- Pattern Matching: A string value with a regex pattern. E.g. `tm:optional` being restricted to affordance names only.
- JSON Schema Generation:
- SHACL Shapes Generation:
- Term Documentation:
- Diagram Generation:
- Prog. Lang. Gen.:
- Extensibility:
- JSON Schema Generation: Generation of JSON Schemas to validate TD and TM instances.
- SHACL Shapes Generation: SHACL Shapes to validate TDs which are serialized into RDF Graphs.
- Term Documentation: Human-readable text to help understanding of terms
- Diagram Generation: UML Class diagrams to help illustrate the information model
- Prog. Lang. Gen.: Generation of assets for various programming languages such as types in TypeScript.
- Extensibility: Extending the tool to offer other output formats in the future.

0 comments on commit 746a8aa

Please sign in to comment.