Skip to content

Commit

Permalink
more tool updates
Browse files Browse the repository at this point in the history
  • Loading branch information
bedroesb committed Jun 23, 2023
1 parent db30edd commit fc4daf8
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions pages/tool_assembly/transmed_assembly.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,12 +36,12 @@ For projects involving patient recruitment the TransMed assembly provides the Sm
The transfer of data files can be done via various encrypted communication options as outlined in the [Data transfer](data_transfer) section of the RDMkit. The TransMed assembly most typically utilises (S)FTP, Aspera FASP and ownCloud. Data is also encrypted at rest with hard-ware and also with file-level encryption using either open-source utilities such as gpg or commercial options such as Aspera FASP.

### Data curation and harmonisation
To facilitate cross-cohort/cross-study interoperability of data, upon collection, the data needs to be curated and harmonised. For this purpose the TransMed assembly uses a variety of open standards and tools. For data quality and cleansing the assembly uses {% tool "openrefine" %}, which provides an intuitive interface to generate facets of data that support the research to identify quality issues and outliner. It also enables traceable and yet easy data correction. For data Extraction, Transformation and Loading (ETL) the assembly uses [Talend Open Studio](https://www.talend.com/) (for complex and reusable ETLs) as well as R and Python (for ad-hoc and simple transformation). To evaluate and improve FAIRness of datasets, the assembly follows the recipes in the {% tool "fair-cookbook" %} developed by the FAIRplus consortium. Related to standard data models and ontologies the assembly follows the recommendations in the FAIR Cookbook recipe for selecting terminologies and ontologies.
To facilitate cross-cohort/cross-study interoperability of data, upon collection, the data needs to be curated and harmonised. For this purpose the TransMed assembly uses a variety of open standards and tools. For data quality and cleansing the assembly uses {% tool "openrefine" %}, which provides an intuitive interface to generate facets of data that support the research to identify quality issues and outliner. It also enables traceable and yet easy data correction. For data Extraction, Transformation and Loading (ETL) the assembly uses {% tool "talend" %} Open Studio (for complex and reusable ETLs) as well as R and Python (for ad-hoc and simple transformation). To evaluate and improve FAIRness of datasets, the assembly follows the recipes in the {% tool "fair-cookbook" %} developed by the FAIRplus consortium. Related to standard data models and ontologies the assembly follows the recommendations in the FAIR Cookbook recipe for selecting terminologies and ontologies.

### Data integration and analysis
TransMed projects usually require different data types from different cohorts to be integrated into one data platform for the exploring, sub-setting and integrated analysis for hypothesis generation. The TransMed assembly consists of several such tools: [Ada](https://ada.parkinson.lu/documentation/intro) is a web-based tool to provide a performant and highly configurable system for secured integration, visualization, and collaborative analysis of heterogeneous data sets, primarily targeting clinical and experimental sources. The assembly also includes other tools for specific data types, such as {% tool "atlas" %} that integrate features from various {% tool "ohdsi" %} applications for Electronic Health Record data in {% tool "omop-cdm" %} format into a single cohesive experience. {% tool "transmart" %} is a tool that provides easy integration between phenotypic/clinical data and molecular data and a “drag-and-drop” fashion data exploration interface.

### Data stewardship

To facilitate the findability of data the TransMed assembly provides a [Data/Sample Catalog tool](https://datacatalog.elixir-luxembourg.org/) that supports the indexing search and discovery of studies, data sets and samples accumulated in the context of projects from different sites and cohorts. The catalog implements a controlled-access model by integration with {% tool "rems" %}. Audit trailing of data access is achieved by integration of the {% tool "daisy" %} in the access process. The catalog tool can be integrated with various identity management systems such as {%tool "keycloak" %}, {% tool "life-science-login" %} or {% tool "free-ipa" %}.
To facilitate the findability of data the TransMed assembly provides a {% tool "data-catalog" %} tool that supports the indexing search and discovery of studies, data sets and samples accumulated in the context of projects from different sites and cohorts. The catalog implements a controlled-access model by integration with {% tool "rems" %}. Audit trailing of data access is achieved by integration of the {% tool "daisy" %} in the access process. The catalog tool can be integrated with various identity management systems such as {%tool "keycloak" %}, {% tool "life-science-login" %} or {% tool "free-ipa" %}.

0 comments on commit fc4daf8

Please sign in to comment.