TrackFind

About

Thousands of genomic annotation tracks have been generated the recent years, many in the context of larger undertakings such as BLUEPRINT and ENCODE. Several data portals for tracks are providing search services to researchers, but the underlying metadata is diverse and often poorly curated. The Trackhub Registry provides a unified access point, but currently only supports limited search capabilities.

In the context of the Elixir Implementation Study: “FAIRification of Genomic Tracks”, we have developed the TrackFind service. TrackFind supports crawling of the TrackHub Registry and other data portals to fetch track metadata. Crawled metadata can be accessed through hierarchical browsing or by search queries, both through a web-based user interface, and as a RESTful API. TrackFind supports advanced SQL-based search queries that can be easily built in the user interface, and the search results can be browsed and exported in JSON or GSuite format. The RESTful API allows downstream tools and scripts to easily integrate TrackFind search, currently demonstrated by the GSuite HyperBrowser and EPICO.

In addition to supporting most metadata models directly, TrackFind also supports the transformation of metadata into the FAIR model defined in the “FAIRification of Genomic Tracks” Implementation Study. Such transformation can be achieved on per-TrackHub basis through online scripting, thus providing a simple path for data managers to FAIRify their track metadata. TrackFind also maintains a version history of all metadata changes, including all recrawlings and transformations. We are also planning to add functionality for curating existing track metadata content.

We believe the TrackFind track search engine and metadata FAIRification service to be a major contribution, both to maintainers of genomic annotation track data, as well as to researchers and tool developers interested in making use of the wealth of track data publicly available.

Development

TrackFind is written in Java 13 and uses Maven 3 as a build-tool.

Deployment

Configuration

The following environment variables must be set before launching the setup:

Variable name	Default value	Mandatory	Description
SERVER_NAME		✔	External domain name
PROXY_PASS		✔	Internal (to Docker) host
ELIXIR_AAI_CLIENT_ID		✔	Elixir AAI client ID
ELIXIR_AAI_CLIENT_SECRET		✔	Elixir AAI client secret
REDIRECT_URI	/oidc-protected		OIDC entry-point
CRYPTO_PASSPHRASE	salt		Password for crypto purposes, this is used for either encryption of the (temporary) state cookie or encryption of cache entries, that may include the session cookie
RESPONSE_TYPE	code		["code"/"id_token"/"id_token token"/"code id_token"/"code token"/"code id_token token"]
REMOTE_USER_CLAIM	sub		The claim that is used when setting the REMOTE_USER variable on OpenID Connect protected paths
REQUEST_FIELD_SIZE	65536		A server needs this value to be large enough to hold any one header field from a normal client request
PROTECTED_RESOURCES		✔	A comma-separated list of protected endpoints, e.g.: /admin,/actuator
SSL_ENGINE	off		Enables or disables TLS termination (if set to "on", the certificate chain and a private key should be injected)
POSTGRES_USER		✔	PostgreSQL username
POSTGRES_PASSWORD		✔	PostgreSQL password
SPRING_PROFILES_ACTIVE		✔	["dev"/"prod"]
ADMIN_ELIXIR_ID		✔	Elixir AII ID of the TrackFind initial admin user

Docker Compose

Simply execute docker-compose up and it will spin up the setup with PostgreSQL database, ElixirAAI-enabled Apache reverse-proxy, GSuite converter microservice and TrackFind microservice itself.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

TrackFind

About

Development

Deployment

Configuration

Docker Compose

Acknowledgments

Files

README.md

Latest commit

History

README.md

File metadata and controls

TrackFind

About

Development

Deployment

Configuration

Docker Compose

Acknowledgments