Skip to content

ejp-rd-vp/resource-metadata-schema

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GitHub GitHub tag (latest by date) GitHub tag (funder)

Metadata for EJP rare disease patient registries, biobanks and catalogs

As part of the European Joint Programme (EJP) for Rare Disease, we are developing standards for rare disease registries to describe their metadata that will improve the FAIR-ness of these resources.

The core model is designed to represent data about a rare disease patient and biosample registries. The model is based on and builds on existing standards, such as the European Rare Disease Registry Infrastructure and the Common Data Elements from the rare disease community and other more generalised standards for data sharing such as the W3C DCAT vocabulary. We are also working to align with similar schema standarisation efforts such as RD connect semantic model, schema.org, bioschemas, MIABIS and GA4GH (see also schema blocks and phenopackets. A proposed semantic model for the Common Data Elements can be found here.

Status: This is version 1.0 of EJP RD metadata model.

Metadata modules overview

The figure below gives an overview of upper level concepts and properties used in our metadata model.

You can browse different metadata modules by visiting the links below.

Ontologies used in the metadata model

To describe different metadata modules listed above we used various exiting ontologies and vocabularies which are listed below.

Concepts to describe resources that we use in the Rare diseases domain such as biobank and patient registry are not defined in resource description vocabulary as a DCAT. To overcome this issue we have extended DCAT with by added missing concepts to DCAT vocabulary.

Implementation of metadata model

The metadata modules described in this repository are provided with RDF examples and RDF validation artifacts. You get more benefits if you implement metadata model in RDF. However implementation of our metadata model is not limited only to RDF.

Use cases

Below you can find some use cases which can be addressed by proposed metadata modules.

  • Provide minimal metadata to describe a rare disease registry or biobank, or a catalog of registries or biobanks. The metadata should be sufficient to expose data about these resource through the virtual platform
  • Provide a uniform way for resources to expose the primary disease using a Orphanet code so that resources can be searched by disease in the virtual platform
  • Provide a mechanism to identify resources and harmonies duplicate resources across catalogs
  • Provide geographical information so resources can be filtered by country in the VP
  • Expose if resources provide metrics about individuals, such as number of cases
  • Expose if a resource has access to biological samples, such as tissue or cell ines.
  • Expose if the resource has further contact information
  • ...

Configurations to provide metadata records according to the EJP-RD schemas

  • FDP reference implementation or FAIR in a Box (FiaB): here
  • Metadata schema reference implementation (lvl1): here