Annual Report TDWG Data Quality Interest Group for 2021

Arthur Chapman edited this page Jun 26, 2023 · 11 revisions

Phase of work:

The Interest Group currently has four Task Groups:

  • TG1 – Framework on Data Quality
  • TG2 – Data Quality Tests and Assertions
  • TG3 – Data Quality Use Cases
  • TG4 – Best Practices for Development of Vocabularies of Value

Activities:

  • See under each of the Task Groups below.

Accomplishments:

  • Little progress during the year; see Task Groups 2 and 4 below.

Impediments to progress:

  • COVID-19 and inability to formally meet

Changes in goals or scope:

  • Winding up of Task Groups 1 and 3 delayed.

Plans for next calendar year:

  • Finalize and get sign off on the CORE tests
  • Finalise the code and the test datasets for the Tests and Assertions
  • Submit Tests and Assertions as a TDWG Standard
  • Submit Best Current Practice for building vocabularies of values as a TDWG BCP.
  • Liaise with Annotations Interest Group on standardising Assertions as Annotations
  • Continue liaison with ALA, GBIF, iDigBio and others on harmonizing/aligning Data Quality procedures.
  • Encourage uptake of standard tests and assertions
  • Outreach and dissemination of information
  • Finalize, document and close Task Groups 1 and 3
  • Initiate a new Task Group to develop a representation of the framework as a TDWG Technical Specification (details still to be worked out).

TG1: Framework on Data Quality Task Group

Phase of work:

  • Approaching wind-up of TG

Activities:

  • Preparation of document for final report of Task Group

Accomplishments:

  • No progress due to COVID-19

Impediments to progress:

  • COVID-19 and inability to formally meet
  • The Task Group leader contracted COVID-19 during the year.

Changes in goals or scope:

  • Task Group to be wound up in 2022

Plans for next calendar year:

  • Consolidate and document all the outcomes from Task Group 1 in a comprehensive public final report.
  • A final recommendation from the Task Group will be for a proposed new Task Group (see Task Group 5, below) to develop a representation of the framework as a TDWG Technical Specification.

TG2: Data Quality Tests and Assertions Task Group

Phase of work:

  • Completing test data suite (last task before submission as a standard).

Activities:

Accomplishments:

  • Tests and their specifications (based on a single template) are final
  • Test data template agreed
  • Code has been written to extract the parameters of each of the tests to RDF, and we believe that this will form the basis of the proposed TDWG standard for the Tests and Assertions.
  • Finalized 99 CORE tests and documented them against a standard template: https://github.com/tdwg/bdq/issues?q=is%3Aissue+is%3Aopen+label%3ATest

Impediments to progress:

  • Inability to meet ‘face-to-face’.
  • Busy TG2 members
  • ‘Burnt out’ TG2 members. This work has taken much longer than anyone in the group anticipated. This has largely been due to the complexity of the task.
  • COVID-19

Changes in goals or scope:

  • None

Plans for next calendar year:

  • Proof and finalise Test Data and make available for public review
  • Develop a technical specification (see Task Group 5, below)
  • Submit the work of TG2 as a TDWG standard.

TG3: Data Quality Use Cases Task Group

Phase of work:

  • All work completed

Activities:

  • No work carried out during 2021

Accomplishments:

Impediments to progress:

  • COVID-19

Changes in goals or scope:

Plans for next calendar year:

  • Task Group to be wound up in 2022

TG4: Best Practices for Development of Vocabularies of Value Task Group

Phase of work:

  • Preparing best practices document

Activities:

  • Preparing best practices document
  • Vocabulary building: a joint effort with the GBIF Informatics team.
  • Community engagement:
    • NAOC Bird Data Harmonization Workshop (virtual, 22-26 February 2021): building vocabularies with the ornithological community.
    • TDWG Controlled Vocabularies Workshop (WKSH05) TDWG 2021.
    • Translation of vocabularies (Workshop at TDWG 2021)

Accomplishments:

  • Very slow progress during 2021 due to the convener's personal circumstances.
  • Progress on translation of some vocabularies at TDWG 2021.

Impediments to progress:

  • COVID-19 and the convener's personal circumstances

Changes in goals or scope:

  • None

Plans for next calendar year:

  • The Task Group does not plan to propose a new data standard or any modification to existing ones but intends to provide a best current practice for building TDWG vocabularies of values.