Winding up of Task Group 1 – Framework on Data Quality

Arthur Chapman edited this page Nov 20, 2023 · 1 revision

Convenor: Allan Koch

Charter: https://www.tdwg.org/community/bdq/tg-1/

Task Group 1 has completed its work: it developed a conceptual framework on data quality, on which the other Task Groups were founded and continue to build. The remaining work on framework vocabularies has been folded into the products of Task Group 2 and is included in the documentation for the proposed BDQ-Core Standard.

Goals, Outputs and Outcomes

GOAL Develop a conceptual framework that serves as a common ground for a collaborative mapping of DQ needs and DQ methods, tools and DQ reports for DQ Assessment and Management based on data fitness for use.

PROGRESS The main goal of the task group has been achieved. The main outcomes were the publication of the conceptual framework (Veiga 2016; Veiga et al. 2017), its presentation in a webinar of the Biodiversity Informatics Training Curriculum (Veiga 2018), and a follow-up publication that establishes a “common language” for the TDWG community (Chapman et al. 2020).

An important outcome was the engagement of the other task groups in using the framework as a common ground, along with work to put it into practice in initiatives such as the Kurator Project (Morris et al. 2017) and the Online Pollen Catalogs Network (RCPol) (Veiga et al. 2018). To spread and consolidate the principles and concepts in a more practical way, the paper “Developing Standards for Improved Data Quality and for Selecting Fit for Use Biodiversity Data” was written (Chapman et al. 2020).

  • A formal Conceptual Framework for the Assessment and Management of the fitness for use of biodiversity data.
  • Establish a “common language” in order for the Biodiversity Informatics community to express and share their understanding of DQ needs and solutions, to increase the reusability and decrease the duplication of efforts.
  • A case study that describes how to use the Conceptual Framework for performing the Assessment and Management of fitness for use in an institution, published in the results of Task Group 3 – Data Quality Use Cases (Rees & Nicholls 2020) and combined there with the Framework.
  • Methods and guidelines to use the Framework.
  • Establish a common vocabulary for the whole DQ Interest Group.

Further notes, documentation and references can be found on the BDQ Interest Group Wiki at https://github.com/tdwg/bdq/wiki

References

  • Chapman AD, Belbin L, Zermoglio PF, Wieczorek J, Morris PJ, Nicholls M, Rees ER, Veiga AK, Thompson A, Saraiva AM, James SA, Gendreau C, Benson A, Schigel D (2020). Developing Standards for Improved Data Quality and for Selecting Fit for Use Biodiversity Data. Biodiversity Information Science and Standards 4: e50889. https://doi.org/10.3897/biss.4.50889
  • Morris PJ, Hanken J, Lowery DB, Ludäscher B, Macklin J, McPhillips T, Morris RA, Wieczorek J, Zhang Q (2017). Fitness-for-Use-Framework-aware Data Quality workflows in Kurator. Biodiversity Information Science and Standards 1: e20379. https://doi.org/10.3897/tdwgproceedings.1.20379
  • Rees ER, Nicholls M (2020). Suppl. material 2: Data Quality Use Case Study Result. https://biss.pensoft.net/article/download/suppl/5255738/
  • Veiga AK (2016). A conceptual framework on biodiversity data quality [online]. São Paulo: Escola Politécnica, University of São Paulo. Doctoral Thesis in Sistemas Digitais. [cited 2017-05-15]. http://www.teses.usp.br/teses/disponiveis/3/3141/tde-17032017-085248/
  • Veiga AK (2018). BI Seminar #38: a Fitness-for-use Approach for Biodiversity Data Assessment and Management. Biodiversity Informatics Training Curriculum. https://www.youtube.com/watch?v=FJ7HLjl5_fg
  • Veiga AK, Saraiva AM, Chapman AD, Morris PJ, Gendreau C, Schigel D, Robertson TJ (2017). A conceptual framework for quality assessment and management of biodiversity data. PLOS ONE 12(6): e0178731. https://doi.org/10.1371/journal.pone.0178731
  • Veiga A, Saraiva M, da Silva C (2018). On Line Pollen Catalogs Network (RCPol). Biodiversity Information Science and Standards 2: e25658. https://doi.org/10.3897/biss.2.25658

Arthur D. Chapman and Antonio Mauro Saraiva (Co-Convenors, TDWG Data Quality Interest Group).