Lesson plan 2: Data management plans (DMP)
FAIR elements: All (see Summary of Tasks/Actions 1. a) for more detail)
Primary audience(s): Bachelor's, Master's, PhD degree students
Learning outcomes:
Can describe what a data management plan is
Can explain why data management planning is a step towards FAIR
Can tell which areas should be covered in a DMP
Can sketch a DMP for their own research project
or (depending on scope and intensity of the lesson): Can develop detailed DMP according to funder requirements and engage with relevant university instances/authorities
Can collaborate on a DMP and modify the plan during the project ('living document')
Can apply principles to protect personal sensitive data and develop a data protection impact assessment, if required (depending on discipline)
Can summarise best practices in data quality (principles, benefits, standards and tools)
Understands when it is appropriate to create plans and knows the difference between DMP and other types of documents for the project, e.g. project management plan
Knows tools, guides, templates and other types of support for DMP creation
Knows the common difficulties during DMP creation
Understands the concept of the machine-actionable DMP
Summary of tasks/actions:
Introduction to data management plan (DMP)
DMP with reference to FAIRness A good data management plan covers all FAIR principles (Findable, Accessible, Interoperable, Reusable)(17). A DMP helps to make the data Findable (F principle) because it includes all information about where data is stored and preserved, during and after the project. Moreover, a DMP also contains information about persistent identifiers, e.g. DOI, along with a description of the data and metadata standards used. A DMP helps to make the data Accessible (A principle) because it also includes information about how data can be accessed, what is required to access the data (authentication or authorisation) and by what (standardised and universal) communications protocol, e.g. HTTP, HTTPS. A DMP helps to make the data Interoperable (I principle), indicating which metadata standards, vocabularies, methodologies, and tools were used to facilitate interoperability. Moreover, a machine-actionable DMP also helps to address the ability of different systems and services to exchange both metadata and data produced during the project. A DMP helps to make the data Reusable (R principle) because it allows data to be described with more detail and accuracy, making it easier for others to understand. Moreover, during DMP creation, it is necessary to indicate the information that is needed to prepare the data for sharing and reuse with appropriate licences and rules, namely, how the data can be reused and for whom the data may be valuable.
When is a DMP needed, at what stage of the project?
Content of a good DMP
Context of the project (brief description and examples)
Data and resources produced/collected during the project (brief description of the type and formats of the data; examples)
Methodologies used for data collection (brief description and examples)
Organisation of the data during the project and in datasets (brief description of the structure and names of the folders and files; examples)
Metadata and metadata standards (brief description and examples
Documentation (brief description of the additional documentation, such as confidentiality agreements, agreements between partners, informed consent, authorisation by Ethics Committee, Data Protection Impact Assessment (DPIA) or Data Protection agreement that can substitute DPIA; examples)
Data quality procedures during data collection, data processing, data sharing and reuse
What does data quality mean in research data management?
Quality assurance guidelines (data description, metadata standards, documentation, data checking, etc.)
Ensure quality control (curation processes, data entry programs, use of standardised data formats, etc.)
documenting the calibration of instruments
taking duplicate samples or measurements
standardised data capture, data entry or recording methods
data entry validation techniques
methods of transcription
peer review of data
Data quality for publishing in repositories (completeness, uniqueness, timeliness, validity, accuracy, consistency)
Data quality assessment (data quality checklist)
Ethics and intellectual property (brief description and examples)
Data sharing (data access and reuse) (brief description and examples)
Data storage and backup (brief description and examples)
Selection and preservation of data (brief description and examples)
Responsibilities for managing data and resources (brief description and examples)
Additional information (such as the DMP monitoring and update process, and its importance) (brief description and examples)
Tools for DMP creation
DMPOnline (brief description and demonstration of the tool)
Data Steward Wizard (brief description and demonstration of the tool)
Argos DMP (brief description and demonstration of the tool)
Guides and templates that help create a DMP
Guides developed by government institutions and funders (e.g. Guidelines on FAIR Data Management in Horizon 2020) (brief description and examples)
Guides for specific domains, e.g. cancer research, clinical research, biological research (brief description and examples)
Checklists, frameworks, e.g. Digital Curation Centre (DCC), Inter-university Consortium for Political and Social Research (ICPSR), Framework for Creating a Data Management Plan (brief description and examples)
Support for DMP at the institution
Data Steward (brief description and responsibilities)
Data Protection Officer (brief description and responsibilities)
Research data support in library (brief description and responsibilities)
Other types of support, e.g. IT staff, grant administrator, funder officer, project managers (brief description and responsibilities)
A different approach to DMP creation for sensitive, personal and private data
Difference between these types of data (brief description and examples)
Additional documents and procedures, GDPR, connection with ethics committee, DPO, DPIA (brief description and examples)
Common difficulties in DMP creation (brief description of each point and examples)
Creation of the DMP for a project relevant for learners (practice session with a presentation and defence)
Materials/Equipment
Computer/laptop
Internet
DMPOnline or other tool that helps to create a DMP
References
Definitions
Clare, C., et al.: The Cookbook, Engaging Researchers with Data Management (2019). https://doi.org/10.11647/OBP.0185
Michener WK (2015) Ten Simple Rules for Creating a Good Data Management Plan. PLoS Comput Biol 11(10): e1004525. https://doi.org/10.1371/journal.pcbi.1004525
Dominik Schmitz, Daniela Hausen, Ute Trautwein-Bruns: Content of a Data Management Plan. RWTH Aachen University. 2020. Available at DOI: http://doi.org/10.18154/RWTH-2019-10064, https://youtu.be/fcCj6sNvoOw
Research Data Netherlands: The what, why and how of data management planning, 2014, https://youtu.be/gYDb-GP1CA4
Juran, Joseph M., and A. Blanton Godfrey. Juran's quality handbook: Fifth Edition. McGraw-Hill Education, 1998. Available at: https://gmpua.com/QM/Book/quality%20handbook.pdf
Chapman, Arthur D. Principles of data quality. GBIF, 2005. https://docs.niwa.co.nz/library/public/ChaArPrindq.pdf
Miksa T, Simms S, Mietchen D, Jones S (2019) Ten principles for machine-actionable data management plans. PLoS Comput Biol 15(3): e1006750. https://doi.org/10.1371/journal.pcbi.1006750
Science Europe: Practical Guide to the International Alignment of Research Data Management, https://doi.org/10.5281/zenodo.4915861
Tools
Useful links
Use cases / Examples of DMP
Karimova Y., Ribeiro C., David G. (2021) Institutional Support for Data Management Plans: Five Case Studies. In: Garoufallou E., Ovalle-Perandones MA. (eds) Metadata and Semantic Research. MTSR 2020. Communications in Computer and Information Science, vol 1355. Springer, Cham. https://doi.org/10.1007/978-3-030-71903-6_29
Barbosa, Susana & Karimova, Yulia. (2020). SAIL Data Management Plan (Version 1.0.0). Zenodo. https://doi.org/10.5281/zenodo.4286210
Diepenbroek, M., et al. (2014). Biodiversity and Ecological Research Data: Towards an integrated biodiversity and ecological research data management and archiving platform: the German federation for the curation of biological data (GFBio). In: Plödereder, E., Grunske, L., Schneider, E. & Ull, D. (Hrsg.), Informatik 2014. Bonn:Gesellschaft für Informatik e.V. (p. 1711-1721). https://dl.gi.de/handle/20.500.12116/2782
Use cases/Examples of data quality processes
Biodiversity:
OECD (2017), "Data quality", in OECD Handbook for Internationally Comparative Education Statistics: Concepts, Standards, Definitions and Classifications, OECD Publishing, Paris, https://doi.org/10.1787/9789264279889-9-en.
Chapman, A. D. 2005. Principles of Data Quality, version 1.0. Report for the Global Biodiversity Information Facility, Copenhagen. Url:https://docs.niwa.co.nz/library/public/ChaArPrindq.pdf
Chapman, A., Belbin, L., Zermoglio, P., Wieczorek, J., Morris, P., & Nicholls, M. et al. (2020). Developing Standards for Improved Data Quality and for Selecting Fit for Use Biodiversity Data. Biodiversity Information Science And Standards, 4. doi: https://doi.org/10.3897/biss.4.50889
Medicine and Biomedicine:
Geospatial:
Sensoring:
SAIL and Sensor data quality control procedures:
Take-home tasks
Analysis of existing metadata standards: https://rdamsc.bath.ac.uk/scheme-index and https://fairsharing.org/standards
Choosing the right licence for data, e.g. https://ufal.github.io/public-license-selector/, more information on this can also be found in lesson plan 9
Analysis of the DMP examples for scientific domain relevant to learners
Analysis of the examples of the data quality procedures
Datasets validation from data quality perspective
Creation of a data quality policy for an specific use case
Creation of the DMP for a project relevant for learners
Preparation of a presentation for defence
(17) https://www.go-fair.org/fair-principles/
Last updated