Data Management Plan
The data center of deep geothermal energy is dedicated to the collection, preservation and worldwide distribution of deep geothermal data. In order to provide state-of-the-art data management, a data management plan has been written and the requirements of the CoreTrustSeal certification are followed. The main goal is to make data FAIR: Findable, Accessible, Interoperable and Re-usable.
Data Findable
CDGP uses a cataloging application (GeoNetwork) to manage the metadata resources. It provides metadata editing and search functions as well as a web map viewer. The metadata editor supports ISO19115/119/110 standards used for spatial resources. A step forward will be to add specific metadata records as defined by the Open Geospatial Consortium to provide geophysical / geologic / reservoir information: Observations and Measurements (O&M) to describe the acquisition of information from a primary source, and SensorML to describe the sensors. Seismological metadata, which describe all the instrumental response, use the dateless SEED standard.
A DOI (Digital Object Identifier) will be assigned to each ‘episode’ dataset: time-correlated collections of geophysical, technological and other relevant geo-data over a geothermal area. This unique and eternally persistent identifier is specified in the metadata.
Data Accessible
An Authentication, Authorization and Accounting Infrastructure (AAAI) has been set up in order to ensure the good distribution of data according to Intellectual Property Rights (IPR), user’s affiliation (i.e. academic, industrial, …) and distribution rules, either automatically or after approval from the data owner. Data are also distributed through the EPOS-IP Anthropogenic Hazards platform. The distribution procedures is described in a workflow diagram.
Terms and conditions of use
Data Interoperable
The metadata is implemented so that all information needed to read, use and interpret data in the future is available.
The metadata and documentation of data formats is prepared and accessible at the level of data access. More information can be added through documentation. Thesaurii have been developed in order to normalize keywords within predefined categories. It facilitates the research by keywords and avoid the use of several spellings for a same word. It has been decided to create four types of keyword categories that are adapted to most of datasets:
Subject study, Project phase, Location, Variable
Data Re-usable
The licenses are defined in individual data owner/provider and CDGP agreements
The specific terms and conditions of use are defined in advance. In case of the H2020 projects it is forseen to make all the data open, if the data provider do not specify other access rules or embargo period.
Some restrictions of use can be applied on datasets. The license is available at the data level and when accessing it. In case of H2020 projects related data, the use of Creative Commons 4.0 International CC BY NC is recommended.
A data quality workflow has been set up in order to check that information necessary for re-use of data is provided. The general procedure of data processing is described in the functionnal diagram.