20.17 Data management
Effective data management is a cornerstone of rigorous research. It ensures the proper organization, preservation, and sharing of research data, aligning with principles of transparency, reproducibility, and ethical responsibility. This section outlines the key principles and processes for research data management, highlights available tools, and links to relevant guidelines and resources.
Principles of Research Data Management
Research data management refers to the organization, storage, preservation, and sharing of data collected and used during a research project. This practice is essential to ensure data is:
- FAIR: Findable, Accessible, Interoperable, and Reusable
- Secure: Protected from unauthorized access or loss
- Organized: Clearly documented for ease of use and understanding
- Reproducible: Ensuring that future researchers can validate results by using the same data
Several principles should guide researchers in managing their data:
- Data Collection and Documentation: Ensure accurate and thorough documentation of the data collection process.
- Data Storage and Security: Store data securely during the research project, ensuring data integrity and confidentiality.
- Data Sharing and Accessibility: Share data in open-access repositories where possible to comply with funder requirements and maximize the impact of your research.
- Data Archiving and Preservation: Safeguard data for long-term access and reuse by archiving it in trusted repositories.
For further guidance, see the Science Europe guide on Research Data Management.
Data Reuse
Existing data repositories offer a good starting point to identify datasets that can be reused.
Data repositories:
Data Management Process
Below is a recommended process structure for managing research data throughout the lifecycle of a project:
- Planning Stage
- Develop a Data Management Plan (DMP) that outlines how the data will be collected, stored, and shared.
- Ensure compliance with institutional and funding body requirements.
- Identify relevant data policies from your funder or publisher.
- Data Collection and Organization
- Establish clear protocols for data collection.
- Use standardized formats and file naming conventions to ensure consistency and ease of use.
- Document metadata thoroughly to facilitate future understanding and use of the data.
- Data Storage and Security
- Store data in secure, backed-up locations during the project to protect against loss.
- Ensure that sensitive data is encrypted and access-controlled where necessary.
- Use platforms like OSF for project collaboration and version control.
- Data Sharing and Accessibility
- Publish data in open-access repositories where possible to comply with funder and publisher requirements (e.g., via GitHub - Zenodo).
- Add the
dataset
tag to the GitHub repository (see overview) - Create anonymous or view-only links for peer review using tools like Anonymous GitHub for peer review or OSF anonymous online datasets.
- Cite data appropriately in publications using DOIs or other identifiers.
- Data Archiving and Preservation
- After project completion, archive data in a trusted repository that supports long-term access and preservation.
- Ensure that your archived data is available under FAIR principles.
Institutional and Funder Requirements
Increasingly, institutions and funding bodies require detailed data management plans as part of project proposals. Below are guidelines from key institutions and organizations:
Institution | Requirements | Further Information |
---|---|---|
German Research Foundation (DFG) | The DFG provides guidelines on project planning, application processes, and safeguarding data. | DFG Guidelines on Research Data Management |
European Union (Horizon 2020) | The EU requires a DMP and open data policies for projects funded under Horizon 2020. | Guidelines on Open Access to Scientific Publications and Research Data |
National Science Foundation (NSF) | NSF project proposals must include a Data Management Plan to ensure data is appropriately archived and accessible. | NSF Data Sharing Policy |
German Rectors’ Conference (HRK) | HRK offers detailed guidelines for university leaders on establishing robust research data management practices. | HRK Guidelines |