The WUR data policy requires data underlying publications to be archived for at least 10 years. Where to archive your data, depends on its sensitivity. For example, when data cannot be archived in e.g. a repository by publishing the data, they can and should be archived on the e.g. the WUR server (W-drive).
Have a look at the Data Sharing Infographic and the archiving options below to determine where you can archive your research data.
In a data repository
Data sets and related metadata can be deposited in a public data repository. Most data repositories allow you to place a - temporary - embargo on your data: during the embargo period the description of the data set is published, but the data themselves are not available for reuse by others until the embargo expires. There are no WUR guidelines on which repository to use, but we can give some recommendations:
- Disciplinary vs. multi-disciplinary repositories
Disciplinary repositories know how to handle and curate specific data types, they provide specific metadata standards improving interoperability and your data is easier found by peers within your research fied. If there is a disciplinary repository for your field of research that can accommodate your data, that should be your first choice. For example, disciplines like bioinformatics have a long tradition of publishing their data in NCBI, EBI, etc. If there is no disciplinary service for your field, you can choose a multidisciplinary repository, e.g. DANS-EASY, 4TU.ResearchData, Zenodo, Dryad, Figshare, Mendeley data.
The Data Desk assesses data repositories to ensure that they offer secure, durable storage and that they make data sets easily findable. This assessment focuses on a set of criteria, such as if the repository provides metadata, if it scans data files for corruption, has a back-up recovery system in place and if it assigns persistent identifiers to data sets. If a repository has been approved, it is recommended for data archiving as stipulated in the WUR data policy.
The table below shows the repositories approved so far. Do you have a repository you would like to get assessed? Please contact the Data Desk.
|Repository||Discipline||Associated journal(s) or publisher(s)|
|DANS-EASY||Multidisciplinary (focus on social sciences, but also life sciences)||None known|
|4TU Centre for Research Data||Technical sciences||4TU.ResearchData is integrated with GitHub to archive Git Repositories.|
|Dryad||Multidisciplinary (focus on life sciences)||Hundreds of journals offer integrated data submission with Dryad: browse the list.|
|Figshare||Multidisciplinary||Many publishers have a partnership with Figshare, including Springer Nature, PLOS, and Wiley.|
|Harvard Dataverse||Multidisciplinary (focus on social sciences)||Various publishers recommend Harvard Dataverse, and some journals have set up their own dataverse.|
|Zenodo||Multidisciplinary||No partnerships known, but recommended by many publishers. Zenodo is also integrated with GitHub to archive Git Repositories. EU repository, so handy for EU-projects.|
|Mendeley Data||Multidisciplinary||Integrated into the workflow of Elsevier journals.|
|Pangaea||Earth & Environmental Science||No partnerships or integrations known, but recommended as the standard repository in the discipline by various publishers.|
|ISRIC WDC-Soils||Soil sciences, Geosciences||None known|
|GBIF/NLBIF||Biology, Biodiversity||None known, but recommended by publishers including PLOS and Springer Nature.|
|NCBI: Genbank||Biology, Genetics||No partnerships known, but the use of Genbank is encouraged by many publishers. Examples are PLOS, Springer Nature and Elsevier.|
|EMBL-EBI: ArrayExpress, ENA, BioStudies, PRIDE, BioModels, IntAct, MetaboLights||Biology, Genetics, Bioinformatics||EMBL-EBI repositories are often recommended by publishers. Examples are PLOS, Springer Nature and Elsevier.|
- Data repositories supported by WUR Library
The Data Desk can publish your data set(s) for you at DANS-EASY, 4TU.ResearchData or Zenodo. Open Access to the data set is encouraged in these repositories, but restricted access is also possible. Check this page on how to publish in these repositories.
- A data repository supported by a journal
Many journals have an agreement with a data repository, i.e. Dryad or Figshare, where you can deposit your data during the publication process of your article. It may be convenient to choose that service, but this is not obligatory. Usually only Open Access is possible.
- On a project website
This is suitable when you want to publish a database that is still evolving. If you wish to publish a paper about this database, you can publish a version of the database in a data repository. Also remember to deposit the final version of the database in a data repository when the website ceases to exist.
- On a shared WUR server
When data should remain closed or when restricted access options follow from the WUR data sharing guidelines you may archive your data on a shared WUR server (W-drive). Make sure that only people with the appropriate rights can access the shared folder. If you need help, ask the Data Desk.
- As supplementary information to your research paper
Supplementary information files may be uploaded with the manuscript to the publisher's website. Supplementary information may contain tables and figures that are too large to incorporate in the article itself, or extra information on methods or materials. Regardless the content of the supplementary information, check whether you aren't transferring copyrights to the publisher when you upload supplementary information files. Besides, data becomes less findable and cannot be cited separately when put in the supplementary information. That’s why it is recommended to publish your data set in a data repository. Also, small data sets are welcome in a data repository.
- As a data paper (i.e. a description of your data) in a data journal
Larger data sets that may be used for another purpose than their original purpose may be suitable for a data paper. A data paper is a peer reviewed article describing openly accessible data sets for future reuse. Most well-known publishers publish a data journal. Wageningen Environmental Research and WUR Library publish a data journal for the agricultural sciences: ODJAR 'Open Data Journal for Agricultural Research'.