
Research Data
Publishing of research data as open as possible, as closed as needed is the point of departure at WUR. This comes with copyright related questions.
What are research data?
What is a dataset?
Who owns the research data that I create?
Data ownership is a complicated issue because it deals with legal rights and contractual agreements.
Elements to be considered:
- Data that is factual has no copyright protection. Most research data is factual.
- Copyright at the item level is limited to items that meet the threshold for copyright protection (see question: What is copyright and what is protected? on the page General Information).
- A database can be copyright protected, if such database, by reason of the selection or by arrangement of its contents, constitutes the author’s own, sufficiently original, intellectual creation (see question: What can I do with my copyright on the page General Information).
- Databases can also be protected by a sui generis database right, if substantial qualitative or quantitative investments have been made in obtaining, verifying or presenting the contents of the database concerned. A database right gives the database’s creator the right to prevent extraction and/or re-utilization of the whole or a substantial part of that database’s contents.
- Copyright within a dataset involves elements, such as annotations, visualizations, and metadata that are part of a dataset and meet the threshold for copyright protection (see question: What is copyright and what is protected? on the page General Information).
- Even if a dataset is not protected by an intellectual property right, agreements can be made as to the use thereof, for instance, confidentiality. In contrast to the rights associated with intellectual property rights, such abovementioned agreements involve only the party with whom such an agreement was made.
What is research data management?
Does WUR have a research data ownership policy?
What are WUR's rules for (re)using and sharing research data?
How can I publish my data?
Are text and datamining (or scrapping) allowed under copyright law?
WUR has the right to use automated analytical techniques to analyse digital texts and research data if the gathered data is used for research. Such research has to be focused on the generation of patterns, trends and correlations that will help, for example, to train or to develop AI or models.
The AI or models can be published, but the researcher may not publish or distribute the information retrieved with text and data mining (TDM), such as complete scientific articles or their abstracts. However, other metadata can be published.
TDM usually consists of four stages:
- Identifying and retrieving data.
- Converting the data to a machine-readable format.
- Extracting the information from the data.
- Using the information in research.
This TDM-right is limited to research data to which WUR has lawful access. TDM-arrangements are increasingly included in inter-university agreements with large data providers, such as publishers. The research data that is retrieved and stored needs to be subject to appropriate security measures and cannot be stored longer than necessary.
Last updated on 04/01/2022.