Is it sufficient to make my data openly available?
No - openness is a necessary but not sufficient condition for maximum reuse. Data have to be FAIR in addition to open.
What do the FAIR principles mean/imply for different stakeholders/audiences?
Researchers may be reluctant to share their data because they are afraid that others will reuse them before they have extracted the maximum usage from them, or that others might not fully understand the data and therefore mis-use them. You may publish your data to make them findable with metadata, but set an embargo period on the data to make sure that you can publish your own article(s) first.
Is making my data FAIR a lot of extra work?
No necessarily. Making data FAIR is not only the responsibility of the individual researchers but of the whole group. The best way to ensure your data is FAIR is to create a Data Management Plan (DMP) and plan everything beforehand. During the data collection and data processing follow the discipline standards and measures recommended by a repository.
I want to share my data. How should I license them?
First of all think about who owns the data? A research funder or an institution that you work for. Then, think about authorship. Applying a suitable license to your data is crucial in order to make them reusable.
I cannot make my data directly available - they are too large to share conveniently/have restrictions related to privacy issues. What should I do?
You should talk to experts in domain specific repositories on how to provide sufficient instructions to make your data findable and accessible.
Research data are often the most valuable output of many research projects, they are used as primary sources that underpin scientific research and enable derivation of theoretical and applied findings. Open research data is data that can be freely accessed, reused, remixed and redistributed, for academic research and teaching purposes and beyond. Ideally, open data have no restrictions on reuse or redistribution, and are appropriately licensed as such. In some cases, e.g. to protect the identity of human subjects, special or limited restrictions of access are set. Openly sharing data exposes it to inspection, forming the basis for research verification and reproducibility, and opens up a pathway to wider collaboration.
The best practice recommendation for open research data it for the data to be as open and FAIR as possible, while accounting for ethical, commercial and privacy constraints with sensitive data or proprietary data.
The FAIR data principles is a core set of principles to optimise the reusability of research data.
Most researchers are more or less familiar with Open Access publishing of research articles and books. More recently, and for the reasons mentioned above, data publishing has gained increasing attention. More funders expect the data produced in research projects they finance to be findable, accessible and as open as possible.
There are several ways to make research data accessible:
Make use of the UFS's institutional data repository, figshare, where possible. Funders might require you to deposit your data in a specific repository. re3data can be used to discover other available data repositories.
Important: Start planning where to deposit or publish your research data already in your data management plan (DMP). Consider which data and associated metadata, documentation and code will be deposited. Ask yourself how long the data will need to be retained. And for how long the data should remain reusable. How will your data be made available? What access will you provide? Remember, if your dataset is to 'count' as a publication/research output, it should follow a similar publication process as an article - properly documented with metadata, reviewed for quality, searchable and discoverable in databases, and citable in articles.
Data citation services help research communities discover, identify, and cite research data (and often other research objects*) with confidence. This typically involves the creation and allocation of Digital Object Identifiers (DOIs) and accompanying metadata through services like DataCite and CrossRef, and can be integrated with research workflow and standards. This enables research articles to be linked to any underlying data, and legitimises research data as contributions to the process of scholarly communication. It can also help to recognise new metrics and publication models, as well as pave the way for rewarding data sharing.
Read more about data citation principles.
*In addition to data sharing, the openness of research relies on sharing of materials. Here are some examples of what you can share, although it will be discipline specific or sometimes unique to a lab:
With appropriate data management planning much sensitive and proprietary data can be shared, reused, and FAIR. The metadata can almost always be shared. Guidance and best practices for sharing sensitive data are necessarily region-specific because of differing regulations, e.g. POPIA for researchers in South Africa.
Consult with your ethics review board on de-identification of personal research data. Some datasets will never be suitable to safely de-identify and share. Researchers can still improve the openness of research on such data by creating and sharing synthetic data. Synthetic data is similar in structure, content, and distribution to the real data and aims to attain "analytic validity": statistical analysis will return the same results for synthetic data as the real data.