Research data refers to any data with which the analysis and results of a study can be repeated and validated. The data may have been collected by the researcher, generated during the study or consist of pre-existing archival data, and include various measurement results, survey and interview data, notes, research diaries, software or source codes.
Data management refers to the systematic collection, processing, storing and description of research data. Students are encouraged to learn about data management early in their studies, because good data management skills are beneficial to study progress and to adopting suitable data management practices during the thesis-writing process.
Data management practices should seek to comply with the FAIR principles, ensuring that the data is
This is achieved, for example, through the use of open file formats, comprehensive metadata, and persistent identifiers (e.g., DOI, URN, ORCID), and defining ownership, terms of use and licenses. Learn more about the FAIR principles and the policy component for open access to research data.
Before you collect any data, record the most suitable data practices in a Data Management Plan (DMP) that can be supplemented as the work progresses and plans become more accurate. Formulating a plan will help you identify potential data protection risks, as well as solutions suitable for storing and describing your data. Careful data management also allows you to make the data accessible for potential reuse and thus improve the reliability of your research and the repeatability of the results.
Planning can be done using the DMPTuuli tool that is accessible with your HAKA credentials. DMPTuuli contains templates and instructions that can be applied to the data management plan of a thesis.
Choose a secure storage solution for your data, based on the demand for the data and its confidentiality level. Secure storage, version control and backup help prevent any unintentional deletion of data. Open file formats, logical file naming and folder structure, as well as rich content descriptions facilitate the findability, intelligibility and sharing of data. Consider the following questions when choosing the storage solution:
NB: External storage media, such as flash drives, are not recommended as primary storage solutions, because data stored on them is susceptible to becoming lost, deleted and unintentionally shared with outsiders.
Backing up helps you decrease the risk of irreparable damage to or deletion of data. Always keep separate working and backup copies of the research data. Choose storage solutions that include automatic backup. Backing up should be based on the 3-2-1 Rule, meaning that data is stored in the following way:
To ensure the usability of your data on a variety of devices and software, using open, non-commercial file formats is recommended. Most software supports the following common file formats:
Systematic file naming practices and folder structures ensure the identifiability and findability of your data, even when there are time lapses in processing it. Clear file naming also simplifies file sharing. When you name a file:
NB: If you use abbreviations, remember to define them in writing so that they can be understood.
Document the basics of your data during the thesis writing process to ensure the findability and usability of your data. Documenting makes it easy to check the contents of your data, how it has been processed and where it is stored. The simplest option is to record the descriptive data (or metadata) related to your data in a text file (a.k.a. README file) that you save as a separate file along with your data. Metadata may also be published according to the description guidelines of the particular publishing service. Record at least the following information in the file:
Read more about storing, file naming, recommended file formats and documenting in the Data Management Guidelines of the Finnish Social Science Data Archive.
Take care of your research data even after the completion of the thesis. Electronic data requires further measures to stay up-to-date, and not all data needs to be archived for long periods of time. Based on the reuse value of your data, choose appropriate measures, such as data archiving, publishing or deletion. Keep in mind that your right to use the University of Vaasa IT services expires after graduation, unless you continue in another university role, such as a position of doctoral researcher or employee. If the data is stored in the University of Vaasa systems, remember to transfer or delete it before your access rights expire.
If your data contains personal details, it is usually deleted after the thesis has been accepted. Keep in mind that moving a file to the recycle bin does not sufficiently delete the data. More thorough measures, such as overwriting a drive or mechanically destroying a flash drive, are needed. Further information on deleting the data: Office of the Data Protection Ombudsman or Data Management Guidelines of the Finnish Social Science Data Archive.
If the data has reuse value and you have permission to reuse or publish the data, you may publish or archive your data in a chosen data archive. Keep in mind that you may need permission for data reuse or publishing from your research subjects or potential customer, and that data anonymisation may be a condition for publishing. For example, the Finnish Social Science Data Archive, The Language Bank of Finland, and Fairdata’s IDA and Qvain offer domestic solutions for publishing data and related metadata, while Zenodo or EUDAT B2Share are some of the international service provider options.
Data protection refers to the safeguarding of personal data. The notion of personal data is broad, and what qualifies as personal data is any information that either directly or indirectly enables the identification of a person, for example by connecting an individual piece of information to another piece of information. More information on personal data: Office of the Data Protection Ombudsman. Personal data processing related to studies must adhere to the principles of the University of Vaasa data protection policy: University of Vaasa Information Security Policy.
The student collecting personal data acts as the data controller.