3. Describing research data (documentation, metadata)

From the beginning of the research project a careful documentation of research data is important. It ensures the comprehensibility of the research data both for yourself and for others. It also decreases the risk of false interpretation and significantly promotes the findability and reusability of your data. Without proper description, research data is just a collection of random files, numbers, and signs, and documentation, data use and reuse may become impossible. Investing time in documenting the data during the research project will also save time when you are publishing the dataset.

Sharing the metadata is always recommended to increase the visibility of the research, even in situations where the research material itself cannot be opened for some reason.

To consider

  • What does documentation mean in your field?
  • How does the documentation show in your research data?
  • What kind of metadata do you need during your research project?
  • How would you describe the basic information of your research data in a README file?

Watch the video

Metadata is crucial for the data to be found: The Elements of FAIR – Findable, CSC (8:49).

In brief

Good data description includes concise information on

  • the context of data collection (e.g., project aim and objectives)
  • data collection methods (e.g., sampling, data collection process, instruments, hardware and software used)
  • structure of data files
  • quality assurance procedures carried out
  • version control
  • information on access and use conditions or data confidentiality
  • names, labels and descriptions for variables, records and their values
  • explanation or definition of codes and classification schemes used
  • definitions of specialist terminology or acronyms used
  • codes of, and reasons for, missing values.

(2023-08)

Next: 4. Data storage during and after a research project