RDLRDLEcosystem
News

Why Metadata Matters (and How to Write Good Metadata)

Metadata is the data about your data — and it decides whether your work can be found and reused. Learn what good metadata contains and how to write it so both humans and machines understand your dataset.

The data about your data

Metadata is the structured description of a dataset — its title, creators, date, methods, variables, and licence. It is small, but it does the heavy lifting: metadata is what search engines index, what repositories display, and what lets a future reader understand a file they did not create.

Why it decides discoverability

A dataset with no metadata is invisible. Even a brilliant dataset cannot be found, cited, or reused if nothing describes it. Rich metadata is the first of the FAIR principles (Findable) and the foundation for the rest.

What good metadata contains

  • Who — creators and contributors (ideally with ORCID iDs).
  • What — a clear title and description of the dataset's content.
  • When — collection and publication dates.
  • How — the methods, instruments, and processing steps.
  • Terms — the licence and any access conditions.
  • Structure — what each file, table, and field contains, with units.

Tips for writing it well

Write for a stranger, not your future self. Prefer standard vocabularies over free text where they exist, spell out abbreviations, and never assume context that lives only in your head. Good metadata takes minutes and pays off for years.