Data quality and metadata

Even when published under an open licence and adhering to FAIR data principles, data must still be quality in order to be usable

The quality of data is determined by a set of practical, legal, social and technical requirements, defined by community-driven standards.

In addition to the open licence, there are three legal requirements that need to be considered by open data publishers. You must:

  • Protect sensitive information like personal data

  • Preserve the rights of data owners

  • Promote the correct use of data

In order to meet the practical requirements of quality open data, publishers must:

  • Link to the data from the relevant website

  • Update the data regularly if it changes

  • Commit to continue to make the data available

There are three recommendations that define the technical aspects of open data:

  • The format in which the data is published

  • The structure of the data

  • The channels through which the data is available

For data use to be sustainable, it is important to have an engaged community of users. The best datasets have:

  • Active support channels

  • Discussion groups and forums

  • Published how-to guides on working with the data

Good quality open data should also be referenced through metadata – data that describes the content, structure or use of a dataset. This can include basic information such as the name of the author, a description date created, date modified and file size.

Metadata helps makes data easier to find for potential users, helps them understand whether the data would be useful for them and understand how others may have used the data before them.

For more information about metadata, take a look at Opendatasoft's 'What is metadata and why is it as important as the data itself?'

Last updated