User-centric data publishing (Alpha)
  • User-centric data publishing
    • Introduction
    • Who is this toolkit for?
    • How to use this toolkit
    • Dictionary of data terms
  • Contents
  • Section 1. Building the foundation for open data
    • A basic introduction to open data
    • Understanding our rights to access data
    • Open data maturity
      • Resources: Open data maturity
    • Ethics and transparency
  • Section 2. Planning for impactful open data initiatives
    • An introduction to the Data Landscape Playbook
    • Play one: Explore the problem and how data can address it
    • Play two: Map the data ecosystem
    • Play three: Assess the policy, regulatory and ethical context
    • Play four: Assess the existing data infrastructure
    • Play five: Plan for impact when designing your data initiative
  • Section 3. A user-centric approach to publishing
    • Understanding the user journey
      • The use case
      • Understanding different user needs
      • Targeting intended audiences
    • Engaging effectively with data users
      • Two-way communication and feedback
      • From data to story
    • Building communities around open data use
      • Characteristics of an open data user community
        • Purpose
        • Community enabler(s)
        • Collaborative method
        • Other observations
      • The current landscape of open data user communities
      • Engagement with data communities
    • Resources: User-centric publishing
  • Section 4: Publishing guidance for new data publishers
    • Open data licensing
    • The FAIR principles of data access
      • FAIR data assessment tools
    • Data quality and metadata
      • Tools and frameworks to help you assess open data quality
    • Publishing data on the web
  • Thank you
Powered by GitBook
On this page
  1. Section 4: Publishing guidance for new data publishers

Data quality and metadata

Even when published under an open licence and adhering to FAIR data principles, data must still be quality in order to be usable

The quality of data is determined by a set of practical, legal, social and technical requirements, defined by community-driven standards.

In addition to the open licence, there are three legal requirements that need to be considered by open data publishers. You must:

  • Protect sensitive information like personal data

  • Preserve the rights of data owners

  • Promote the correct use of data

In order to meet the practical requirements of quality open data, publishers must:

  • Link to the data from the relevant website

  • Update the data regularly if it changes

  • Commit to continue to make the data available

There are three recommendations that define the technical aspects of open data:

  • The format in which the data is published

  • The structure of the data

  • The channels through which the data is available

For data use to be sustainable, it is important to have an engaged community of users. The best datasets have:

  • Active support channels

  • Discussion groups and forums

  • Published how-to guides on working with the data

Good quality open data should also be referenced through metadata – data that describes the content, structure or use of a dataset. This can include basic information such as the name of the author, a description date created, date modified and file size.

Metadata helps makes data easier to find for potential users, helps them understand whether the data would be useful for them and understand how others may have used the data before them.

PreviousFAIR data assessment toolsNextTools and frameworks to help you assess open data quality

Last updated 2 years ago

For more information about metadata, take a look at Opendatasoft's

'What is metadata and why is it as important as the data itself?'