Skip to content

Data Standardization

nicohoermann edited this page May 4, 2021 · 1 revision

author: Nicolas G. Hoermann

Data Standardization

In our database we try to follow certain data standards, more or less closely.Some informative websites are listed at the bottom. For maximum compatibility and flexibility we follow the Data Package standard. The Data Package specification does NOT impose any requirements on their form or structure and can therefore be used for packaging any kind of data.

Data Package

###Definitions

Data Package consists of:

  • Metadata that describes the structure and contents of the Data Package
  • Resources such as data files that form the contents of the Data Package

Metadata

  • stored in a "Descriptor".
  • Metadata Descriptor MUST be a valid JSON object.
  • MUST be named datapackage.json and it MUST be placed in the top-level directory (relative to any other resources provided as part of the Data Package)
  • MUST contain a resources property describing the data resources, in form of a list of resource descriptors
  • All other properties are considered metadata properties. The descriptor MAY contain any number of other metadata properties.

Resource

  • Files bundled locally with the Metadata Descriptor
  • Remote resources, referenced by URL
  • "Inline" data, included directly in the Descriptor
  • The Data Resource Descriptor, that is listed in the Metadata Descriptor MUST be a valid JSON object.

###Metadata example###

jsondict = {
  "name" : "a-unique-human-readable-and-url-usable-identifier",
  "title" : "A nice title",
  "licenses" : [ ... ],
  "sources" : [...],
  # list of the data resources in this data package
  "resources": [...],
  # optionally other json key-value pairs
}

###Data Resource example### 1 resource with all required, recommended and optional properties

[{
  "name": "example data",
  "path": "http://example.com/example.csv",
  "title": "Random data 0",
  "description": "My favourite random data about nothing.",
  "format": "csv",
  "mediatype": "text/csv",
  "encoding": "utf-8",
  "bytes": 1,
  "hash": "",
  "schema": "",
  "sources": "",
  "licenses": ""
}]

Links

General

Data Packages

Clone this wiki locally