Breakout group about metadata

Here we want to discuss how to provide metadata about data.

- What information should the metadata for tables and values include?
- The idea is to define a minimum set of information that must be provided.
- The type of the data determines which information has to be provided about it (standards need to be defined).
- The terms used should either come from the glossary of the openmod wiki or be defined there if they are not yet in the glossary.
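
The idea of a minimum set could be checked mechanically once it is agreed on. The following is only a rough sketch: the field names in `REQUIRED_FIELDS` and the helper `missing_fields` are assumptions made up for illustration, not an agreed standard, and the example record is shortened with a deliberately empty license.

```python
# Hypothetical minimum set; the actual required fields are still to be agreed on.
REQUIRED_FIELDS = ("title", "description", "license", "sources")


def missing_fields(metadata: dict, required=REQUIRED_FIELDS) -> list:
    """Return the required keys that are absent or empty in the metadata."""
    missing = []
    for key in required:
        value = metadata.get(key)
        # Treat None, "", [] and {} as "not provided"; 0 and False still count.
        if value is None or value == "" or value == [] or value == {}:
            missing.append(key)
    return missing


# Illustrative record only; the license is left empty on purpose.
example = {
    "title": "Austria final energy consumption",
    "description": "Final energy consumption per region and sector",
    "license": "",
    "sources": [{"name": "Statistik Austria"}],
}
print(missing_fields(example))
```

A different minimum set per data type (as suggested above) could be handled by passing another tuple as `required`.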

Example metadata file (JSON):

{
  "title": "Austria final energy consumption",
  "description": "Final energy consumption per region and sector",
  "license": "Unknown",
  "keywords": [
    "Austria",
    "Energy",
    "Consumption",
    "Final energy consumption",
    "Time series"
  ],
  "version": "0.1",
  "last_updated": "2015-12-31",
  "image": "http://www.statistik.at/wcm/fragments/defaults/assets/logo_statistik.gif",
  "sources": [
    {
      "name": "Statistik Austria",
      "web": "http://www.statistik.at/web_de/statistiken/energie_umwelt_innovation_mobilitaet/energie_und_umwelt/energie/energiebilanzen/index.html"
    }
  ],
  "output": "output_consumption.csv",
  "info": {
    "note": "",
    "energy_definition": "final gross?",
    "energy_source": "total",
    "energy_type": "total",
    "created_on": "2017-04-03",
    "unit": "TeraJoule",
    "energy_vector": "total",
    "owner": "Statistik Austria",
    "measure_type_eustat_flag": "a",
    "nuts_level": 3,
    "nuts_label": "",
    "sector_eustat_code": "",
    "publication_year": 2016,
    "description": "",
    "geom": ""
  },
  "read": {
    "skip_rows": 23,
    "skip_footer": 9,
    "index_col": 0
  },
  "resources": [
    {
      "name": "burgenland",
      "type": "excel",
      "path": "energiebilanz_burgenland_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065501",
      "info": {
        "nuts_code": "AT11"
      }
    },
    {
      "name": "kaernten",
      "type": "excel",
      "path": "energiebilanz_kaernten_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065502",
      "info": {
        "nuts_code": "AT21"
      }
    },
    {
      "name": "niederoesterreich",
      "type": "excel",
      "path": "energiebilanz_niederoesterreich_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065503",
      "info": {
        "nuts_code": "AT12"
      }
    },
    {
      "name": "oberoesterreich",
      "type": "excel",
      "path": "energiebilanz_oberoesterreich_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065504",
      "info": {
        "nuts_code": "AT31"
      }
    },
    {
      "name": "salzburg",
      "type": "excel",
      "path": "energiebilanz_salzburg_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065505",
      "info": {
        "nuts_code": "AT32"
      }
    },
    {
      "name": "steiermark",
      "type": "excel",
      "path": "energiebilanz_steiermark_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065506",
      "info": {
        "nuts_code": "AT22"
      }
    },
    {
      "name": "tirol",
      "type": "excel",
      "path": "energiebilanz_tirol_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065507",
      "info": {
        "nuts_code": "AT33"
      }
    },
    {
      "name": "vorarlberg",
      "type": "excel",
      "path": "energiebilanz_vorarlberg_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065508",
      "info": {
        "nuts_code": "AT34"
      }
    },
    {
      "name": "wien",
      "type": "excel",
      "path": "energiebilanz_wien_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065509",
      "info": {
        "nuts_code": "AT13"
      }
    }
  ]
}
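
One possible way to use the "read" and "resources" entries of the metadata above is to turn them into arguments for a spreadsheet reader. This is a sketch under assumptions: the mapping of `skip_rows`, `skip_footer` and `index_col` to the `pandas.read_excel` keywords `skiprows`, `skipfooter` and `index_col` is my guess at the intent, and `read_excel_jobs` is a hypothetical helper name.

```python
import json

# Assumed mapping from the "read" keys in the metadata to pandas.read_excel keywords.
READ_KEY_TO_PANDAS = {
    "skip_rows": "skiprows",
    "skip_footer": "skipfooter",
    "index_col": "index_col",
}


def read_excel_jobs(metadata: dict) -> list:
    """Build one (name, path, kwargs) tuple per resource of type "excel"."""
    kwargs = {
        READ_KEY_TO_PANDAS[key]: value
        for key, value in metadata.get("read", {}).items()
        if key in READ_KEY_TO_PANDAS
    }
    return [
        (res["name"], res["path"], dict(kwargs))
        for res in metadata.get("resources", [])
        if res.get("type") == "excel"
    ]


# Shortened copy of the metadata above, kept inline so the sketch is self-contained.
metadata = json.loads("""
{
  "read": {"skip_rows": 23, "skip_footer": 9, "index_col": 0},
  "resources": [
    {"name": "burgenland", "type": "excel",
     "path": "energiebilanz_burgenland_1988_bis_2015_detailinformation.xlsx",
     "info": {"nuts_code": "AT11"}}
  ]
}
""")

for name, path, kwargs in read_excel_jobs(metadata):
    print(name, path, kwargs)
```

With pandas installed and the files downloaded, each job could then be passed on as `pandas.read_excel(path, **kwargs)`.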

Do we have to refer to the glossary? Is it too general? The definition can change and can come from different perspectives.

Hi Ludwig, I noticed your oedb ‘language’ metadata field contains the values “eng” and “ger”. You may be better off using standardized IETF language tags like “en” or “en-US” or “de” or “de-DE” instead. These tags are more widely recognized.

I try to stay close to the INSPIRE metadata standard for geodata.
In 2.2.7 Resource Language it is defined to use ISO 19115 -> (ISO/TS 19139), based on the alpha-3 codes of ISO 639-2.
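
The two conventions could coexist if the alpha-3 codes are stored and converted to IETF tags when needed. A minimal sketch, assuming a hand-written lookup that covers only the two languages mentioned here; the function names `to_ietf` and `to_alpha3` are hypothetical, and a real implementation would need a complete ISO 639 table.

```python
# Hand-written lookup for the two languages discussed in this thread only.
# "ger" is the ISO 639-2 bibliographic code for German, "deu" the terminological one.
ALPHA3_TO_IETF = {"eng": "en", "ger": "de", "deu": "de"}
IETF_TO_ALPHA3 = {"en": "eng", "de": "ger"}


def to_ietf(alpha3: str) -> str:
    """Convert an ISO 639-2 alpha-3 code to a plain IETF language tag."""
    code = alpha3.strip().lower()
    if code not in ALPHA3_TO_IETF:
        raise KeyError(f"no mapping for alpha-3 code {alpha3!r}")
    return ALPHA3_TO_IETF[code]


def to_alpha3(ietf_tag: str) -> str:
    """Convert an IETF tag such as "de-DE" to an alpha-3 code via its primary subtag."""
    primary = ietf_tag.strip().split("-")[0].lower()
    if primary not in IETF_TO_ALPHA3:
        raise KeyError(f"no mapping for IETF tag {ietf_tag!r}")
    return IETF_TO_ALPHA3[primary]


print(to_ietf("ger"), to_alpha3("en-US"))
```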

Thanks to Robbie's feedback I can announce a new version (v1.3) of the oedb metadata set.
The main topics included in the set are:
Title
Description
Language
Reference Date
Spatial
Temporal
Sources
License
Contributors
Fields

The major changes are temporal information for time series and additional info for the resources, so that the set is also applicable to non-database files.

The OEP metadata v1.3 (see above) has been in use for one year now. We are currently discussing an update to v1.4:

There will be an RDF version that is connected to the ontology (also in progress).
Everybody is invited to join the discussion and contribute! Deadline is 7.9.2018.

This thread on automatically recovering metadata from OKI data packages might be relevant too.

An advanced metadata version has been created. See Do-a-thon: Draft for 'energy' datapackage standard (?)