Breakout group about metadata

frankfurt-2017
breakout-group

#1

Here we want to discuss the issue of providing metadata about data.

- What information should be included in the metadata for tables and values?
- The idea is to define a minimum set of information that must be provided.
- The type of the data determines which information must be provided about it (standards need to be defined).
- The terms used should either appear in the glossary of the openmod wiki or be defined there if they are not yet in the glossary.
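
To make the "minimum set per data type" idea concrete, here is a minimal sketch of a check that reports which required entries are missing. The key names and the two data types (`table`, `value`) are assumptions for illustration only; the actual minimum set is exactly what this group still has to agree on.

```python
# Hypothetical minimum sets; the real keys would be decided by the group.
COMMON_KEYS = {"title", "description", "license", "sources"}
EXTRA_KEYS_BY_TYPE = {
    "table": {"fields"},  # assumed: tables need a column description
    "value": {"unit"},    # assumed: single values need a unit
}


def missing_metadata(metadata, data_type):
    """Return the sorted list of required keys that are absent from `metadata`."""
    required = COMMON_KEYS | EXTRA_KEYS_BY_TYPE.get(data_type, set())
    return sorted(key for key in required if key not in metadata)


example = {
    "title": "Austria final energy consumption",
    "license": "Unknown",
    "unit": "TeraJoule",
}
print(missing_metadata(example, "value"))
```

A check like this could run whenever a dataset is submitted, so that incomplete metadata is flagged early.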


Do-a-thon: Draft for 'energy' datapackage standard (?)
#2
{ "title": "Austria final energy consumption",
  "description": "Final energy consumption per region and sector",
  "license": "Unknown",
  "keywords": [
    "Austria",
    "Energy",
    "Consumption",
    "Final energy consumption",
    "Time series"
  ],
  "version": "0.1",
  "last_updated": "2015-12-31",
  "image": "http://www.statistik.at/wcm/fragments/defaults/assets/logo_statistik.gif",
  "sources": [
    {
      "name": "Statistik Austria",
      "web": "http://www.statistik.at/web_de/statistiken/energie_umwelt_innovation_mobilitaet/energie_und_umwelt/energie/energiebilanzen/index.html"
    }
  ],
  "output": "output_consumption.csv",
  "info": {
    "note": "",
    "energy_definition": "final gross?",
    "energy_source": "total",
    "energy_type": "total",
    "created_on": "2017-04-03",
    "unit": "TeraJoule",
    "energy_vector": "total",
    "owner": "Statistik Austria",
    "measure_type_eustat_flag": "a",
    "nuts_level": 3,
    "nuts_label": "",
    "sector_eustat_code": "",
    "publication_year": 2016,
    "description": "",
    "geom": ""
  },
  "read": {
    "skip_rows": 23,
    "skip_footer": 9,
    "index_col": 0
  },
  "resources": [
    {
      "name": "burgenland",
      "type": "excel",
      "path": "energiebilanz_burgenland_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065501",
      "info": {
        "nuts_code": "AT11"
      }
    },
    {
      "name": "kaernten",
      "type": "excel",
      "path": "energiebilanz_kaernten_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065502",
      "info": {
        "nuts_code": "AT21"
      }
    },
    {
      "name": "niederoesterreich",
      "type": "excel",
      "path": "energiebilanz_niederoesterreich_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065503",
      "info": {
        "nuts_code": "AT12"
      }
    },
    {
      "name": "oberoesterreich",
      "type": "excel",
      "path": "energiebilanz_oberoesterreich_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065504",
      "info": {
        "nuts_code": "AT31"
      }
    },
    {
      "name": "salzburg",
      "type": "excel",
      "path": "energiebilanz_salzburg_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065505",
      "info": {
        "nuts_code": "AT32"
      }
    },
    {
      "name": "steiermark",
      "type": "excel",
      "path": "energiebilanz_steiermark_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065506",
      "info": {
        "nuts_code": "AT22"
      }
    },
    {
      "name": "tirol",
      "type": "excel",
      "path": "energiebilanz_tirol_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065507",
      "info": {
        "nuts_code": "AT33"
      }
    },
    {
      "name": "vorarlberg",
      "type": "excel",
      "path": "energiebilanz_vorarlberg_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065508",
      "info": {
        "nuts_code": "AT34"
      }
    },
    {
      "name": "wien",
      "type": "excel",
      "path": "energiebilanz_wien_1988_bis_2015_detailinformation.xlsx",
      "web": "http://www.statistik.at/wcm/idc/idcplg?IdcService=GET_NATIVE_FILE&RevisionSelectionMethod=LatestReleased&dDocName=065509",
      "info": {
        "nuts_code": "AT13"
      }
    }
  ]
}
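
The `read` block in the draft looks like it carries options for loading the Excel files. As a rough sketch, a reader could turn the descriptor into one load instruction per resource. The mapping of `skip_rows`, `skip_footer` and `index_col` onto the `skiprows`, `skipfooter` and `index_col` arguments of `pandas.read_excel` is my assumption, not something the draft states, and `read_plan` is a hypothetical helper name. The descriptor below is shortened to one resource.

```python
def read_plan(descriptor):
    """Yield (name, path, kwargs) for each resource in the descriptor."""
    read = descriptor.get("read", {})
    # Assumed mapping from the draft's keys to pandas.read_excel keyword names.
    kwargs = {
        "skiprows": read.get("skip_rows", 0),
        "skipfooter": read.get("skip_footer", 0),
        "index_col": read.get("index_col"),
    }
    for resource in descriptor.get("resources", []):
        # Copy the kwargs so callers can modify them per resource.
        yield resource["name"], resource["path"], dict(kwargs)


descriptor = {
    "read": {"skip_rows": 23, "skip_footer": 9, "index_col": 0},
    "resources": [
        {
            "name": "burgenland",
            "type": "excel",
            "path": "energiebilanz_burgenland_1988_bis_2015_detailinformation.xlsx",
            "info": {"nuts_code": "AT11"},
        }
    ],
}

for name, path, kwargs in read_plan(descriptor):
    print(name, path, kwargs)
```

With pandas installed, each tuple could then be passed on as `pandas.read_excel(path, **kwargs)`.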

#3

#4

Do we have to refer to the glossary? Is it too general? Definitions can change and can come from different perspectives.


#5

Hi Ludwig, I noticed that your oedb ‘language’ metadata field contains the values “eng” and “ger”. You may be better off using standardized IETF language tags like “en” or “en-US” or “de” or “de-DE” instead. These tags are more widely recognized.
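
If it helps, converting existing values could be as small as a lookup table. This is only a sketch: the table covers just the two values I saw plus "deu" (the other ISO 639-2 code for German), and `to_language_tag` is a made-up helper name.

```python
# Illustrative lookup from ISO 639-2 alpha-3 codes to short IETF language tags.
# Only the codes mentioned in this thread (plus "deu") are covered.
ALPHA3_TO_TAG = {"eng": "en", "ger": "de", "deu": "de"}


def to_language_tag(code):
    """Map a known alpha-3 code to an IETF tag; pass anything else through."""
    cleaned = code.strip()
    return ALPHA3_TO_TAG.get(cleaned.lower(), cleaned)


print(to_language_tag("eng"), to_language_tag("ger"), to_language_tag("en-US"))
```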


#6

I try to stay close to the INSPIRE metadata standard for geodata.
In 2.2.7 Resource Language it is defined to use ISO 19115 -> (ISO/TS 19139), based on the alpha-3 codes of ISO 639-2.


#7

Thanks to Robbie's feedback I can announce a new version (v1.3) of the oedb metadata set.
The main topics included in the set are:
Title
Description
Language
Reference Date
Spatial
Temporal
Sources
License
Contributors
Fields

The major changes are temporal information for time series and additional information for the resources, so that the set is also applicable to non-database files.