Skip to content

POPIMPACT - Data harmonisation #16

@MayteTDGeograma

Description

@MayteTDGeograma

Describes the harmonization issues encountered in the POPIMPACT use case.

Implementation state

In progress

Use Case Scope

POPIMPACT- The object of this work is to obtain a layer of pan-European buildings to which the population has been assigned.

Spatial Scope / Priority Scope

3 scenarios, with 5 proofs of concepts, have been worked on to validate the use case

POC Definition Scope
POC001 cadastral buildings and 1km grid population Seville (Picture1)
POC002 cadastral buildings and 250m grid population Seville (Picture1)
POC003 cadastral buildings and census sections population Seville (Picture1)
POC004 Cadastral buildings of France and 1km grid population France (Picture2)
POC005 Cadastral buildings of La Palma and 250m grid population Area affected by the volcano of La Palma (Picture3)

Picture1.-Seville

image

Picture2.-France

image

Picture3.-La Palma--
image

#Source Schema and Format

Building layer
POCs Layer Format EPSG
POC001,POC002,POC003 A.ES.SDGC.BU.41900.building GML 25830
POC004 Bâtiments Shaperfile 2154,4326
POC005 A.ES.SDGC.BU.38045.building GML 32628
Population Layer
POCs Layer Format EPSG
POC001,POC004 JRC_POPULATION_2018 Shaperfile 3035
POC002 mep19_250 Shaperfile 25830
POC003 población-2019-secciones-censales-por-lustros-2 json 4326
POC005 Poblacion GeoJSON 4326

Source Dataset(s)

Building layer
POCs Layer URL
POC001,POC002,POC003 A.ES.SDGC.BU.41900.building http://ovc.catastro.meh.es/INSPIRE/wfsCP.aspx?
POC004 Bâtiments https://geoservices.ign.fr/parcellaire-express-pci
POC005 A.ES.SDGC.BU.38045.building http://ovc.catastro.meh.es/INSPIRE/wfsCP.aspx?
Population Layer
POCs Layer URL
POC001,POC004 JRC_POPULATION_2018 https://ec.europa.eu/eurostat/web/gisco/geodata/reference-data/population-distribution-demography/geostat
POC002 mep19_250 https://www.juntadeandalucia.es/institutodeestadisticaycartografia/datosespacialesestadisticos/index.htm
POC003 población-2019-secciones-censales-por-lustros-2 https://opendata.esri.es/datasets/ideSEVILLA::poblaci%C3%B3n-2019-secciones-censales-por-lustros-2/explore?location=37.392686%2C-5.992520%2C15.00
POC005 Poblacion https://www.opendatalapalma.es/datasets/poblaci%C3%B3n/explore?location=28.578079%2C-17.922026%2C13.71

Source Data Size

Layer Size Records
Building A.ES.SDGC.BU.41900.building 98 MB 58718
Bâtiments 21 GB 53851462
A.ES.SDGC.BU.38045.building 13 MB 12381
Population JRC_POPULATION_2018 676 MB 2416631
mep19_250 32 MB 52599
población-2019-secciones-censales-por-lustros-2 376 kB 531
Poblacion 5952 kB 11621

Target Schemas

The target schema used is based on:Buildings - Annex 3
link: https://inspire.ec.europa.eu/data-model/approved/r4618-ir/html/index.htm?goto=2:3:2:2:7911
image

Transformed Data size

POC Data result Size Records
POC001 results01_bulding_124a9cad_6d21_4037_a362_f41821d35054 35 MB 58718
POC002 results02_bulding_bb578db5_79c1_4a5c_a12c_9464437438ea 33 MB 58718
POC003 results03_bulding_b653d3c7_f678_4d74_a7ef_14796a0e16e6 35 MB 58718
POC004 results04_bulding_af1cb4a3_4694_4a82_853d_4f731b4e8647 31 GB 51896280
POC005 results05_bulding_b48186b6_28fd_49d6_89f4_fc67d3ea11b4 1096 kB 2599

Target Data set resources

None

Effort / Time Spent

POC Definition Time Spent
POC001 cadastral buildings and 1km grid population 00:19:57.556388
POC002 cadastral buildings and 250m grid population 00:10:39.677794
POC003 cadastral buildings and census sections population 01:24:12.19327
POC004 Cadastral buildings of France and 1km grid population 144 hr.
POC005 Cadastral buildings of La Palma and 250m grid population 00:00:24.621887

Transformation Project

None

Harmonisation problem

Harmonisation problem Definition Example
Difficulty of access It is very difficult to find the datasets for all member states. Buildings in Portugal.
Segmented data When data sets are highly segmented, a process has to be done to bring them together in order to process them as a whole. Building in Spain, separated into more than 8000 municipalities
Diversity of formats Each dataset can be in a different format. Building in France is in GeoJSON and in Spain in GML.
Data without semantic interoperability Each data has its own data model. Sometimes even the same organism does not write the attributes in the same way. Building of Spain different model on La Palma, with data model Inspire.
Different EPSGs Each data has a different EPSG. and even the same datum in different areas. Building of France.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions