Import/Catalogue/Address import for Torino

About

This page describes the import of addresses from the data provided by the Municipality of Torino (Italy).

The import has been discussed on the Piedmontese OSM mailing list and the Italian OSM mailing list. This wiki page is the result of consensus there.

Import Plan Outline

Goals

The goal of this import is to use the high-quality dataset provided by the Municipality of Torino to steadily improve the addresses available in OSM. It will not be a blind import: the data will be checked.

Schedule

The import took place from 13 to 15 March 2017.

Import Data

Background

Address format in Torino

House numbering in Torino follows the European scheme.

An address in Torino is determined by its streetname and housenumber.

A housenumber is also unique per street.

Housenumbers can include a subordinate, written as a suffix letter (e.g. in "7a", "a" is the subordinate). Subordinates usually arise when a new house is built between existing houses with consecutive housenumbers. For example, when a house is built between numbers 7 and 9, the new house will most likely get number 7a (since even numbers are reserved for the other side of the street).
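The numbering scheme above can be illustrated with a minimal Python sketch. It assumes housenumbers are written exactly as described here (digits followed by an optional suffix letter); the function names are hypothetical and any other notation is simply rejected.

```python
import re

# Assumption: a housenumber is digits plus an optional letter suffix, e.g. "7" or "7a".
_HOUSENUMBER = re.compile(r"^(\d+)\s*([A-Za-z]*)$")

def parse_housenumber(value):
    """Split a housenumber into (number, subordinate); return None if it does not match."""
    match = _HOUSENUMBER.match(value.strip())
    if match is None:
        return None
    return int(match.group(1)), match.group(2).lower()

def same_side(number_a, number_b):
    """Odd and even numbers lie on opposite sides, so equal parity means same side."""
    return number_a % 2 == number_b % 2

print(parse_housenumber("7a"))
print(same_side(7, 9), same_side(7, 8))
```

Sorting by the returned tuple places "7a" between "7" and "9", which matches the way subordinates are assigned.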

Legal

Data source site (as at March 2017): http://aperto.comune.torino.it/?q=node/504
Data license: http://www.dati.gov.it/iodl/2.0/
Type of license: IODL v2.0
OSM attribution: http://wiki.openstreetmap.org/wiki/Contributors#Torino
ODbL Compliance verified: yes

From the IODL 2.0 license (in Italian): "indicare la fonte delle Informazioni e il nome del Licenziante, includendo, se possibile, una copia di questa licenza o un collegamento (link) ad essa."

Translation: "state the source of the Information and the name of the Licensor, including, if possible, a copy of this license or a link to it."

It should be enough to add the attribution to the Contributors page, as was already done for Venice.

Import Type

The dataset will be normalized, and addresses that already exist in OpenStreetMap will be set aside.

Existing OpenStreetMap addresses that are not present in the dataset will be analyzed and merged later.

Data Preparation

Tagging Plans

The data is provided as a CSV file consisting of point features, one for each housenumber.

Each row has the following fields:

  • COD_CIVICO: incremental number for internal reference
  • COMUNE: city name
  • VIA: street name
  • CIVICO: house number
  • CIRCOSCRIZIONE: district
  • CAP: zip code
  • COORDINATA_X: longitude
  • COORDINATA_Y: latitude

The tags that will be used in the final upload are addr:housenumber, addr:street, addr:postcode.

The tags will be as follows:

  • addr:housenumber will contain the housenumber.
  • addr:street will contain the street name normalized to follow Italian conventions.
  • addr:postcode will contain CAP.
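The field-to-tag mapping above can be sketched in Python as follows. This is only an illustration: the comma delimiter, the upper-case street names, the particle list, the `normalize_street` and `row_to_node` helpers and the sample row are all assumptions, not part of the dataset description or of the tooling actually used.

```python
import csv
import io

# Assumption: these particles stay lower-case inside Italian street names.
LOWERCASE_PARTICLES = {"di", "del", "della", "dei", "degli", "delle", "da", "e"}

def normalize_street(raw):
    """Hypothetical normalization, e.g. 'VIA GIUSEPPE VERDI' -> 'Via Giuseppe Verdi'."""
    words = raw.strip().lower().split()
    result = []
    for index, word in enumerate(words):
        if index > 0 and word in LOWERCASE_PARTICLES:
            result.append(word)
        else:
            result.append(word.capitalize())
    return " ".join(result)

def row_to_node(row):
    """Turn one CSV row into a node description carrying the three addr:* tags."""
    return {
        "lat": float(row["COORDINATA_Y"]),
        "lon": float(row["COORDINATA_X"]),
        "tags": {
            "addr:housenumber": row["CIVICO"].strip(),
            "addr:street": normalize_street(row["VIA"]),
            "addr:postcode": row["CAP"].strip(),
        },
    }

# Made-up sample row, for illustration only.
SAMPLE = (
    "COD_CIVICO,COMUNE,VIA,CIVICO,CIRCOSCRIZIONE,CAP,COORDINATA_X,COORDINATA_Y\n"
    "1,TORINO,VIA GIUSEPPE VERDI,7a,1,10124,7.6900,45.0690\n"
)
nodes = [row_to_node(row) for row in csv.DictReader(io.StringIO(SAMPLE))]
print(nodes[0]["tags"])
```

A real normalization would also need to handle apostrophes, abbreviations and proper-name exceptions, which this sketch ignores.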

Dedicated upload account

The account used for the import is Ale_Zena_IT-import.

Changeset Tags

Changesets will be tagged with source=Comune di Torino.

Data Transformation

The original CSV file was imported into a PostGIS database table. The projection is EPSG:4326 (WGS84 latitude-longitude).

The table was joined with the Italian copy of the OSM database (updated several times a day) to search for duplicates.

Points already in OSM (matched on normalized street name and housenumber) were marked and excluded from the import phase.
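The duplicate check was performed as a join in PostGIS; the Python sketch below only illustrates the same matching idea on in-memory lists. The `address_key` and `split_new_and_existing` helpers and the sample addresses are hypothetical.

```python
def address_key(street, housenumber):
    """Comparison key that ignores case and repeated whitespace."""
    return (" ".join(street.lower().split()), housenumber.strip().lower())

def split_new_and_existing(candidates, osm_addresses):
    """Separate candidate (street, housenumber) pairs into new ones and ones already in OSM."""
    existing_keys = {address_key(street, number) for street, number in osm_addresses}
    new, existing = [], []
    for street, number in candidates:
        if address_key(street, number) in existing_keys:
            existing.append((street, number))
        else:
            new.append((street, number))
    return new, existing

# Made-up data, for illustration only.
candidates = [("Via Roma", "7"), ("Via Roma", "7a"), ("Via Po", "12")]
in_osm = [("via  roma", "7A")]
new, existing = split_new_and_existing(candidates, in_osm)
print(new)
print(existing)
```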

Data Merge Workflow

Addresses already in OSM can be extracted using the following Overpass query:

<code>
<osm-script>
  <!-- Select the area of the municipality of Torino -->
  <query into="comune" type="area">
    <has-kv k="admin_level" v="8"/>
    <has-kv k="name" v="Torino"/>
  </query>
  <union>
    <!-- Nodes and ways inside that area that carry a housenumber -->
    <query type="node">
      <area-query from="comune"/>
      <has-kv k="addr:housenumber"/>
    </query>
    <query type="way">
      <area-query from="comune"/>
      <has-kv k="addr:housenumber"/>
    </query>
    <!-- Also fetch the nodes of the selected ways -->
    <item/>
    <recurse type="down"/>
  </union>
  <print mode="meta"/>
</osm-script>
</code>

Workflow

To keep the data manageable, one changeset will be uploaded for each CAP (postcode) zone; there are 36 zones in total.
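Splitting the upload by CAP can be sketched as a simple grouping step. The node layout (a dict with a `tags` entry) and the `group_by_postcode` helper are assumptions made for this illustration.

```python
from collections import defaultdict

def group_by_postcode(nodes):
    """Group address nodes by addr:postcode so each group can become one changeset."""
    batches = defaultdict(list)
    for node in nodes:
        batches[node["tags"]["addr:postcode"]].append(node)
    return dict(batches)

# Made-up nodes, for illustration only.
nodes = [
    {"tags": {"addr:street": "Via Roma", "addr:housenumber": "7", "addr:postcode": "10121"}},
    {"tags": {"addr:street": "Via Po", "addr:housenumber": "12", "addr:postcode": "10124"}},
    {"tags": {"addr:street": "Via Roma", "addr:housenumber": "9", "addr:postcode": "10121"}},
]
batches = group_by_postcode(nodes)
for postcode in sorted(batches):
    print(postcode, len(batches[postcode]))
```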

Before each upload, points that fall outside the city boundary will be deleted, since they indicate a coordinate error.
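One way to detect such points is a point-in-polygon test against the city boundary. The sketch below uses the standard ray-casting technique; the rectangular ring is a toy stand-in, not the real boundary of Torino, and the function name is hypothetical.

```python
def point_in_polygon(lon, lat, ring):
    """Ray casting: count how many polygon edges a ray going east from the point crosses."""
    inside = False
    count = len(ring)
    for i in range(count):
        x1, y1 = ring[i]
        x2, y2 = ring[(i + 1) % count]
        # Only edges that straddle the point's latitude can be crossed.
        if (y1 > lat) != (y2 > lat):
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside

# Toy boundary as (lon, lat) pairs; an assumption for illustration only.
TOY_BOUNDARY = [(7.5, 45.0), (7.8, 45.0), (7.8, 45.2), (7.5, 45.2)]

points = [(7.69, 45.07), (8.50, 45.07)]
kept = [p for p in points if point_in_polygon(p[0], p[1], TOY_BOUNDARY)]
print(kept)
```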

If a problem occurs during the import, the changeset will be reverted using the JOSM Reverter plugin.

QA

A project has been set up on the OSM Tasking Manager to manage further adjustments arising from name inconsistencies.

Street names

After the import, addr:street values may differ slightly from the names of the corresponding streets.

These differences should be caught using OSM Inspector (map already centered on Torino).

Unmarked streets

The imported addresses can be used to locate areas where streets are missing.

Missing roads will be created in JOSM using PCN 2012 aerial imagery.

Unnamed streets

The imported addresses can be used to derive street names for unnamed streets when all the nodes along the street have the same addr:street value.
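The rule above can be expressed as a small check. This sketch assumes the addr:street values of the nodes along one unnamed street have already been collected; the `derive_street_name` helper is hypothetical, and any suggestion it returns would still need manual review.

```python
def derive_street_name(addr_streets):
    """Return the shared addr:street value if all nodes agree, otherwise None."""
    names = {name for name in addr_streets if name}
    if len(names) == 1:
        return names.pop()
    return None

print(derive_street_name(["Via Po", "Via Po", "Via Po"]))
print(derive_street_name(["Via Po", "Via Roma"]))
```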

Missing road names will be identified using the OpenStreetMap NoName Map Overlay:
tms:http://tile3.poole.ch/noname/{zoom}/{x}/{y}.png

OSM Inspector can also be used to find these streets.