Import of Tenerife's hydraulic infrastructures

From OpenStreetMap Wiki
Jump to navigation Jump to search

Import propossal for http://tareas.openstreetmap.es/project/44

Goal

The Hydrologycal Plan of Tenerife (PHT) contains a list of hydraulic infrastructures (rafts, reservoirs, dams, wells, treatment plants, etc.) that are interesting to incorporate into the map of Tenerife. The list includes for each infrastructure its name, type and coordinates of its location. With this import it is intended:

  1. Merge from this list the name of the infrastructures with those existing in OSM.
  2. Use the coordinates of the list to locate on the aerial image the elements that do not exist in OSM, draw infrastructures that correspond to ways and correct the position -in case of error- if it corresponds to nodes. New elements will be added to OSM only if they can be checked on another data source.

Schedule

This is a long-term work without strict planning. Important milestones will be noted here.

Import Data

Background

Permission

Spanish legal provisions and their corresponding projects are not subjected to copyrights according to Article 13 of the Spain Intellectual Property Law.

Data description

Documento PDF (1.5 Mb) with a list of hydraulic infrastructures. The list is divided into pages, each page contains in the header the type of Infrastructure whose value is used to define to which map feature it corresponds in OSM and apply the corresponding tags. These are the fields from the list that will be used in the import.

  • Denominación (denomination): Infrastructure names.
  • Estado (state): Values EX = Exist, PR = Predicted and PL = Planned. Only existing infrastructures (EX) will be used.
  • X: UTM coordinate X in projection EPSG: 32628.
  • Y: UTM coordinate Y in projection EPSG: 32628.

The list includes 5580 infrastructures.

Import Type

This is a one time, full manual import.

Data Preparation

Data Reduction & Simplification

After filtering only the existing infrastructures and those with coordinates, results a total of 3451 infrastructures. After eliminating the types of infrastructure not selected for import (see below) results a total of 2478 infrastructures.

Tagging Plans

Each element will have the following tags:


Original field OSM tag OSM value Comment
Denominación name=* + alt_name=* * In upcase
X, Y <node lat=* lon=*> Geographic coordinates In EPSG:32628 projection

The value of Tipo de Infraestructura (type of infraestructure) in the page header is used to apply tags acording to this table:

Type of infraestructure Count OSM element Tags Comment Example
- PRESA
- BALSA
- BALSA DE AGUA REGENERADA
146 Area area landuse=reservoir
reservoir_type=water_storage
Not covered water storage infrastructures.

way 28975979
- DEPÓSITO ABASTECIMIENTO
- DEPOSITO CUBIERTO DE AGUA REGENERADA
- DEPÓSITO DE CABECERA AGUA DESALADA
- DEPÓSITO INTERNO EDAM
538 area area building=yes
man_made=reservoir_covered
content=water
Covered water storage infrastructures.

way 208839002
- CENTRAL DE TURBINADO 2 area area building=yes

power=generator generator:method=water-storage

Hydroelectric power plant.

way 476095077

- ESTACIÓN DE BOMBEO AGUA DESALADA
- ESTACIÓN DE BOMBEO ABASTECIMIENTO
- ESTACIÓN DE BOMBEO DE AGUAS REGENERADAS
12 area area building=yes

man_made=pumping_station
pumping_station=water

Water pumping infraestructure.

way 476095088
- EBAR 109 area area building=yes

man_made=pumping_station
pumping_station=wastewater

Waste water pumping infraestructures.

way 476095086

- EDAM
- EDAS
49 area area landuse=industrial
man_made=water_works
Infraestructures to transform sea water (EDAM) or brakish underground water (EDAS).

way 461901634

- ERA
- EDAR
- ETAR
- ESTACIÓN DE TRATAMIENTO PREVIO
101 area area landuse=industrial
man_made=wastewater_plant
Infraestructures related to waste water treatment.

way 189876063
- POZO CONVENCIONAL
- POZO SONDEO
397 Nodo node man_made=water_well Water well for underground water catchment with a vertical drilling.

node 2469328641

- GALERÍA CONVENCIONAL
- GALERÍA NACIENTE
- GALERÍA POZO
- GALERÍA SOCAVÓN
1124 Nodo node man_made=adit
resource=water
Man made horizontal adit to capture underground water.

node 4697452924
- MANANTIAL
- POZO DE CAPTACIÓN DE AGUA DE MAR
- POZO ORDINARIO
- TOMADERO
- CÁMARA DE MEZCLA
- ESTACIÓN METEOROLÓGICA
- SONDEO INVESTIGACIÓN
- PUNTOS DE CONEXIÓN
- PUNTO DE SUMINISTRO DE AGUA DESALINIZADA
- POZO ABSORBENTE
- POZO DE VERTIDO DE SALMUERA AGUA DESALADA
1232 N/A N/A Infrastructures discarded for OSM for different reasons, mainly to be difficult to locate over aerial image.

Changeset Tags

  • source=Plan Hidrológico de Tenerife; other data sources
  • type=import
  • url=http://wiki.openstreetmap.org/wiki/Tenerife_streams_import

Data Transformation

  • A batch script infraesturas.sh is used to:
    • Converts the PDF document to text format.
    • Delete headers, footers and other texts that are not part of the infrastructure data.
    • Filter only existing infrastructures and those with coordinates.
    • Transform names by correcting the use of capital letters, spelling, abbreviations.
    • Separate the main name and the alternative ones if any.
    • Put the initial article at the beginning when they are at the end in parentheses.
    • The generic Deposit, Presa, Gallery, is added at the beginning of the name, depending on the case.
    • Convert the coordinates and reproject to EPSG: 4326.
    • Eliminates the infrastructures of discarded types.
    • Generates an XML OSM file assigning to each element tags according to the tagging plan.

cuadricula.py: generates a geojson file with a 2 km2 grid to split the data in tasks.

split.py: generats OSM files with the data splited by tasks.

Data Transformation Results

The software used and its results are hosted in this repository

  • infraestructuras.csv: Sheet with the infraestructures data.
  • infraestructuras.osm: Full data set to import.
  • task/task???.osm: Data set splited in 449 files.

Workflow

Team Approach

Import will be undertaken by experienced OSM volunteers, following a strict workflow.

Tools

Preliminary steps

  1. Create an account like usuario_imports with an email not already registered in OSM. If you use gmail, you can use your usual address adding "+imports" before the 'at'. Example: peter+imports@gmail.com
  2. Put this account in JOSM settings.
  3. Enable the remote control in JOSM: Settings (F12) > Remote control > Enable remote control.
  4. Install JOSM/Plugins/TODO_list.
  5. Download the desired files from here.
  6. Give authorization to use your openstreetmap account in the tasking manager in the link 'login to OpenStreetMap'.
  7. Select an area and clic in "Start contributing".
  8. Open this area in JOSM with the link 'Edit with Josm'.

Main steps

  1. Load the import OSM file corresponding to this area. You shouldn't upload this file.
  2. Select all in this layer and add to the ToDo list.
  3. Zoom to a node in the ToDo list. Check if it exists in the OSM data.
    1. If not exists and it correspond to an area draw it in the OSM layer using background aerial imagery.
    2. If it don't exists and it correspond to a node check the position and copy the node from the import layer to the OSM layer.
    3. If you can't locate the infraestructure using aerial imagery, ignore it and don't import.
    4. If it exists, check position and copy the name tag.
  4. Repeat until you clear the ToDo list.

Conflation

  1. If the OSM infraestructure it's not located in the position indicated by the PHT, check with aerial imagery.
  2. If there exists a conflict between the name in OSM and PHT, the last is the oficial data.

Last notes

  1. Make sure to use the user created in Josm for the import.
  2. Upload the edits to OSM assigning the #Changeset Tags.
  3. Mark the task as done in the tasking manager.

Regresion plan

In case of any trouble, JOSM/Plugins/reverter will be used.

Updates

These are static data, no updates are expected.

Quality Assurance

Each area edit can be validated in the tasking manager.