Import/Catalogue/Hotels-RAFVG-umap

About

This page describes the import of the Hotels dataset published by Regione Autonoma Friuli Venezia Giulia (RAFVG), Italy. It is not a blind import: mappers shall check the source data through a support map.

The import is being discussed on the regional OSM mailing list. The import will be the result of consensus there.

Goals

This import aims to have a RAFVG-certified and updated set of POIs (OSM tourism=hotel) for the RAFVG territory (OSM admin_level=4).

Schedule

Starting from March 2019, the import will be performed through the Level0 OSM editor and a support umap. Progress can be tracked on the same map (orange layer). Depending on how many mappers are involved, the import should take 30 to 300 days to complete.

Import Data

Background

The source dataset contains 738 point objects (as of October 2017) without geographic coordinates; as stated in the metadata page, the records are "directly managed" by the municipality where each POI is registered. Because the regional OSM addresses were imported from a RAFVG dataset in 2014, the hotel addresses will be geocoded. The dataset addresses were provided by municipalities and compiled by hotel operators, so some POIs may not be geocoded correctly; the nodes uploaded to umap will therefore be a subset of the source data.

Metadata

As defined in the RAFVG metadata page, the dataset has the following properties:

  • provider: Anagrafe regionale delle strutture turistico-ricettive (ARSTR)
  • update frequency: 12 months
  • last update: 10/10/2017
  • refer to: Servizio turismo
  • license and attribution: IODL

Legal

Record format and tagging plan

The RAFVG dataset table structure will be pruned and adapted through OpenRefine; fields will be mapped according to the hotel wiki page. The table below lists the 18 candidate fields out of 93 input fields:

Field | Value | Mapped as | Notes
CODICE ESERCIZIO | 2 | ref=2 |
comune | GRADO | geocoding |
categoria | 3 Stelle *** | stars=3 |
denominazione | ZUBERTI | name=Zuberti |
indirizzo | PIAZZA CARPACCIO, 29 | geocoding |
cap | 34073 | geocoding |
telefono | 0431 80196 | phone=+39 0431 80196 | to be normalized
email | info@hotelzuberti.it | email=info@hotelzuberti.it |
sito | http://www.hotelzuberti.it | website=http://www.hotelzuberti.it |
n_camere | 9 | rooms=9 |
n_posti_letto | 25 | beds=25 |
periodo_apertura | 01.I 31.XII | opening_hours=01 Jan - 31 Dec | to be normalized
varie_interesse_storico | SI | historic=yes |
accessibile | Non accessibile | wheelchair=no |
ristorante | NO | restaurant=no |
tavola_calda | NO | fast_food=no |
bar | SI | bar=yes |
giochi_bambini | SI | playground=yes |
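
The mapping in the table can be illustrated with a short Python sketch. This is not the OpenRefine procedure used for the import; it only shows how one source row could be turned into OSM tags. The function name `row_to_tags` and the dictionary names are hypothetical, the `tourism=hotel` tag is taken from the Goals section, and using `str.title()` for the name is an assumption that will not suit every hotel name.

```python
import re

# Source values that mean yes/no in the RAFVG table.
YES_NO = {"SI": "yes", "NO": "no"}

# Source column -> OSM key for yes/no fields (taken from the table above).
BOOLEAN_FIELDS = {
    "varie_interesse_storico": "historic",
    "ristorante": "restaurant",
    "tavola_calda": "fast_food",
    "bar": "bar",
    "giochi_bambini": "playground",
}

# Source column -> OSM key for values copied unchanged.
COPY_FIELDS = {
    "CODICE ESERCIZIO": "ref",
    "email": "email",
    "sito": "website",
    "n_camere": "rooms",
    "n_posti_letto": "beds",
}


def row_to_tags(row):
    """Build an OSM tag dictionary from one source row (hypothetical helper)."""
    tags = {"tourism": "hotel"}
    for src, key in COPY_FIELDS.items():
        value = row.get(src, "").strip()
        if value:
            tags[key] = value
    for src, key in BOOLEAN_FIELDS.items():
        value = YES_NO.get(row.get(src, "").strip().upper())
        if value:
            tags[key] = value
    # "3 Stelle ***" -> stars=3
    match = re.match(r"\s*(\d+)\s+Stell", row.get("categoria", ""), re.IGNORECASE)
    if match:
        tags["stars"] = match.group(1)
    # Assumption: title case is an acceptable first guess for the name.
    name = row.get("denominazione", "").strip()
    if name:
        tags["name"] = name.title()
    # Only the value shown in the table is handled; other values are left unmapped.
    if row.get("accessibile", "").strip().lower() == "non accessibile":
        tags["wheelchair"] = "no"
    return tags
```

Phone numbers and opening periods are left out here on purpose, since the table marks them as still to be normalized.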

Import Type

The dataset will be imported on a regional basis (OSM admin_level=4). Candidate OSM nodes will be presented as pins on a support umap. Uploads will be done manually via the Level0 editor.

Team Approach

Import will be managed by the following OSM users:

  • Cascafico

Workflow

Step by step operations:

  1. dataset download
  2. OpenRefine operations
  3. geocoding with csvgeocode (Nominatim)
  4. geocoded csv upload to support umap
  5. community editing
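
The geocoding step can be sketched in Python as follows. The workflow itself uses csvgeocode; this sketch is a stand-in that queries the public Nominatim search endpoint directly with a structured query, only to show how the `indirizzo`, `comune` and `cap` fields could be combined. The function names and the User-Agent string are hypothetical, and the assumption that the house number follows the comma in `indirizzo` is based on the single example row.

```python
import json
import urllib.parse
import urllib.request

NOMINATIM_URL = "https://nominatim.openstreetmap.org/search"


def build_query(indirizzo, comune, cap):
    """Build a structured Nominatim search URL from the three address fields."""
    # Assumption: "PIAZZA CARPACCIO, 29" means street name, then house number.
    street, _, number = indirizzo.partition(",")
    params = {
        # Nominatim expects the house number before the street name.
        "street": f"{number.strip()} {street.strip()}".strip(),
        "city": comune.strip(),
        "postalcode": cap.strip(),
        "country": "Italy",
        "format": "jsonv2",
        "limit": 1,
    }
    return NOMINATIM_URL + "?" + urllib.parse.urlencode(params)


def geocode(url, user_agent="hotels-rafvg-import-sketch"):
    """Fetch the first result and return (lat, lon), or None if nothing matched."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request, timeout=30) as response:
        results = json.load(response)
    if not results:
        return None
    return float(results[0]["lat"]), float(results[0]["lon"])
```

When calling `geocode` in a loop over the whole dataset, the public Nominatim server should not receive more than one request per second. Rows that return `None` are the ones expected to be left out of the umap upload.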

In case of import problems, the changesets involved will be reverted using a suitable reverter.

Data Preparation

The data is provided as a CSV (comma-separated values) file containing a collection of point elements, one for each hotel.

Refining

Some normalizations require refining operations, documented herein. Below is a summary of the actions performed through OpenRefine:

  • Remove unneeded columns
  • Standardize TELEFONO, CELLULARE and FAX
  • Convert column PERIODO_APERTURA to the OSM opening_hours standard
  • Split column INDIRIZZO at separator ","
  • Expand some INDIRIZZO abbreviations (e.g. Loc. > Località, Fraz. > Frazione)
  • (optional) Reconcile cells in column INDIRIZZO to an authoritative dataset
  • (optional) Match each cell to its best recon candidate in column INDIRIZZO
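
For readers who prefer code to OpenRefine operations, the main normalizations above can be approximated in Python. This is a sketch under assumptions, not a transcription of the dataset.operations file: all function names are hypothetical, phone numbers without a prefix are assumed to be Italian, and the opening period is assumed to always have the form seen in the example row (day, dot, Roman-numeral month, twice). The output `Jan 01-Dec 31` is one valid opening_hours spelling of a date range and is chosen here as the assumed target.

```python
import re

ROMAN_MONTHS = {
    "I": "Jan", "II": "Feb", "III": "Mar", "IV": "Apr",
    "V": "May", "VI": "Jun", "VII": "Jul", "VIII": "Aug",
    "IX": "Sep", "X": "Oct", "XI": "Nov", "XII": "Dec",
}

ABBREVIATIONS = {"Loc.": "Località", "Fraz.": "Frazione"}


def normalize_phone(raw):
    """Add the +39 prefix to a number that has no international prefix."""
    number = raw.strip()
    if not number or number.startswith("+"):
        return number
    if number.startswith("00"):
        return "+" + number[2:]
    # Italian landline numbers keep their leading zero after +39.
    return "+39 " + number


def convert_period(raw):
    """Turn "01.I 31.XII" into "Jan 01-Dec 31"; return None if the form differs."""
    parts = raw.split()
    if len(parts) != 2:
        return None
    converted = []
    for part in parts:
        day, _, month = part.partition(".")
        name = ROMAN_MONTHS.get(month.upper())
        if name is None or not day.isdigit():
            return None
        converted.append(f"{name} {int(day):02d}")
    return "-".join(converted)


def split_address(indirizzo):
    """Split INDIRIZZO at "," and expand known abbreviations in each part."""
    parts = []
    for part in indirizzo.split(","):
        text = part.strip()
        for short, full in ABBREVIATIONS.items():
            # Case-insensitive, since the source data is largely upper case.
            text = re.sub(re.escape(short), full, text, flags=re.IGNORECASE)
        parts.append(text)
    return parts
```

Periods that do not match the expected form return `None` so that they can be reviewed by hand instead of being converted silently.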

Normalization file

Here you can find the dataset.operations file applied to the source dataset for refining purposes.

Upload

Data is uploaded manually through the Level0 editor, which is linked in the support umap pop-ups. No dedicated upload accounts are used.

Changeset Tags

Changesets shall be tagged with: