ODA/Finnish SOTE registry import

From OpenStreetMap Wiki
Jump to navigation Jump to search

Introduction

The National Institute for Health and Welfare (THL) maintains a registry for all public and private social and health care providers in Finland.
The ODA Self care and digital value services -project would like to use the data in this registry for making sure that all the providers citizens visit are indeed listed on OSM.
The registry is available on Code Service under CC BY 4.0 license.
Written permission to use the data has been obtained from THL. THL would require that they are mentioned as a data source on the Contributors wiki page.

Goals

Aim of the import is to geocode the addresses on the THL registry and insert the locations on OSM.

Schedule

Plan is to do the import as soon as possible. Following topics need to be considered:

  • Licensing rights (CC-BY 4.0), OpenStreetMap Foundation has now guidelines. Written permission by THL has also been received
  • Data preparation, i.e. how to prepare the data for the import
  • Data merge, i.e. how to merge the prepared data to the OpenStreetMap, in case there are existing locations for the same service providers.

Import Data

Background

Data source site: https://www.thl.fi/en/web/information-management-in-social-welfare-and-health-care/standardisation-of-data-and-requirements/code-service
Data license: http://creativecommons.org/licenses/by/4.0/
Type of license: CC-BY
Link to permission: signed permission statement
OSM attribution: to be added to Contributors
ODbL Compliance verified: yes

OSM Data Files

Files will be prepared using the scripts on Github.
Example output available on output.

Import Type

New organizations are registered continuously to the registry. This means the import needs to be updated with the new and changed data periodically.<br>

Update will be done using scripts to process the data but with manual upload.

JOSM will be used for entering the changed data to the OSM database.

Data Preparation

Data Reduction & Simplification

Several of the SOTE organizations already appear on OSM. Data cleanup needs to be investigated. We see however also benefits in a bigger data set derived from the registry vs locations uploaded by individuals.

Tagging Plans

Organizations are tagged amenity:clinic=* or amenity:social_facility=*. Sample node:

<node id='-45' lat='61.495632' lon='23.770766' timestamp='2016-02-02T01:30:24'>
    <tag k='name' v='Hammaslääkintäyritys Plakki Oy'/>
    <tag k='addr:city' v='Tampere'/>
    <tag k='addr:street' v='Suvantokatu'/>
    <tag k='addr:housenumber' v='10'/>
    <tag k='addr:postcode' v='33100'/>
    <tag k='sote_oid' v='1.1.111.1111.1111111.11.11'/>
    <tag k='addr:country' v='FI'/>
    <tag k='amenity' v='clinic'/>
    <tag k='phone' v='050123456'/>
</node>

Changeset Tags

Will use comment=* and created_by=* tags to indicate that we uploaded the SOTE registry locations and that the changeset was crafted by the ODA effort.

Data Transformation

Addresses of the organizations are geocoded to coordinates using Digitransit.fi

Data Transformation Results

finland_output.osm

Data Merge Workflow

Team Approach

The import is being done as part of the ODA Self care and digital value services effort.

References

-

Workflow

  • Open tampere_output.osm on josm-tested.jar
  • Review data
  • Upload data in one changeset
  • Review upload results, if OK create & upload changeset for rest of Finland
  • Use JSOM Reverter plugin for rolling back changeset(s) if needed

Conflation

-

QA

Add your QA plan here.