WikiProject Uganda/Microsoft building imports for UBOS pilot census preparation

From OpenStreetMap Wiki
Jump to navigation Jump to search


Introduction

This activity is an import of building footprints covering the districts of Amuria, Kapelebyong and Butambala in western Uganda in support of the Uganda Bureau of Statistics’ (Uganda Bureau of Statistics ) technical preparations for a census pilot. The import is currently going through the Import Guidelines steps by which the local OSM community in Uganda and the global OSM import mailing list are engaged and consulted in the process.

Goals

For this building import activity, the goals are to provide a building footprint layer to Uganda Bureau of Statistics  to be able to guide the experimentation and improvement of data collection tools, methodologies and workflows deployed in previous censuses in a particular geographic area in Uganda that has been comprehensively mapped remotely. By knowing where and how building structures are located and/or distributed due a comprehensive building footprint map layer, Uganda Bureau of Statistics  will be able to experiment with different approaches for household surveying and ensure the final approach will enable the gathering, analysis and visualization of household data more efficiently and effectively. In more detail, the census pilot aims to achieve the following:

  • Estimate the amount of resources (people and time) required to implement the national data collection activities based on the size/location of the area
  • Determine whether new enumeration areas are needed to accelerate the field data collection process
  • Enable the Uganda Bureau of Statistics  census team to Identify the most suitable methodology for data collection (sampling versus full range coverage) for implementing the national census.


Timeline

In early 2020, HOT received a request from the Manager of Geo-Information Services at Uganda Bureau of Statistics  for support in acquiring building footprints in four districts in Uganda; these datasets would provide a blueprint for parts of the country that have not been previously mapped and help guide the initial planning of the pilot census activity to take place in mid to late 2020.

The process of incorporating building footprints for Uganda (using RapID) began on 13 February 2020 and continued through 30 April 2020. When this was flagged by the DWG on 30 April 2020 as an import, all activities were halted. On 1 May 2020, a handful of quality checks were carried out to ensure data was completely validated. From 1 May onwards, our team has been working to develop and disseminate the import workflow according to the import guidelines.

Note: this data import is not a blind import - the process will be controlled and closely monitored, where users will be expected to check the validity of the data against aerial imagery and to correct the geometry of each single building structure according to the images.

Import data

Data description

Under Microsoft’s AI for Humanitarian Action program, Bing Maps is contributing to an initiative from Humanitarian OpenStreetMap Team that will bring Artificial Intelligence (AI) assistance to open building mapping. In 2019, Bing Maps released country-wide open building footprints datasets in Uganda (and Tanzania). The Uganda dataset contains 6,928,078‬ computer-generated building footprints derived using Bing Maps algorithms on satellite imagery. While this data has been permitted to be uploaded to OpenStreetMap, systematic and close inspection as well as validation will be needed beforehand. Mappers will be trained to avoid over-writing the hard work of other contributors or blindly importing data into OSM without first checking the local quality. While our metrics show that this data meets or exceeds the quality of hand-drawn building footprints, the data does vary in quality from place to place, between rural and urban, mountains and plains, and so on. For this reason, data quality will be inspected in detail, discussed with the local community to ensure there are no gaps in what data already exists in OSM and what data will be imported to OSM. The data is in geojson format and 1339MB in size.

Background

Data site: https://github.com/microsoft/Uganda-Tanzania-Building-Footprints

Data License: This data is licensed by Microsoft under the Open Data Commons Open Database License (ODbL).

ODbL Compliance verified: Yes

Data files

At the start of the import process, a link enabling the download of raw data in geojosn file format will be added here.

Import type

The plan is to import the building footprint data into OSM manually by a team of trained mappers working with HOT Uganda. Data will be split along village administrative boundaries that have been provided to our team from Uganda Bureau of Statistics  for this activity to ease the coordination of task allocation amongst the mapping team. Using the HOT tasking manager, remote mapping task projects will be set up with arbitrary task boundaries corresponding to the village administrative boundaries.  The benefits of this approach include the following:

  • Trained community mappers will ensure proper data quality and can provide feedback quickly to the task management team
  • Mapping progress of the import can be checked and validated at any time through HOT’s tasking manager and OSMcha platforms
  • It is easy to set up and monitor over time

The list of remote mapping tasks created for this project are as follows:

Districts Subcounties Task
Amuria Adepuru 7916
Amuria 8009
Amuria Wilia 8141
Amuria Arbriela 8155
Amuria 8220
Butambala All villages 8180
Kapelebyong All villages 8319
Total

Data Preparation

Data reduction and simplification

During the building importation process, buildings that will have been identified as no longer existing on the ground will not be uploaded to OpenStreetMap. No simplification or transformation of the data is expected, outside of correcting potential offset.

Tagging plans

ML Tags Column Meaning Action OSM tags Action
FID Unique feature Identifier Delete building=”yes” Add tag
source= “Microsoft/BuildingFootprints” Add tag
mapwithai:source=“MapWithAI” Add tag

Changeset tags

Changeset comment:#MapwithAI #Uganda #Uganda Bureau of Statistics census

Import=”yes”

Url:Microsoft buildings import for UBOS Pilot Census Preparation

Data Transformation

The original file (in geojson format) will first be converted into a shapefile format. This will allow the size of the file to be reduced significantly to enable reading in JOSM. The shapefile will then be loaded into JOSM using the open layers plugin where all irrelevant tags will be deleted and the proposed tags will be added. From this point, the resulting file will be split according to Uganda Bureau of Statistics ’ 2011 village boundaries and after saved in geojson format.

Data Merge Workflow

Team approach

This import (data integration) will be done through the HOT Tasking Manager, so the number of people importing the data is known and easily tracked. We expect to use 5 mappers during this import process with the following skills:

  • Good experience with JOSM.
  • Experience with previous imports through the Tasking Manager is not necessary, but a plus.
  • S/he knows how to use JOSM filters.
  • Skilled working with buildings. It will be crucial for mappers to know how to orthogonalize buildings (Q), round buildings (O), combine ways (C) and ungluing (G).
  • S/he knows how to use the ToDo JOSM plugin.
  • S/he knows how to create and deal with building relations.
  • S/he knows how to deal with conflicts.

References

Following the imports guidelines, this import activity will be discussed in the local Uganda OSM mailing list, and later with the HOT mailing list to gather additional feedback/comments on the proposed process.After gathering feedback from the HOT and OSM Uganda mailing list the document will be shared with global OSM import mailing list for any final thoughts and feedback on the import activity.

Workflows

JOSM Workflow Instructions

These remote mapping tasks will be tailored for intermediate and/or advanced mappers only. With that said, strict supervision and monitoring protocols will be applied throughout the activity to ensure high quality mapping.

Once remote mapping tasks are set up, mappers are instructed to navigate the village task assigned to them and download it to his/her JOSM application. Once downloaded in JOSM, the mapper will add the (identified) village boundary to allow him/her to be sure of the mapping limits. Then, the mapper will add the Microsoft AI buildings for that specific village into JOSM as a new layer.

After loading Microsoft AI buildings into JOSM, the mapper must activate the Bing imagery layer. Since Microsoft AI buildings were largely generated using Bing imagery, it is important to use the same imagery to ensure building structures are properly aligned. From this point on, the mapper must assess each building feature individually using the todolist plugin in JOSM to ensure buildings are aligned with Bing imagery. Once this process is complete, the mapper will be required to activate the Maxar imagery service in JOSM. Maxar, in most areas in Uganda, is newer than Bing imagery (2015-2018 for Maxar versus 2011-2013 for Bing) which he/she will use to arbitrate what buildings to import or not. From here, the mapper will also manually trace the missing buildings that could be newly constructed buildings structures. The mapper will also  manually remove building structures that no longer exist based on the newest imagery  and then add the appropriate tags - building=yes, source=Microsoft/BuildingFootprints.

After completing this process, the mapper will apply a second assessment using the todolist plugin for all features added to the layer. The mapper will then run the validation process to identify overlapping, fix wrongly tagged or poorly traced building features. If any errors are identified, the mapper will work to resolve these issues before uploading them to OSM and moving to the next task.

Changeset Policy

Changesets will be quite small in size, so no issues are expected in this regard.

Revert Plan

In case of needing to retract any changes, the activity team will use the JOSM Revert Tool if/when needed as well as keep monitoring and tracking the mapping activity (Changesets made).This process will make it easier to identify errors that need to be reverted.

Conflation

If an existing building in OSM is no longer visible on the most recent imagery (as is the case in this activity because Maxar imagery is more up to date in the AOI), the building will be removed. The mappers will be required to apply proper validation checks of their tasks before uploading to OSM to ensure buildings and highways do not cross or overlap one another.

Quality Assurance

To ensure high data quality, validation of the import tasks will be carried out by a second user in the Tasking Manager. Two Senior Mappers will be dedicated to validating and cross-checking the imported data through OSMcha and the Tasking Manager to ensure data quality during the mapping process.