Tasking Manager/Validating data
Validating data is an important part of OpenStreetMap: having experienced volunteers check OSM data to make sure it is complete, accurate and meets all the requirements of OpenStreetMap features and tagging.
OSM has valuable resources including the mapping data and more importantly the volunteers who spend time updating and improving it. The process of validating attempts to ensure both these resources are maintained at a high standard, which is to be always improving the quality of the data.
There are many people whose first attempts at mapping are as a result of HOT and they may need help. It can be easy to overlook a feature that requires adding to the data and new mappers often need guidance to help them progress. This wiki contains many resources to aid in this process. There are some who are better at mapping than the validator who may be reviewing their edits. There is much to learn by reviewing entries that have already been validated.
One issue is determining the correct tag. This can be accomplished by using the wiki Map Features page to explore frequently used tags like highway=residential, which may appear vastly different in an area of round buildings thatched with straw and accessed from unsurfaced roads. In addition, the Map Features page contains more obscure tags such as the difference between man-made=silo and man-made=storage_tank instead of a round building=yes
Why, Who, When and What do we validate?
Why do we validate?
OpenStreetMap is often the only source of maps, but the data quality can be very uneven. It is important that the map users have confidence in the mapping on OpenStreetMap so it is up to us to ensure that the data is good and the quality is consistent. Because of the vast amount of mapping that is not yet on OpenStreetMap we are continually onboarding new mappers. These new mappers are busy feeling their way and making errors as they progress to being experienced mappers. Not only do we need to make sure that these errors are corrected but we also need to guide these new mappers so that they do not continue to make the same errors and we want to give them encouragement to keep mapping. Note that OpenStreetMap does not validate in the same way as the HOT tile system on the Tasking Manager. In the HOT Tasking Manager there are data quality tools that are used. If you are mapping in HOT you may receive a message if you have made any tagging errors or have not drawn features correctly.
Who can validate?
In OpenStreetMap anyone can validate as long as they do not validate their own work. However, they can validate another mapper's contributions.
When validating for HOT using the Tasking Manager each project has it's own level of mapping difficulty from Beginner to Intermediate to Advanced. The Project Creator can restrict who can validate according to the difficulty level or complexity of the mapping in their project. With more specific local mapping projects, field data collection or survey projects this can be restricted to a select few validators by creating a Team that is given access to the validation process for that project. HOT has two main validation Teams at present. Validator Trainees for those starting out and HOT Global Validators for the more advanced mappers.
What types of people validate?
All kinds of people validate, however, we have noticed they typically fall into one of these categories:
Although originally written for the HOT Disaster Activation teams, it holds true across all types of mapping efforts.
When do we validate?
The aim is to have 'Stage 1' validating taking place at the same time as the initial mapping, so that any amendments can be made while the nature of the terrain, imagery, and features is still fresh in everyone's mind. Prompt validation is likely to lead to the overall task being completed quicker and any problems can be quickly dealt with. Feedback should be as soon as possible to help and be presented in an encouraging manner. The aim is to prevent the same problem occurring again. If the task has been mapped properly then Stage 1 validating of a task will take considerably less time than the initial mapping did.
Shouldn't we all be doing all of the things listed in Stage 1 validating when we do the initial mapping?
In an ideal mapping scenario the contributor follows the instructions and maps the the tile perfectly. However this is not always the case. If you are validating and find some mistakes it is best to provide feedback to the mapper using the comment section in the Tasking Manager. If a mapper has made mistakes while tagging or squaring buildings they should be directed to the following material:
The following link provides more information on Tagging Buildings: https://wiki.openstreetmap.org/wiki/Key:building
The following link provides more information on Squaring Buildings (with JOSM): https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager/Validating_data/Buildings_feedback or when mapping with iDeditor: https://learnosm.org/en/hot-tips/tracing-rectangular-buildings/
What do we validate?
Make sure you know exactly what the task requires and have read any wiki pages with more information about the area and tags.
- You may also wish to add the JOSM Style 'HOT-OSM-Validation' , which changes the colours of the tagged highways to indicate if they comply with Highway Tag Africa. Instructions on how to load it are at JOSM Style wiki . Makes validating much easier - if you need to see more details & less of the 'Data Layer' in for instance residential areas, remember to toggle backwards & forwards into wireframe mode with Ctrl+w.
- It is recommended to use JOSM for validation. One of the common problems in a task is highways that end prematurely or do not have a node where they cross over one another. This means that routing software, that is, "find me the way from A to B" cannot be implemented as it does not recognize that two highways are connected if there is a half inch gap between them. JOSM validation will detect them. It is recommended that crossing ways, duplicates, overlapping ways, unclosed ways, and untagged ways are checked with JOSM validation.
For detecting duplicate buildings Mike Thompson was kind enough to build a JOSM script. https://github.com/MikeTho16/JOSM-Scripts
* Install JOSM's Scripting Plugin
* Place above file in a convenient location on your system
* Click "Scripting" (on top menu bar)
* Click "Run"
* Click "..." button and select this file.
* Click "Run"
Selects duplicate, or near duplicate, area buildings in JOSM's active data layer. A "near duplicate" is a building whose bounding box overlaps another building's bounding box by more than 50% (I couldn't get a pure area compare to work). Only the first building encountered of an overlapping pair is selected. The selected buildings are added to the current selection. Note clear the selection first then use the todo plugin to go through the duplicated buildings.
Stage 1. At neighborhood/TM task scale
This process is described for JOSM users, which includes a 'validator'. It is possible to validate using other applications and tools – if you are an expert in these please add details below.
- Using the Tasking Manager (TM), load a square into JOSM.
Have an initial quick scan around to see if you will be continuing or will be invalidating the square. If it is obvious that major improvements need to be made, click on the 'invalidate' button & add a comment explaining what needs to be done. The most frequent problem is a simple one that anyone can make, and the square has somehow been marked as complete, when it is obvious that the whole or majority of the square is still to be mapped – this type of error does not really need a message to the mapper, and may even be a software glitch (That's what we all claim). Do not invalidate a square for something that is not a requirement of the task. You may still invalidate a task later if it is later found that the requirements are not met. With JOSM when scrolling through the tile <Crtl><Down arrow> is very effective. A large high quality screen on a desktop shows much more detail than a fourteen inch laptop. Typically you willl find that zooming to one level less than you would when mapping works. In urgent cases zoom in as normal.
- Now check if buildings are missing. Not all buildings are rectangular - in this image we can see some round housing units or huts (tagged as 'building=hut'). If there are a large number of entities that should be mapped it can be invalidated. If there are only a few then it is simpler and faster for the validator to make these corrections while examining the tile. With the density of habitations in some areas it is quite common that buildings may be missed.
- Are the buildings correctly traced? Sometimes mappers do not trace buildings accurately and if this is significant then it needs to be corrected.
- The most frequent errors are buildings that have not been squared. In this case it is best to send a message explaining how buildings should be squared.
- Note: If you find one building that was not squared you should search for more. This can be done in JOSM by searching "building nodes:-7". Then select the mapper and press "q". You should leave a comment advising the mapper that their buildings were not squared. Information on how to square a building for JOSM and iD is often included in the Tasking Manager where the project is listed or under the Instructions tab. The JOSM building_tool plugin is useful for adding missing buildings. You can also use this technique for moving buildings that are out of alignment. Select them and then move them all at once.
Paste the following into a message about Buildings for the mapper concerned: Help here https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager/Validating_data/Buildings_feedback
- Are highway tags are correct? Some mappers tag every highway as a track because they are not paved. This is not a meaningful criteria because 99% of the roads are unpaved in developing countries. Check the wiki definitions and the Highway_Tag_Africa before tagging any highways.
- Should it be a path or an unclassified? Be careful about subjective interpretations. This depends on the type of vehicle (is it a four by four?) that will be used to access it as well as the expertise of the driver. Leave it as it is unless you can clearly identify two parallel tracks and that one is definitely wide enough for a four by four vehicle. Tracks are different therefore you must inspect closely and change the tag if necessary. The JOSM To Do list plugin is invaluable especially when you are faced with fifty tracks that need to be individually checked to see if they should be unclassified etc. You can use this plug-in to make notes to your self on the map as you review the area. Remember that the season can affect what is seen in the image. If its recently rained then the highways will be narrower.
- Should a highway be unclassified or unclassified and surface unpaved? There is much debate about this issue. From a satellite image you cannot tell if it is paved with a layer of dust over the top or unpaved. This can only be accomplished in person by sweeping the highway with a brush and dustpan. When I map I often will use surface=unpaved but when I am validating I will probably leave it as the previous mapper tagged it.
- Road geometry. Some mappers do not put enough details and others too much (for example, a node every 10 or 20 m even if the road is straight). The first case is quickly corrected with the (magical) Improve Way Accuracy mode in JOSM. The second case, when the extra nodes actually make a straight road weave requires deleting the extra nodes.
- Start/End of Roads. Some mappers produce long winding ways which meander through several junctions. Click on the 'way' and break it into more obvious parts, probably at junctions or changes in the tagging. Some other streets or roads are broken up without any reason (same tags for all the sections), and need to be combined.
- See Stage 2: Town or city scale validation regarding further problems with highways, some of which may be visible within the single square you are validating.Other highways will cross over more than one tile.
- Connections between objects. There are some that entities that should be connected and those that should not. Frequent problems include residential boundaries being connected to highway or building nodes. This can be determined by using the JOSM validator plug-in.
- Want to know when the object was mapped and who mapped it? Highlight the object in JOSM. Look under the view menu for advanced info or use <crtl>i. At the top you will see the mapper’s name. If they have thousands of edits to their credit and you have only a few you should double check your own interpretation. The https://wiki.openstreetmap.org/wiki/Map_Features page provides useful feedback in this area.
- If you see a zebra crossing on a path in Africa just delete (zap) the tag. The iD editor seems to add these on occasion. It is a type of pedestrian crossing that is probably only found in the United Kingdom and is not related to actual zebras.
- If there are two landuse=areas that overlap select them both and use the <shift>j keys to join them. This happens on a tile boundary where two mappers have drawn the line down the edge of the tile.
- Changeset comment: While you are reviewing the tile for validation you may notice that the mapper has not included the changeset comment, in which case send this link to them through the OSM messenger system https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager/Validating_data/Changeset_comment_missing. It provides instructions on how and why to use the changeset comment feature.
- When you are satisfied your loaded square is completed, upload any changes and press the 'validate' button on the TM. Add a comment in the validation box that acknowledges the mappers' efforts, provides constructive feedback and if necessary links to information sources and any other relevant comments. For example, the following comment "All complete, thanks. Couple of minor errors with crossing buildings highlighted by the validator programme & I've fixed them" provides feedback and a thank you. You may also send a message directly to the mapper concerned but it may be better if you first review any other squares they have mapped. To send a message directly just type their username like this: @Gollum (or if username has spaces: @[Bilbo Baggins]) in the comment box in the Tasking Manager. You will find that many of the mappers are new to OSM especially during a humanitarian crisis. It is worth letting them know that their work was satisfactory and it would be appreciated if they did some more.
Stage 2: Town or city scale
- Load as many of the tiles into JOSM as you are able, covering the area of the tasking manager project. You may have to do a larger town or city in stages. If a project is not complete yet, you might do this just over areas which are fully validated and green.
- Run the JOSM Validator and make corrections where possible. If the Stage 1 validators have being making the correct entries this process should not take long.
- As you are now looking at a much larger area it should be apparent which sections of highway should be joined together and tagged as Primary, Secondary and Tertiary, for example. They are often larger and frame a larger area or can be a parallel way to main roads. It is also important to check where they start and when they stop, which is often not possible to do when you are looking at individual squares in the Tasking Manager as the highways run across more than one tile.
Missing parts of Highways
The following image looks unusual on Mapnik. If you check this the imagery confirms the two sides of the road separated by a drain are not finished.
In the image below we would expect the two unclassified roads to be connected and not joined by a simple path. The imagery confirms that the southern highway looks the same, and should be tagged the same. In this example it shows that in the farther south section the highway changes to tertiary. This may not be apparent in the imagery. It should be cut when it becomes a straight road.
Isolated sections of Highways
Highways that can be driven upon cannot be isolated and connected to the drivable road network by paths. Instead they must be connected to it by driveable highways.
In this example the primary and tertiary road are connected by an unclassified road. And the same thing is occurring between the two tertiary roads. After having checked the imagery this road would deserve to be tagged as tertiary. This requires that you check the streets width to identify the main ones that needs not to be tagged as residential but unclassified or even tertiary.
Useful Tools for Validating
- In Tasking Manager version 3 several tools were introduced to allow validators to select multiple tasks to validate at once
- Note: multi-task validation should be used very carefully and should not replace or be used as an excuse to verify every task
- In version 4 of the Tasking manager, the following multi-task validation functions are available:
- Multiple selection by mouse: in the contribution tab, you can click multiple tasks and select them for validation
- Selecting by user: in the contributions tab, select all the tasks mapped by a certain user, see image for example
- Note that you can also filter by experience level or new users of the Tasking Manager
- Visit LearnOSM Module: Tasking manager Mapper's Guide - Validation For more information
Main list is at Quality_Assurance
- Osmose Web browser based. Whole world coverage. Multilingual
- Keep_right Web browser based. Whole world coverage. Multilingual
- That_Shouldnt_Be_Possible Web browser based. Requires a gps trace of the area / route you wish to check. At present only limited parts of the world are covered.
- Visualizing OSM data with cartodb to aid HOTOSM validation. Please see this blogpost
Messages to Mappers
Should you comment every time someone does something wrong? One mapper quite often misses a settlement, however they’ve mapped over a thousand tiles. Most of the time I just add the settlements and do not send comments to the mapper who missed this.
If the entries are more than a month old you should just correct, validate and do not bother to comment. Many mapathon mappers only map once and even if they are still mapping another validator may have commented and they may be mapping correctly now. Multiple messages for something they did a few times months ago lessens the fun.
You don't have to send a message to the mapper involved, but if you are going to, this section may help. If your message concerns a problem with the mapping, see the links which are being developed for adding to your message - Problems with Buildings is the first.
Of course, you may send your own message to any mapper, Johnwhelan has been known to use:
For a mapper who has omitted fifty settlements, "added fifty settlements" and validate it, it gets the message to be more careful across.
For a mapper who has carefully filled the tile with every path and track, or has left settlements with no connections. "We're interested in where people live and how to get there so tracks leading to fields are of less interest." Link to African Highway wiki.
For mappers who leave ways or areas untagged, "tagged twenty buildings / highways." and validate it, it gets the message to be more careful across.
For a mapper who has tagged every highway as a road. "JOSM validation has identified twenty three highway=road as a temporary highway type. highway=road is historical from when the main source of input was a GPS track. These were converted to highways but could be either a motorway or a highway=footway with steps and needs closer inspection to add the correct tag. As a result many routing systems will ignore them. In this area highway=unclassified is the most common highway type."
For a mapper who tags highways between settlements as highway=pedestrian in rural areas. "The African Highway wiki recommends the use of highway=path. These are often used by small motorcycles."
"JOSM validation detected seventeen crossing highways and eight highways that almost met. This confuses the routing software and has been corrected."
Tallguy is much more tactful.
This is a template of a message being sent by Tallguy which may give you some ideas about what you could include.
"Hi & thank you for helping the Humanitarian OpenStreetMap Team (HOT).
I am an experienced mapper (still learning though!), and have looked through one of the squares you have completed for task from the OSM Tasking manager. You got it right, thanks!
If you are a regular contributor to HOT then keep up the good work.
If this is one of your first contributions, then thank you & please keep up the good work. There are many tasks outstanding in http://tasks.hotosm.org/ , all are for very good causes & it would be great if you could keep contributing.
You could also help HOT with more than just mapping. Here are a few ideas; Validating – there's a section on this in https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager Perhaps if we each validated 3 squares and mapped one? We could learn a lot from others & help to raise the standard of mapping throughout. Sometimes you map a square & think “Why doesn't that highway come into my square from that square – when you've finished your square take a moment to have a look – perhaps it needs a lot more work & shouldn’t be left showing as complete? You could also get involved in Stage 2 validation?
Take a moment to read http://learnosm.org/en/ which contains sections on advanced mapping. After 5 years of mapping I read it & still learnt quite a lot,
Help with translating; http://learnosm.org/en/ is multilingual (English | Bahasa Indonesia | 日本語 | Português | Italiano | Français | Deutsch | Español ) but needs more translated especially on the advanced sections for JOSM, https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager (either edit the original, or add to Talk section, or send to Tallguy for adding in – good screenshots are also a help),
Software writers. Could you help the team with https://github.com/hotosm/osm-tasking-manager
https://hot.openstreetmap.org/ and https://wiki.openstreetmap.org/wiki/Humanitarian_OSM_Team will tell you more about HOT. You can keep up to date about HOT, or particular areas by subscribing to the relevant lists at https://lists.openstreetmap.org/listinfo . There is also irc://irc.oftc.net #hot for a link to others mapping for HOT, and the page https://wiki.openstreetmap.org/wiki/Irc lists all of the IRC channels if you have an interest in a particular area.
Keep up the good work & I hope to see more squares completed by you in the future.
Other Validation material:
HOT validation Webinar Training - March 2019 - https://youtu.be/ee3XkQ2VwaY
Missing Maps validation page - https://www.missingmaps.org/validate/