Tasking Manager/Validating data
Validating data is an important part of OpenStreetMap: having experienced volunteers check OSM data to make sure it is complete, accurate and meets all the requirements of OpenStreetMap features and tagging.
OSM has valuable resources including the mapping data and more importantly the volunteers who spend time updating and improving it. The process of validating attempts to ensure both these resources are maintained at a high standard, which is to be always improving the quality of the data.
There are many people whose first attempts at mapping are as a result of HOT and they may need help. It can be easy to overlook a feature that requires adding to the data and new mappers often need guidance to help them progress. This wiki contains many resources to aid in this process. There are some who are better at mapping than the validator who may be reviewing their edits. There is much to learn by reviewing entries that have already been validated.
One issue is determining the correct tag. This can be accomplished by using the tagging wiki such as frequently used tags like 'highway=residential', which may appear vastly different in an area of round buildings thatched with straw and accessed from unsurfaced roads. In addition, the tagging wiki contains more obscure tags such as 'wadi',
- 1 Why, When and What do we validate?
- 2 Stage 1. At neighborhood/TM task scale
- 3 Stage 2: Town or city scale
- 4 Useful Tools for validating
- 5 Messages to Mappers
Why, When and What do we validate?
Why do we validate?
“OpenStreetMap is often the only source of maps, but the data quality is very uneven. I wish they’d put their more experienced mappers onto validation.” This is a quote from an individual who used OpenStreetMap data (HOT) in the field. Therefore this is why we try to improve the data quality. Note that OpenStreetMap does not validate in the same way as HOT tile system does. In HOT there are data quality tools that are used. If you are mapping in HOT you may receive a message if you have made any tagging errors.
Although anyone can validate do not validate your own work. However, you can validate another mappers contributions. To do this review the instructions for the task. If you are confident you understand what is required and how to fix any issues that may be present you may proceed. More people are needed to carry out validating but it needs to be done properly.
When do we validate?
The aim is to have 'Stage 1' validating taking place at the same time as the initial mapping, so that any amendments can be made while the nature of the terrain, imagery, and features is still fresh in everyone's mind. Prompt validation is likely to lead to the overall task being completed quicker and any problems can be quickly dealt with. Feedback should be as soon as possible to help and be presented in an encouraging manner. The aim is to prevent the same problem occurring again. If the square has been mapped properly then Stage 1 validating of a square will take considerably less time than the initial mapping did.
Shouldn't we all be doing all of the things listed in Stage 1 validating when we do the initial mapping?
In an ideal mapping scenario the contributor follows the instructions and maps the the tile perfectly. However this is not always the case. If you are validating and find some mistakes it is best to provide feedback to the mapper using the comment section in the Tasking Manager. If a mapper has made mistakes while tagging buildings they should be directed to this material:
The following link provides more information on Buildings: https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager/Validating_data/Buildings_feedback
What do we validate?
Make sure you know exactly what the task requires and have read any wiki pages with more information about the area and tags.
- You may also wish to add the JOSM Style 'HOT-OSM-Validation' , which changes the colours of the tagged highways to indicate if they comply with Highway Tag Africa. Instructions on how to load it are at JOSM Style wiki . Makes validating much easier - if you need to see more details & less of the 'Data Layer' in for instance residential areas, remember to toggle backwards & forwards into wireframe mode with Ctrl+w.
- It is recommended to use JOSM for validation. One of the common problems in a task is highways that end prematurely or do not have a node where they cross over one another. This means that routing software, that is, "find me the way from A to B" cannot be implemented as it does not recognize that two highways are connected if there is a half inch gap between them. JOSM validation will detect them. It is recommended that crossing ways, duplicates, overlapping ways, unclosed ways, and untagged ways are checked with JOSM validation.
For detecting duplicate buildings Mike Thompson was kind enough to build a JOSM script. https://github.com/MikeTho16/JOSM-Scripts
* Install JOSM's Scripting Plugin
* Place above file in a convenient location on your system
* Click "Scripting" (on top menu bar)
* Click "Run"
* Click "..." button and select this file.
* Click "Run"
Selects duplicate, or near duplicate, area buildings in JOSM's active data layer. A "near duplicate" is a building whose bounding box overlaps another building's bounding box by more than 50% (I couldn't get a pure area compare to work). Only the first building encountered of an overlapping pair is selected. The selected buildings are added to the current selection. Note clear the selection first then use the todo plugin to go through the duplicated buildings.
Stage 1. At neighborhood/TM task scale
This process is described for JOSM users, which includes a 'validator'. It is possible to validate using other applications and tools – if you are an expert in these please add details below.
- Using the Tasking Manager (TM), load a square into JOSM.
Have an initial quick scan around to see if you will be continuing or will be invalidating the square. If it is obvious that major improvements need to be made, click on the 'invalidate' button & add a comment explaining what needs to be done. The most frequent problem is a simple one that anyone can make, and the square has somehow been marked as complete, when it is obvious that the whole or majority of the square is still to be mapped – this type of error does not really need a message to the mapper, and may even be a software glitch (That's what we all claim). Do not invalidate a square for something that is not a requirement of the task. You may still invalidate a task later if it is later found that the requirements are not met. With JOSM when scrolling through the tile <Crtl><Down arrow> is very effective. A large high quality screen on a desktop shows much more detail than a fourteen inch laptop. Typically you willl find that zooming to one level less than you would when mapping works. In urgent cases zoom in as normal.
- Now check if buildings are missing. Not all buildings are rectangular - in this image we can see some round housing units or huts (tagged as 'building=hut'). If there are a large number of entities that should be mapped it can be invalidated. If there are only a few then it is simpler and faster for the validator to make these corrections while examining the tile. With the density of habitations in some areas it is quite common that buildings may be missed.
- Are the buildings correctly traced? Sometimes mappers do not trace buildings accurately and if this is significant then it needs to be corrected.
- The most frequent errors are buildings that have not been squared. In this case it is best to send a message explaining how buildings should be squared.
- Note: If you find one building that was not squared you should search for more. This can be done in JOSM by searching "building nodes:-7". Then select the mapper and press "q". You should leave a comment advising the mapper that their buildings were not squared. Information on how to square a building for JOSM and iD is often included in the Tasking Manager where the project is listed or under the Instructions tab. The JOSM building_tool plugin is useful for adding missing buildings. You can also use this technique for moving buildings that are out of alignment. Select them and then move them all at once.
Paste the following into a message about Buildings for the mapper concerned: Help here https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager/Validating_data/Buildings_feedback
- Are highway tags are correct? Some mappers tag every highway as a track because they are not paved. This is not a meaningful criteria because 99% of the roads are unpaved in developing countries. Check the wiki definitions and the Highway_Tag_Africa before tagging any highways.
- Should it be a path or an unclassified? Be careful about subjective interpretations. This depends on the type of vehicle (is it a four by four?) that will be used to access it as well as the expertise of the driver. Leave it as it is unless you can clearly identify two parallel tracks and that one is definitely wide enough for a four by four vehicle. Tracks are different therefore you must inspect closely and change the tag if necessary. The JOSM To Do list plugin is invaluable especially when you are faced with fifty tracks that need to be individually checked to see if they should be unclassified etc. You can use this plug-in to make notes to your self on the map as you review the area. Remember that the season can affect what is seen in the image. If its recently rained then the highways will be narrower.
- Should a highway be unclassified or unclassified and surface unpaved? There is much debate about this issue. From a satellite image you cannot tell if it is paved with a layer of dust over the top or unpaved. This can only be accomplished in person by sweeping the highway with a brush and dustpan. When I map I often will use surface=unpaved but when I am validating I will probably leave it as the previous mapper tagged it.
- Road geometry. Some mappers do not put enough details and others too much (for example, a node every 10 or 20 m even if the road is straight). The first case is quickly corrected with the (magical) Improve Way Accuracy mode in JOSM. The second case, when the extra nodes actually make a straight road weave requires deleting the extra nodes.
- Start/End of Roads. Some mappers produce long winding ways which meander through several junctions. Click on the 'way' and break it into more obvious parts, probably at junctions or changes in the tagging. Some other streets or roads are broken up without any reason (same tags for all the sections), and need to be combined.
- See Stage 2: Town or city scale validation regarding further problems with highways, some of which may be visible within the single square you are validating.Other highways will cross over more than one tile.
- Connections between objects. There are some that entities that should be connected and those that should not. Frequent problems include residential boundaries being connected to highway or building nodes. This can be determined by using the JOSM validator plug-in.
- Want to know when the object was mapped and who mapped it? Highlight the object in JOSM. Look under the view menu for advanced info or use <crtl>i. At the top you will see the mapper’s name. If they have thousands of edits to their credit and you have only a few you should double check your own interpretation. The https://wiki.openstreetmap.org/wiki/Map_Features page provides useful feedback in this area.
- If you see a zebra crossing on a path in Africa just delete (zap) the tag. The iD editor seems to add these on occasion. It is a type of pedestrian crossing that is probably only found in the United Kingdom and is not related to actual zebras.
- If there are two landuse=areas that overlap select them both and use the <shift>j keys to join them. This happens on a tile boundary where two mappers have drawn the line down the edge of the tile.
- Changeset comment: While you are reviewing the tile for validation you may notice that the mapper has not included the changeset comment, in which case send this link to them through the OSM messenger system https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager/Validating_data/Changeset_comment_missing. It provides instructions on how and why to use the changeset comment feature.
- When you are satisfied your loaded square is completed, upload any changes and press the 'validate' button on the TM. Add a comment in the validation box that acknowledges the mappers' efforts, provides constructive feedback and if necessary links to information sources and any other relevant comments. For example, the following comment "All complete, thanks. Couple of minor errors with crossing buildings highlighted by the validator programme & I've fixed them" provides feedback and a thank you. You may also send a message directly to the mapper concerned but it may be better if you first review any other squares they have mapped. To send a message directly just type their username like this: @Gollum (or if username has spaces: @[Bilbo Baggins]) in the comment box in the Tasking Manager. You will find that many of the mappers are new to OSM especially during a humanitarian crisis. It is worth letting them know that their work was satisfactory and it would be appreciated if they did some more.
Stage 2: Town or city scale
- Load as many of the tiles into JOSM as you are able, covering the area of the tasking manager project. You may have to do a larger town or city in stages. If a project is not complete yet, you might do this just over areas which are fully validated and green.
- Run the JOSM Validator and make corrections where possible. If the Stage 1 validators have being making the correct entries this process should not take long.
- As you are now looking at a much larger area it should be apparent which sections of highway should be joined together and tagged as Primary, Secondary and Tertiary, for example. They are often larger and frame a larger area or can be a parallel way to main roads. It is also important to check where they start and when they stop, which is often not possible to do when you are looking at individual squares in the Tasking Manager as the highways run across more than one tile.
Missing parts of Highways
The following image looks unusual on Mapnik. If you check this the imagery confirms the two sides of the road separated by a drain are not finished.
In the image below we would expect the two unclassified roads to be connected and not joined by a simple path. The imagery confirms that the southern highway looks the same, and should be tagged the same. In this example it shows that in the farther south section the highway changes to tertiary. This may not be apparent in the imagery. It should be cut when it becomes a straight road.
Isolated sections of Highways
Highways that can be driven upon cannot be isolated and connected to the drivable road network by paths. Instead they must be connected to it by driveable highways.
In this example the primary and tertiary road are connected by an unclassified road. And the same thing is occurring between the two tertiary roads. After having checked the imagery this road would deserve to be tagged as tertiary. This requires that you check the streets width to identify the main ones that needs not to be tagged as residential but unclassified or even tertiary.
Useful Tools for validating
Main list is at Quality_Assurance
- Osmose Web browser based. Whole world coverage. Multilingual
- Keep_right Web browser based. Whole world coverage. Multilingual
- That_Shouldnt_Be_Possible Web browser based. Requires a gps trace of the area / route you wish to check. At present only limited parts of the world are covered.
- Visualizing OSM data with cartodb to aid HOTOSM validation. Please see this blogpost
Messages to Mappers
Should you comment every time someone does something wrong? One mapper quite often misses a settlement, however they’ve mapped over a thousand tiles. Most of the time I just add the settlements and do not send comments to the mapper who missed this.
If the entries are more than a month old you should just correct, validate and do not bother to comment. Many mapathon mappers only map once and even if they are still mapping another validator may have commented and they may be mapping correctly now. Multiple messages for something they did a few times months ago lessens the fun.
You don't have to send a message to the mapper involved, but if you are going to, this section may help. If your message concerns a problem with the mapping, see the links which are being developed for adding to your message - Problems with Buildings is the first.
Of course, you may send your own message to any mapper, Johnwhelan has been known to use:
For a mapper who has omitted fifty settlements, "added fifty settlements" and validate it, it gets the message to be more careful across.
For a mapper who has carefully filled the tile with every path and track, or has left settlements with no connections. "We're interested in where people live and how to get there so tracks leading to fields are of less interest." Link to African Highway wiki.
For mappers who leave ways or areas untagged, "tagged twenty buildings / highways." and validate it, it gets the message to be more careful across.
For a mapper who has tagged every highway as a road. "JOSM validation has identified twenty three highway=road as a temporary highway type. highway=road is historical from when the main source of input was a GPS track. These were converted to highways but could be either a motorway or a highway=footway with steps and needs closer inspection to add the correct tag. As a result many routing systems will ignore them. In this area highway=unclassified is the most common highway type."
For a mapper who tags highways between settlements as highway=pedestrian in rural areas. "The African Highway wiki recommends the use of highway=path. These are often used by small motorcycles."
"JOSM validation detected seventeen crossing highways and eight highways that almost met. This confuses the routing software and has been corrected."
Tallguy is much more tactful.
This is a template of a message being sent by Tallguy which may give you some ideas about what you could include.
"Hi & thank you for helping the Humanitarian OpenStreetMap Team (HOT).
I am an experienced mapper (still learning though!), and have looked through one of the squares you have completed for task from the OSM Tasking manager. You got it right, thanks!
If you are a regular contributor to HOT then keep up the good work.
If this is one of your first contributions, then thank you & please keep up the good work. There are many tasks outstanding in http://tasks.hotosm.org/ , all are for very good causes & it would be great if you could keep contributing.
You could also help HOT with more than just mapping. Here are a few ideas; Validating – there's a section on this in https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager Perhaps if we each validated 3 squares and mapped one? We could learn a lot from others & help to raise the standard of mapping throughout. Sometimes you map a square & think “Why doesn't that highway come into my square from that square – when you've finished your square take a moment to have a look – perhaps it needs a lot more work & shouldn’t be left showing as complete? You could also get involved in Stage 2 validation?
Take a moment to read http://learnosm.org/en/ which contains sections on advanced mapping. After 5 years of mapping I read it & still learnt quite a lot,
Help with translating; http://learnosm.org/en/ is multilingual (English | Bahasa Indonesia | 日本語 | Português | Italiano | Français | Deutsch | Español ) but needs more translated especially on the advanced sections for JOSM, https://wiki.openstreetmap.org/wiki/OSM_Tasking_Manager (either edit the original, or add to Talk section, or send to Tallguy for adding in – good screenshots are also a help),
Software writers. Could you help the team with https://github.com/hotosm/osm-tasking-manager
https://hot.openstreetmap.org/ and https://wiki.openstreetmap.org/wiki/Humanitarian_OSM_Team will tell you more about HOT. You can keep up to date about HOT, or particular areas by subscribing to the relevant lists at https://lists.openstreetmap.org/listinfo . There is also irc://irc.oftc.net #hot for a link to others mapping for HOT, and the page https://wiki.openstreetmap.org/wiki/Irc lists all of the IRC channels if you have an interest in a particular area.
Keep up the good work & I hope to see more squares completed by you in the future.
Other Validation material:
HOT validation Webinar Training - March 2019 - https://youtu.be/ee3XkQ2VwaY
Missing Maps validation page - https://www.missingmaps.org/validate/