User:SK53/NHD Upload

From OpenStreetMap Wiki
Jump to: navigation, search

A bit more on NHD Uploads

Contents

Sub-basin progress & plans

Currently working through the 6 Colorado Headwater basins (14010001..6), with Blue River WaterBodies, Flowlines and Lines imported. Starting to import all WaterBody data for the other 5 basins in one sequence of uploads. SK53 21:26, 2 September 2009 (UTC). Data import completed around end Sept 2009. Data tidy-up proceeding tile-by-tile at level 11 (see below).

Process

With Blue River done I'm trying to work out a fairly routine process. Major problem I encountered was bug in bulk_upload.py running on Windows which left lots of lonely nodes all over central Colorado. Result was that I had to do quite a lot of manual reverting, being unwilling to try the perl script. It helps if ways (not nodes) are tagged with some kind of upload sequence to facilitate this type of backing out.

Now, I'm using xapi queries to analyse an area before performing uploads. Together with a special Kosmos rule file I can quickly visualise any likely conflicts. Still have to decide what to do on conflicts: e.g., manually mark duplicated ways in NHD data to quickly retrieve on uploads.

Having now done most of the import, this is the process I envisage using in the future:

Problems

Problems experienced so far include:

Merging the data sets

After data has been imported it looks OK, but the data needs to be tidied up. First any bad data from failed or duplicate uploads needs removing, then waterways joined together, and finally these joined to riverbanks and lakes/ponds. I use JOSM with the validator plugin. For NHD 1401 (Upper Colorado) a level 11 tile typically has about 40000 nodes once the NHD data has been imported, so I just maintain a list of tile co-ordinates and work through them usually from W->E alternating N->S and S->N so that I always work with an adjacent tile. Its a good idea NOT to change data outside the download area to avoid the time-consuming need for conflict resolution. Here's what I do:

Throughout the process any conflicts with existing data may well become noticeable.

Other Issues

Personal tools
Namespaces
Variants
Actions
site
Toolbox