Zenbu

From OpenStreetMap Wiki

Zenbu is a business listings website containing data for companies operating in New Zealand. The data on the site is user-generated and released under a license compatible with OSM (Creative Commons Attribution) [1]. As of January 2008, the site has over 40,000 POIs (points of interest).

This page is dedicated to facilitating the sharing of data between Zenbu and OSM (a two-way process).

Data Attribution

A method for attributing the data back to Zenbu needs to be developed.
The license the Zenbu data is released under states that "You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work)." A method of attribution that the operators of the Zenbu website are happy with must therefore be agreed. Possible methods include:

zenbu tags --> OSM tags

The POIs in Zenbu have tags to describe what they represent (police station, takeaway, library, etc.). Corresponding categories need to be devised in OSM, and a table produced which maps the Zenbu tags to the OSM tags. When the periodic import happens, this table will drive the change of tag names.

ignored
updated in Zenbu before being exported to OSM
modified at some intermediary stage, either manually or possibly using a script to guess at what they do from their name (e.g. 'Gilbert's cyclery' should probably be a cycle shop)
imported into OSM, tagged as being incomplete and manually updated gradually (and possibly exported to Zenbu)

to do

Ascertain categories used in Zenbu
Develop corresponding tags in OSM
Create a table to tie the two sets of tags together (assuming they are not identically named)

Zenbu tags - a list of all tags used in Zenbu, with their equivalent OSM tags and keys
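
As a rough illustration of how such a mapping table could drive the tag translation, here is a minimal Python sketch. The category names on the left are only the examples mentioned above, not the actual Zenbu category list, and the function name `osm_tags_for` is hypothetical. Categories with no entry in the table are returned separately so they can be reviewed by hand.

```python
# Illustrative mapping only: the real Zenbu category names still need to be
# ascertained, so these keys are assumptions based on the examples above.
ZENBU_TO_OSM = {
    "police station": {"amenity": "police"},
    "takeaway": {"amenity": "fast_food"},
    "library": {"amenity": "library"},
}


def osm_tags_for(zenbu_categories):
    """Return (tags, unmapped) for a list of Zenbu category names.

    tags     -- dict of OSM key/value pairs found via the mapping table
    unmapped -- list of categories that have no entry in the table yet
    """
    tags = {}
    unmapped = []
    for category in zenbu_categories:
        mapping = ZENBU_TO_OSM.get(category.strip().lower())
        if mapping is None:
            unmapped.append(category)
        else:
            tags.update(mapping)
    return tags, unmapped
```

Keeping the unmapped categories rather than dropping them makes it easier to see which entries the table is still missing.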

additional tags

We may need to add additional tags to keep track of the data and clear up mistakes later.
These may include:

geocoding

The Zenbu data incorporates latitude and longitude values for each POI. These coordinates were arrived at by one of three methods:

The third of these options gives data which is derived from a non-free source, i.e. it is incompatible with OSM's license. This represents a significant proportion of the data, which therefore has to be re-geocoded, either manually or using the LINZ database.
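
If the export records how each POI was geocoded, the split between usable coordinates and ones that need redoing could be sketched as below. This is only a sketch under assumptions: the field name `geocode_method` and the method labels passed in are hypothetical, and it is not known whether the Zenbu export exposes this information at all. POIs with a missing or unrecognised method are treated as needing re-geocoding, to stay on the safe side of the license.

```python
def split_by_geocode_source(pois, free_methods):
    """Split POIs into (keep, redo) lists.

    pois         -- list of dicts, each possibly with a 'geocode_method' key
                    (hypothetical field name)
    free_methods -- set of method labels known to be license-compatible
    """
    keep, redo = [], []
    for poi in pois:
        if poi.get("geocode_method") in free_methods:
            keep.append(poi)
        else:
            # Unknown or non-free origin: coordinates must be redone.
            redo.append(poi)
    return keep, redo
```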

software for importing

The Zenbu data is released as KML, GPX and CSV.
The data could therefore conceivably be imported with JOSM. What are the practical limitations on the amount of data that JOSM can handle in one hit?
If JOSM is not suitable, a custom script may need to be developed, along similar lines to the ones used for AND and TIGER.
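
A custom script could start from something like the following sketch, which turns CSV rows into an OSM XML document that JOSM can open for review. The column names `name`, `lat` and `lon` are assumptions about the CSV layout, the `source=zenbu` tag is only a placeholder until an attribution method is agreed, and the negative node ids follow the JOSM convention for objects that do not yet exist on the server.

```python
import csv
import io
import xml.etree.ElementTree as ET


def csv_to_osm_xml(csv_text):
    """Convert CSV text with name, lat, lon columns into an OSM XML string."""
    root = ET.Element("osm", version="0.6", generator="zenbu-import-sketch")
    reader = csv.DictReader(io.StringIO(csv_text))
    for index, row in enumerate(reader, start=1):
        # Negative ids mark nodes as new (not yet uploaded).
        node = ET.SubElement(
            root, "node", id=str(-index), lat=row["lat"], lon=row["lon"]
        )
        ET.SubElement(node, "tag", k="name", v=row["name"])
        # Placeholder attribution tag; the real scheme is still undecided.
        ET.SubElement(node, "tag", k="source", v="zenbu")
    return ET.tostring(root, encoding="unicode")
```

Writing the result to a `.osm` file and opening it in JOSM would allow the nodes to be checked by eye before anything is uploaded.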

the import process

Learning from TIGER and the aborted 2005 import, it would be sensible to break the data into sections and import them gradually, looking for errors that may crop up as we go.
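
One simple way to break the data into sections is fixed-size batches, as in the sketch below. The function name `batches` and the batch size are arbitrary choices for illustration. Splitting by region instead might make errors easier to spot on the map.

```python
def batches(pois, size):
    """Yield successive lists of at most `size` POIs."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(pois), size):
        yield pois[start:start + size]
```

Each batch can then be imported and checked before moving on to the next, so that a systematic error is caught early rather than after the whole data set has been uploaded.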
