Santa Clara County, California/San Jose Sidewalk Import

From OpenStreetMap Wiki
Jump to: navigation, search
Available languages — Santa Clara County, California/San Jose Sidewalk Import
· Afrikaans · Alemannisch · aragonés · asturianu · azərbaycanca · Bahasa Indonesia · Bahasa Melayu · Bân-lâm-gú · Basa Jawa · Basa Sunda · Baso Minangkabau · bosanski · brezhoneg · català · čeština · corsu · dansk · Deutsch · eesti · English · español · Esperanto · estremeñu · euskara · français · Frysk · Gaeilge · Gàidhlig · galego · Hausa · hrvatski · Igbo · interlingua · Interlingue · isiXhosa · isiZulu · íslenska · italiano · Kiswahili · Kreyòl ayisyen · kréyòl gwadloupéyen · Kurdî · latviešu · Lëtzebuergesch · lietuvių · magyar · Malagasy · Malti · Nederlands · Nedersaksies · norsk bokmål · norsk nynorsk · occitan · Oromoo · oʻzbekcha/ўзбекча · Plattdüütsch · polski · português · română · shqip · slovenčina · slovenščina · Soomaaliga · suomi · svenska · Tagalog · Tiếng Việt · Türkçe · Vahcuengh · vèneto · Wolof · Yorùbá · Zazaki · српски / srpski · беларуская · български · қазақша · македонски · монгол · русский · тоҷикӣ · українська · Ελληνικά · Հայերեն · ქართული · नेपाली · मराठी · हिन्दी · भोजपुरी · অসমীয়া · বাংলা · ਪੰਜਾਬੀ · ગુજરાતી · ଓଡ଼ିଆ · தமிழ் · తెలుగు · ಕನ್ನಡ · മലയാളം · සිංහල · བོད་ཡིག · ไทย · မြန်မာဘာသာ · ລາວ · ភាសាខ្មែរ · ⵜⴰⵎⴰⵣⵉⵖⵜ ⵜⴰⵏⴰⵡⴰⵢⵜ‎ · አማርኛ · 한국어 · 日本語 · 中文(简体)‎ · 中文(繁體)‎ · 吴语 · 粵語 · ייִדיש · עברית · اردو · العربية · پښتو · سنڌي · فارسی · ދިވެހިބަސް

As a first step toward mapping the pedestrian network for Santa Clara County, this import would significantly improve OpenStreetMap's coverage of sidewalks in San Jose, California. We are using data derived from a shapefile released into the public domain by the City of San José.

This import will be performed piecemeal using the OSMUS Tasking Manager. Contributors to the tasking manager project will import small portions of the city at a time, taking care to adhere to best practices around sidewalk and crosswalk mapping. There will not be a large-scale, automated import of any kind. We are following the example set by successful imports of sidewalk data in Seattle and Southern Ohio.

The data we are importing contains only sidewalks and not crossings or curb cuts. The thrust of this project is to verify the accuracy of the sidewalk data and insert it on the map to form the backbone of the pedestrian network. Since we are only importing sidewalks, we will use commonly accepted, well supported tags, supplemented where necessary by the sidewalk schema proposed by the OpenSidewalks project.

Import Plan Outline

We created a Tasking Manager for this project We broke San Jose into about 640 sections using VTA TAZ census blocks. Some zones are bigger than others.

To an individual helping with this import it will consist of:

  • Visiting the project
  • Selecting a (very small, ~a few blocks) region to import
  • Downloading the relevant chunk of data for that region via the Tasking Manager and opening as a layer on JOSM.
  • Reviewing the data for accuracy based on ESRI or DigitalGlobe Premium imagery. Mapillary may be excessive at this stage. Bing is okay but is old.
  • Adding the local data and checking for existing sidewalks (there are very few of these)
  • Adding nodes for relevant intersecting features such as parking isles but not buildings or
  • Submitting their region to OpenStreetMap, pre-filled with a changeset tag referencing the San Jose Sidewalk Import Task
  • Getting their submission validated by another local user.


The primary goal of this import is to create a backbone of data for the pedestrian network within San Jose and Santa Clara County by importing open data on sidewalks from the San Jose City Public Works GIS Data page. This original source data is created from orthoimagery and is in polygons. This data is available under the public domain and has no licensing restrictions.

For the most part, sidewalks are not annotated in OpenStreetMap in the San Jose region, so this import would address a fundamental lack of information about the pedestrian right of way for this region. To explore the relevance of pedestrian network data to the question of accessibility, see the related project. However, sidewalk data is relevant to most individuals as it represents fundamental infrastructure supporting most other modes of transport in urbanized areas: public transit, automobile traffic, and cyclists.


  • September 2017: Getting Review Approval
  • September 2017: Start Sidewalk Importing
  • December 2017: Review and be done

Import Data

San Jose Pubic Works GIS Data:

Derived sidewalk data:

OSM Data Files

Generated on-the-fly based on neighborhood + block regions.

Import Type

The import will not be automated. We'll ask experienced mappers to create a separate import username such as username_sjsidewalks_import.

Data Preparation

Full script and details located here:

Tagging Plans

We'll just copy the reasoning of the Seattle Imports. Some work is being done with tagging streets but ultimately we think sidewalks as separate ways will be better.

We recognize we could be creating sidewalk islands since we aren't dealing with crossings or connecting the sidewalks to streets but most routing software doesn't use sidewalks and typically uses left/right tags. Regardless, adding sidewalks will eventually make things better.

There are attributes connected to the original sidewalks but they get lost during the data preparation process. None of the attributes available in the source data are typically used in OSM.

We will use source=San José Public Works Department


We are taking inspiration from the Seattle import: Seattle Washington Sidewalk Import


Our workflow is as follows:

  • Convert the city data to acceptable sidewalks in geojson format and split into tiny zones
  • Convert the tiny geojson zones of sidewalks into OSM XML using osmizer
  • Create an OSM Tasking Manager project using that set of polygons.

Our changeset size policy is smaller than census tract, and can be easily reviewed without making major map changes.

We take responsibility to revert any changes that cause breakages.


Our conflation strategy is we ask the importer to make a judgement call on which sidewalk is better and use that one accordingly. For the most part, we do not expect many conflation issues, due to the lack of existing sidewalk data.

Expected issues related to conflation:

  • Shared nodes with other pedestrian ways.
    • The sidewalk data will need to be combined with existing pedestrian ways intended to describe non-sidewalk pedestrian paths, such as plazas and trails.
    • In all such instances, a shared node will be added between the existing footpath and the new sidewalk data.
  • Existing pedestrian ways connected to streets:
    • Many dedicated pedestrian ways, such as paths through parks, are currently connected directly to streets.
    • These will not be modified in any way, outside of adding a node (described above) where they meet sidewalks. This helps prevent our import from breaking any existing routing software.
  • Existing crossings at intersections:
    • Where we encounter existing crossings at intersections we will make sure they are connected with the imported sidewalks.