TfL Cycling Infrastructure Database

From OpenStreetMap Wiki
Jump to navigation Jump to search

Transport for London (TfL) have created a new database of cycling infrastructure, containing 240,000 assets, covering all of Greater London. This has been released as open data on 1st August 2019.

TfL CID - cycle track.png

This groundbreaking database contains every cycle infrastructure asset within Greater London, including assets on and off-carriageway. The assets surveyed are: cycle parking; signals; signage; traffic calming measures; restricted points (e.g. steps); advanced stop lines; crossings; cycle lanes/tracks; and restricted routes (e.g. pedestrian only routes).

A map of the data has been made available (see below).

About the CID database

TfL’s official press release stated:

“The world’s first Cycling Infrastructure Database will be the most comprehensive database of cycling infrastructure ever collected in London. Over the past 18 months, TfL has amassed data on every street in London, cataloguing almost 146,000 cycle parking spaces, 2,000 km of cycle lanes and more than 58,000 cycle signs and street markings. This information will be released as open data alongside a new digital map of cycle routes, will make journey planning and cycle parking much easier, as well as offering valuable information to TfL and the boroughs for planning future investment in cycling.”

Each asset is accompanied by two photos illustrating it, which will considerably enhance the ability of OSM mappers to merge data in remotely.

The collected data is a snapshot in time ranging between January 2017 and May 2018. The data was professionally surveyed by a team of surveyors.

TfL is keen to make this available to the OpenStreetMap community under a compatible open license, to ensure maximum use of the CID. TfL is also potentially willing to consider tool development to help facilitate sensitive merging in of this data.

Demonstrator map - feedback on data invited

TfL CID - cycle parking.png

A demonstrator map, has been created by CycleStreets.

We are specifically seeking comments on data quality and usefulness of this data from the OSM community. Initial analysis by CycleStreets is that the data is of excellent quality, and very suitable for conflation into OSM, to increase both comprehensiveness and metadata quality.

https://bikedata.cyclestreets.net/tflcid/

Usage notes: The controls on the right of the map allow the different feature types to be selected. The OSM layer (available at zoom level 19+) also provides a live feed from the OSM API, to enable quick comparisons. The two photos of each asset are shown, which will be particular useful for OSM to verify; all c. half-million photos have been cleared for GDPR purposes.

Current work

CycleStreets has been commissioned by TfL to create a report aimed at facilitating re-use of this data within OSM. As of August 1st, the report is nearing completion, a little later than expected following discussions during May/June with TfL relating to the license.

The deliverables of this report are to:

  1. Establish a mapping between the CID schema and geography types and the OpenStreetMap tagging system and geography types.
  2. Review the TfL open data licence and provide recommendations on licence compatibility with OpenStreetMap in regards to adding the CID data into OpenStreetMap.
  3. Identify options (e.g. tools, other arrangements) whereby TfL can utilise crowdsourcing to keep the CID up to date without introducing licence restrictions which are incompatible with their own open data licence.
  4. Undertake a comprehensive review of existing specialist OpenStreetMap data import (conflation) and data collection/data update tools. Provide recommendations as to which tool(s) are most suitable and whether any further tool development is required.
  5. If further tool development is required, outline the scope and engage with the relevant parties to provide TfL an estimate of the range of potential cost.
  6. Commence community engagement and report initial findings to TfL. In particular, is the OpenStreetMap community supportive of the data being added and are they likely to engage with the process of adding and maintaining the data? This will give TfL a better view how to proceed (e.g. is it worth proceeding to tool development and will the tools be used by the community or should TfL plan for the tools to be used by paid mappers?).

CID schema

The TfL CID Schema is available.

TfL have a version of the database which adds a further field which associates the asset feature with the relevant OSM Way nearby, using a GIS analysis.

Two images accompany each asset. These have been processed to meet data and privacy regulation.

Licensing

The OSMF Licensing Working Group have confirmed that they "believe that it is unproblematic to use this data in or as a source for OpenStreetMap", as noted in their minutes and as posted on talk-gb

Earlier detail:

TfL has made the data available under its open data license. This is the Transport Data Service, which is "based on version 2.0 of the Open Government Licence with specific amendments for Transport for London".

The OSMF Licensing Working Group has been contacted for their view on the compatibility of this license with the OSM Contributor Terms. We note the LWG's comments about Open Government Licence (OGL) based licences

Discussions during the summer have established the following, as noted on talk-gb:

  • The license is indeed that at: https://tfl.gov.uk/corporate/terms-and-conditions/transport-data-service , which is based on Open Government Licence v2 with some changes.
  • The license now contains mention of containing Geomni UKMap data, as of 17th July 2019.
  • The data was collected by the surveyors using UKMap as a background map, and then checking was later performed using aerial imagery from the same supplier.
  • Geomni have confirmed they do not regard themselves as having residual data rights in the released data, because TfL "haven't simply copied features from our data".
  • There is no use of Ordnance Survey data at all.
  • TfL are happy with commercial / non-commercial use of the released data.


Feedback

Feedback is very strongly encouraged, as soon as possible.

Please do discuss the data and related aspects noted above on the talk-gb mailing list.

Feedback and questions can also be e-mailed to: CycleStreets: info AT cyclestreets.net .

We are happy to provide any clarifications, which will be added to this page, as a central repository of information about the project.