OSM Scientific Tools

From OpenStreetMap Wiki
Jump to navigation Jump to search

This page has the purpose of collecting and sharing information about:

  • Scientific computing on OSM vector data;
  • Geo-statistical analysis tools;
  • Data mining on OSM;
  • Tools and techniques for road network analysis.


Getting a local copy of OSM vector data

To run scientific analyses of OSM data, it is highly recommended to download a local copy of the vector data, without overloading the project's web infrastructure. OSM vector data is available in a variety of formats. Depending on the particular task, certain formats can be more appropriate than others.

  • OSM XML: general purpose format.
  • PBF is a binary format that supports fast access to OSM features.
  • For the analysis of geometries, PostGIS is a suitable DBMS. OSM vector data can be easily imported into PostGIS with Osm2postgresql.

Data Exploration

The following software can be used to explore OSM data:

  • OSMPythonTools is a Python library that enables easy and straight-forward access to Overpass endpoints, Nominatim, and the OSM API. The library makes use of Pandas internally in order to organize the resulting data and Matplotlib to visualize the results.

Processing Libraries and Tools

The following languages can be used to perform scientific computation and analysis of OSM vector data:

  • C++
  • Python: Numpy is suitable for statistical processing; see articles filed in Category:Python for OSM-specific Python tools.
  • Java
  • R

Libraries :

  • Osmium
    • Osmium Library ( A fast and flexible C++ library for working with OSM data. )
    • PyOsmium Python bindings to Osmium Library
    • Node-Osmium Node.js bindings to Osmium Library
    • Osmium Tool A multipurpose command line tool based on the Osmium Library
  • osmdata R package for fast extraction of OSM data and conversion to OGC standard Simple Features format
  • esy-osmfilter Python library for filtering and extraction of OSM data and conversion to GeoJSON.

Geo-statistical tools

R is a popular open-source platform to perform geo-statistical analyses. Several packages have been developed to process OSM data:

Processing of tags and semantics

The following Semantic Web projects can support the processing of OSM vector data from a semantic perspective: