ogr2osm
ogr2osm is a Python script which converts any OGR supported vector format into .osm or .pbf format, with an emphasis on converting polygon boundaries into relations.
Unlike most (if not all) other scripts, such as shp2osm and polyshp2osm, it can save a processing step by converting from any coordinate system supported by PROJ.4 to the system used by OpenStreetMap, (EPSG:4326).
Using a mechanism known as "translation", the script also allows the conversion of source tags and meta-data into properly formatted OpenStreetMap tags.
The current version is maintained by Roel Derickx, released under a MIT license, and located at github. Its development and documentation are focused on users of Ubuntu >18.04 or other Debian-based system.
Installation
Dependencies
Requires the ogr and osr module from the gdal library. (Debian/Ubuntu users will need to install gdal-bin, libgdal-dev and python3-gdal). If you want .pbf output you need to install the protocol buffers as well (Debian/Ubuntu users will need to install libprotobuf-dev, protobuf-compiler and python3-protobuf).
To install
git clone https://github.com/roelderickx/ogr2osm.git cd ogr2osm git submodule update --init
pip3 install .
To update ogr2osm:
cd ogr2osm git pull git submodule update
Usage
The basic usage is the same for all versions.
Run the script without any parameters or with --help to see the usage instructions. By default all attributes in the source file will be added to the resulting OSM file as is, however it can be supplied with a Python-based conversion file (known as a 'translation') to convert the source file's attributes into OSM tags. This allows the user to take full advantage of the Python language for processing attributes, such as fixing abbreviations or capitalization.
Example usage producing my-shapefile.osm, ready to be loaded (i.e. into JOSM).
ogr2osm my-shapefile.shp [-t my-translation-file.py]
See the translations/ directory for sample translation files. The translations for roelderickx's version are kept in the ogr2osm-translations repo and should be automatically downloaded.
How it works
The script has three separate phases: parse, merge, and output.
During the parse phase, the script loops through all the geometries in the OGR datasource and adds them to internal data structures. All lines and polygons get converted into a set of 2-node segments. Duplicated or re-used nodes and segments are detected and treated.
During the merge phase, 2-node segments will be merged into longer chains of nodes. Two adjacent segments or chains will be merged if both of them are members of the same geometries. This algorithm will effectively detect the node chains that are the boundary lines between polygons.
After the merge phase, all lines and polygons will be sets of node chains (ways). Geometries consisting of one way will be output as ways, and geometries consisting of more will be output as relations.
Known issues
Large Datasets
roelderickx's ogr2osm will work on very large files, given enough ram
The original version fails to convert a dataset when it is very big (e.g. a 500MB shapefile). In March 2010 Ivansanchez was reported to be working on a revamped version of ogr2osm that would be much slower (10x) but would hold all the data in a SQLite database instead of in memory.
Running under Windows
This thread http://lists.osgeo.org/pipermail/gdal-dev/2009-December/022992.html highlights a problem with running ogr2osm under Microsoft Windows. It will work with the OSGeo4W shell (download) provided that python-gdal bindings are installed. An easier way to install GDAL and python bindings for running ogr2osm on Windows is to use the Windows binaries from gisinternals.com. It is enough to install GDAL core components and python bindings for the right python version by running the corresponding MSI installer packages. Some people may feel it simpler to use a virtual machine using Ubuntu then to run ogr2osm under Windows.
Dealing with CSV files
Converting CSV files using the GDAL library is not an easy task. csv2osm is a script similar to ogr2osm to convert .csv files to the .osm format. This script allows to specify the columns containing the longitude and latitude information and to translate the columns into proper OpenStreetMap tags using a user-provided Python file, in the same way ogr2osm does.
Upgrading from pnorman's version
In the latest version of ogr2osm the translation file must contain a subclass of ogr2osm.TranslationBase in stead of separate functions to make the code more maintainable. If you are upgrading from pnorman's version and you use a translation file for your data, be sure to read about the modifications you will need to do.
License
Current Version - MIT License
Original/SVN: "Beerware" -
"THE BEER-WARE LICENSE":
<ivan@sanchezortega.es> wrote this file. As long as you retain this notice you can do whatever you want with this stuff. If we meet some day, and you think this stuff is worth it, you can buy me a beer in return.
History
Ogr2osm was created by Iván Sánchez Ortega, his version can be found on the OSM SVN server. It was rewritten by Andrew Guertin (Andrew Guertin's version for UVM) and subsequently maintained by Paul norman (Paul Norman's rewrite of Andrew Guertin's version for UVM). None of these versions is still under active development or maintenance.
