Mechanical Edits/Mateusz Konieczny - bot account/elimination of old-style Wikipedia links in Poland

From OpenStreetMap Wiki
Jump to navigation Jump to search

Page content created as advised on Automated_Edits_code_of_conduct#Document_and_discuss_your_plans.

Old style wikipedia link is one where language is stored in key, not in value.


For example "wikipedia:en=Kansas" is an old style link, while "wikipedia=en:Kansas" is a form that is currently standard.


Many old-style Wikipedia links remain and updating them to new style manually is boring, tedious and some mistakes may appear during this.


Some OSM elements have old-style Wikipedia link without new tag what means that this data is harder to process for editors and data consumers.


Also, remaining old-style Wikipedia tags confuse mappers, especially less experienced.

Who

I, Mateusz Konieczny using my bot account

contact

message via OSM I will respond also to PMs to the bot account, though messaging my main account is preferable as I will get notifications in OSM editors.

English and Polish languages are preferable, for other I need to use an automatic translator.

What

Removing old style wikipedia tags like wikipedia:en=article_name or wikipedia:pl=article_name where linked article is not conflicting wikipedia=* and wikidata=* tags.

Editing is restricted to Poland.

Why

wikipedia=* are used by editing programs, editors and data consumers and considered as preferable to old-style wikipedia keys.

Old-style Wikipedia tags are considered as undesirable if they duplicate already existing tags (and nobody wants https://www.openstreetmap.org/relation/2768922 with secondary wikipedia tags for every wikipedia which created article about this place).

Numbers

Depends on how many new old-style tags appear - depends on editing activity in OSM. Such occurrences are extremely rare, but at this moment there are around 6000 old tags to process - see http://taginfo.openstreetmap.pl/search?q=wikipedia%3A .

How

  • Editing is limited to objects with old-style Wikipedia tags is not conflicting with existing wikipedia=* or wikidata=* tag or other old-style wikipedia tags.
  • Links detected as invalid (leading to disambigs, articles about humans, animals, plants, events etc) are also skipped
  • Each changeset contains a single region to avoid edits spanning across large areas (it is impossible in cases where edited object itself spans very large area)
  • After every changeset bot sleeps for one minute

A theoretical example:
state before a mechanical edit:

state after a mechanical edit:


Changeset would be described and tagged with tags that mark it as automatic, provide link to discussion approving edit etc

Discussion

Approved by Polish OSM community at https://forum.openstreetmap.org/viewtopic.php?id=62261

Repetition

This is reoccurring edit and may be made as soon as new matching elements appear. At this moment triggering new edit requires human intervention so exact schedule is not predictable and bot may stop running at any moment.

This can change in a future. If bot is abandoned and does not run, feel free to ping me. If I am unable to run it any more feel free to use my code. Note that it may require going through bot approval process again and that code is on specific license.

https://codeberg.org/matkoniecz/OpenStreetMap_cleanup_scripts/src/branch/master/recurrent_bot_edits may have more up to date code version that what is listed on this page

Opt-out

Please write at https://forum.openstreetmap.org/viewtopic.php?id=62261 .