Mechanical Edits/Mateusz Konieczny - bot account/elimination of old-style Wikipedia links in USA

From OpenStreetMap Wiki
Jump to navigation Jump to search

Page content created as advised on Automated_Edits_code_of_conduct#Document_and_discuss_your_plans.


Old style wikipedia link is one where language is stored in key, not in value.


For example "wikipedia:en=Kansas" is an old style link, while "wikipedia=en:Kansas" is a form that is currently standard.


Many old-style Wikipedia links remain and updating them to new style manually is boring, tedious and some mistakes may appear during this.


Some OSM elements have old-style Wikipedia link without new tag what means that this data is harder to process for editors and data consumers.


Also, remaining old-style Wikipedia tags confuse mappers, especially less experienced.

Who

I, Mateusz Konieczny using my bot account

contact

message via OSM I will respond also to PMs to the bot account, though messaging my main account is preferable as I will get notifications in OSM editors.

English and Polish languages are preferable, for other I need to use an automatic translator.

What

Removing old style wikipedia tags like wikipedia:en=article_name or wikipedia:pl=article_name where linked article is not conflicting wikipedia=* and wikidata=* tags.

Editing is restricted to USA.

Why

wikipedia=* are used by editing programs, editors and data consumers and considered as preferable to old-style wikipedia keys.

Old-style Wikipedia tags are considered as undesirable if they duplicate already existing tags (and nobody wants https://www.openstreetmap.org/relation/161644 with secondary wikipedia tags for every wikipedia which created article about this place).

Numbers

Depends on how many new old-style tags exist and will continue to appear - depends on editing activity in OSM. There is no working taginfo instance for USA (see https://wiki.openstreetmap.org/wiki/Taginfo/Sites ) but worldwide there are about 50k old-style Wikipedia tags ( https://taginfo.openstreetmap.org/search?q=wikipedia%3A ) and USA has only part of them - for start the most popular wikipedia:ru=* with 30k instances appear to present nearly completely outside USA.

How

  • Editing is limited to objects with old-style Wikipedia tags is not conflicting with existing wikipedia=* or wikidata=* tag or other old-style wikipedia tags.
  • Links detected as invalid (leading to disambigs, articles about humans, animals, plants, events etc) are also skipped
  • Each changeset contains a single element or group of close elements to avoid edits spanning across large areas (it is impossible in cases where edited object itself spans very large area)
  • After every changeset bot sleeps for one minute

An example based on https://www.openstreetmap.org/relation/8033743 :
state before a mechanical edit:

state after a mechanical edit (note wikipedia tag):

Changeset would be described and tagged with tags that mark it as automatic, provide link to discussion approving edit etc

Discussion

posted on talk-us at https://lists.openstreetmap.org/pipermail/talk-us/2019-February/019231.html

posted into slack us https://osmus.slack.com/archives/C029HV951/p1551187362056500 (as advised at https://github.com/osmlab/osm-community-index/pull/219#issuecomment-467427863 )

Repetition

This is proposed as reoccurring edit and may be made as soon as new old-style wikipedia links appear. At this moment triggering new edit requires human intervention and review so exact schedule is not predictable and bot may stop running at any moment.

Similar edits

I am already running the same type of edit in Poland. See Mechanical Edits/Mateusz Konieczny - bot account/elimination of old-style Wikipedia links in Poland for a documentation page and https://www.openstreetmap.org/changeset/67574797 for an example edit.

Opt-out

Please write in discussion thread at talk-us mailing list. To verify your account please also send me a PM via OSM messaging system.