Geonames [1] is a database which holds around 9 M entries of
geographical related items from all over the world.
Lsjbot is now generating articles from a subset of it, after several
months of extensive research on its quality, Wikidata relations and
notability issues. While the quality in some regions is substandard (and
these will not be generated) it was seen as very good in most areas. In
the discussion I was intrigued to learn that identical Arabic names
should be transcribed differently depending on its geographic location.
And I was fascinated of the question of notability of wells in the
Bahrain desert (which in the end was excluded, mostly because we knew
too little of that reality)
In this run Lsjbot has extended its functionality even further then when
it generated articles for species. It looks for relevant geographical
items close to the actual one: a lake close by, a mountain and where is
the nearest major town etc.
Macedonia can be taken as one example. Lsjbot generated over 10000
articles (and 5000 disambiguous pages) making it a magnitude more than
what exist in enwp. Also for a well defined type like villages, almost
50% as many has been generated than existing in enwp. One example [2]
where you can see what has been generated (and note the reuse of a
relevant figure existing in frwp). Please compare the corresponding
articles on other languages in this case, many having less information
than the bot generated one.
The generation is still in early stage [.3) but has already got the
article count for svwp to pass 2 M today. But it will take many months
more before completed and perhaps more M marks will be passed before it
is through. If you want to give feedback you are welcome to enter it at [4]
Anders
(with all credits for the Lsjbot to be given to Sverker, its owner, I am
just one of the many supporters of him and his bot on svwp)
[1]
http://www.geonames.org/about.html
[2]
https://sv.wikipedia.org/wiki/Polaki_%28ort_i_Makedonien%29
[3]
https://sv.wikipedia.org/wiki/Kategori:Robotskapade_geografiartiklar
[4]
https://sv.wikipedia.org/wiki/Anv%C3%A4ndardiskussion:Lsjbot/Projekt_alla_platser
_______________________________________________
Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, <mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe>
geographical related items from all over the world.
Lsjbot is now generating articles from a subset of it, after several
months of extensive research on its quality, Wikidata relations and
notability issues. While the quality in some regions is substandard (and
these will not be generated) it was seen as very good in most areas. In
the discussion I was intrigued to learn that identical Arabic names
should be transcribed differently depending on its geographic location.
And I was fascinated of the question of notability of wells in the
Bahrain desert (which in the end was excluded, mostly because we knew
too little of that reality)
In this run Lsjbot has extended its functionality even further then when
it generated articles for species. It looks for relevant geographical
items close to the actual one: a lake close by, a mountain and where is
the nearest major town etc.
Macedonia can be taken as one example. Lsjbot generated over 10000
articles (and 5000 disambiguous pages) making it a magnitude more than
what exist in enwp. Also for a well defined type like villages, almost
50% as many has been generated than existing in enwp. One example [2]
where you can see what has been generated (and note the reuse of a
relevant figure existing in frwp). Please compare the corresponding
articles on other languages in this case, many having less information
than the bot generated one.
The generation is still in early stage [.3) but has already got the
article count for svwp to pass 2 M today. But it will take many months
more before completed and perhaps more M marks will be passed before it
is through. If you want to give feedback you are welcome to enter it at [4]
Anders
(with all credits for the Lsjbot to be given to Sverker, its owner, I am
just one of the many supporters of him and his bot on svwp)
[1]
http://www.geonames.org/about.html
[2]
https://sv.wikipedia.org/wiki/Polaki_%28ort_i_Makedonien%29
[3]
https://sv.wikipedia.org/wiki/Kategori:Robotskapade_geografiartiklar
[4]
https://sv.wikipedia.org/wiki/Anv%C3%A4ndardiskussion:Lsjbot/Projekt_alla_platser
_______________________________________________
Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l@lists.wikimedia.org
Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, <mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe>