Episode 45: BLP’s Revisited

Monday April 14th 2008, 5:23 pm
Filed under: Audio, Episodes, MP3

* Andrew Lih
* Tawker
* [Not] Privatemusings

Topics this week

* Wikipedia unblocked in China
* 100 x 5,000
o After Wikipedia reached 10,000,000 global articles on March 27 (see archived story), the Kapampangan Wikipedia became the 100th Wikipedia to reach 5,000 articles on April 2.
o Showing the growth of smaller-language Wikipedias, the 100th Wikipedia to reach 100 articles was the Nahuatl Wikipedia in June 2005 (see archived story); June 2006 saw the 100th to reach 1,000 articles, the Venetian Wikipedia (see archived story). In April 2006, about 73.4% of existing articles were located on the Top 10 Wikipedias; since then, only 59.4% of the new articles created over the last two years were on Top 10 Wikipedias.
* Provisional Volunteer Council - proposal sent to the Board
o effeietsanders
* Multilingual stats recap
o Hello, I would like to hear in WW more about the Wikipedia language versions, what happens in e.g. the Japanese or the Finnish Wikipedia. en:User:Ziko-en
o http://en.wikipedia.org/wiki/Wikipedia:Multilingual_statistics
o how does Volapuk, a conlang, have 115,000 articles, 112,000 added just this year?
* Wikimedia blog
* Wikimedia Foundation board meeting
* http://en.wikipedia.org/wiki/Wikipedia:BOO

 
icon for podpress  Wikipedia Weekly 45: BLP's Revisited [62:29m]: Play Now | Play in Popup | Download (1006)

 
icon for podpress  Wikipedia Weekly 45: BLP's Revisited: Play Now | Play in Popup | Download (399)
Share and Enjoy:
  • del.icio.us
  • digg
  • Fark
  • NewsVine
  • Reddit
  • Wists
  • YahooMyWeb

2 Comments »

  1. partionable?!

    that would have made more sense if you’d said, “every user would have to download the 133gigabyte full-edit-history dump, in order for a program to analyze any part of it.”

    Comment by samuel — April 21, 2008 @ 4:39 am

  2. There was a question here about the astounding growth of the Volapuk Wiki. I am an ‎administrator on the Esperanto Wiki (another conlang) so I’ve been watching the Volapuk ‎like they are a bad neighbor bringing down property values in the neighborhood.‎

    Yes, the panel was right, Volapuk was the work of a bot run amok. The bot operator ‎Smeira (the only active member of the Volapuk community) said that he wanted to ‎artificially inflate his numbers to advertise the Volapuk language. Please note there are an estimated 20 speakers of this language world-wide.‎

    The bot-operator essentially took databases of geographic data combined it with data out ‎of the various wikis, and fed them to his bot and out spat a bunch of articles. ‎Unfortunately, last I checked, the bot wasn’t well-controlled, and many of the pages were ‎filled with leftover vestiges of text and broken templates where the article was stolen ‎from. So, yes, there are 115000 stubs there, but they probably wouldn’t be considered ‎quality stubs by the other 19 Volapukists of the world.‎

    Since the guy started there have been a couple of failed attempts to get him to stop ‎puffing up the scores in the form of Requests for Deletion at Meta. In the second ‎proposal, Jimmy Wales himself has asked them to stop it and play nice, but stopped short ‎of declaring an edict to close it down. I think both of those proposals were defeated by ‎Smeira’s bringing in outside voters to support his position, and browbeating everyone ‎that disagreed with him.‎

    Essentially, the Volapuk Wikipedia is chock-full of geography stubs. 97% of their ‎activity supports the bot-made cookie-cutter geography stubs. ‎

    The only good that I’ve seen come out of this is the Esperanto Wikipedia giving up bot-‎creation of articles cold-turkey. We have called a moratorium on new articles by bot, and ‎we’re now creating articles the good old fashioned way. Yes, it’s slower, but there’s a big ‎difference in quality.‎

    Comment by Scott Starkey — May 1, 2008 @ 8:11 pm

RSS feed for comments on this post. TrackBack URI

Leave a comment

Line and paragraph breaks automatic, e-mail address never displayed, HTML allowed: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

(required)

(required)


 




Conestogastreet theme by Theron Parlin
Creative Commons License
This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License. It is also licensed under the GFDL. You may pick the license to use. This podcast is not a project nor is it endorsed by the Wikimedia Foundation. All ideas presented are those of the individual.