KDnuggets Home » News » 2011 » May » Software » Wikipedia User Contribution Dataset  ( < Prev | 11:n13 | Next > )

Wikipedia User Contribution Dataset


 
  


This dataset was prepared for an ongoing study on user reputation and content quality in Wikipedia at the University of California, Irvine.

The research is carried out mainly by Sara Javanmardi under the supervision of Prof. Lopes and Prof. Baldi.


We processed the Wikipedia dump enwiki-20100130-pages-meta-history.xml.7z to extract the inserts and deletes made by each user. The dump contains all English Wikipedia articles up to January 2010. You can access this dataset both online and offline:

Online Access

We have prepared an XML interface that lets you extract the events that happened on a page. To extract the events for an article in English Wikipedia, you pass three parameters to the API and receive the results in XML format.

See nile.ics.uci.edu/events-dataset-api/
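
For readers who want a feel for what per-user insert and delete events look like, here is a minimal sketch that derives them from consecutive revisions of a page. It is not the method used to build the dataset, which is not described here. It assumes whitespace tokenization and uses Python's standard difflib as a stand-in diff algorithm. The function name extract_events, the (user, text) revision tuples, and the sample revisions are all hypothetical.

```python
import difflib


def extract_events(revisions):
    """Return (user, action, tokens) events from a page's revision history.

    `revisions` is a chronological list of (user, text) pairs. This input
    shape is an assumption made for illustration, not the dataset's format.
    """
    events = []
    prev_tokens = []  # the page is treated as empty before the first revision
    for user, text in revisions:
        tokens = text.split()  # naive whitespace tokenization (assumption)
        matcher = difflib.SequenceMatcher(a=prev_tokens, b=tokens, autojunk=False)
        for tag, i1, i2, j1, j2 in matcher.get_opcodes():
            # A "replace" opcode is recorded as a delete followed by an insert.
            if tag in ("delete", "replace"):
                events.append((user, "delete", prev_tokens[i1:i2]))
            if tag in ("insert", "replace"):
                events.append((user, "insert", tokens[j1:j2]))
        prev_tokens = tokens
    return events


if __name__ == "__main__":
    # Made-up revisions, used only to exercise the function.
    sample = [
        ("alice", "Irvine is a city"),
        ("bob", "Irvine is a planned city in California"),
        ("carol", "Irvine is a city in California"),
    ]
    for event in extract_events(sample):
        print(event)
```

Each event credits the editor of the newer revision with the tokens added or removed relative to the previous one. A real pipeline over the full history dump would also have to handle wiki markup, reverts, and moved text, which this sketch ignores.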

