Skip to content

An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia.

License

Notifications You must be signed in to change notification settings

wikiwho/WikiWho

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

77 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WikiWho

An algorithm to identify the revision origin of - and all changes ever applied to - the tokens of a revisioned Wiki document. This enables, e.g., detecting authorship and editor interactions.

Also check out the WikiWho API for current data from Wikipedia.

Requirements and Installation

WikiWho runs both on python 2 and 3.

requests package is required to get revision meta data and text from Wikipedia api.

mwxml package is required to get revision meta data and text from xml dumps. This package runs on only python 3.

Install WikiWho package using pip:

pip install WikiWho

Running WikiWho

You can check example scripts under WikiWho/WikiWho/examples to see how to run WikiWho.

Contact

  • Fabian Floeck: fabian.floeck[.]gesis.org
  • Maribel Acosta: maribel.acosta[.]kit.edu
  • Kenan Erdogan: kenan.erdogan[.]gesis.org

License

This work is licensed under MIT.

Developed at Karlsruhe Institute of Technology and GESIS - Leibniz Institute for the Social Sciences

About

An algorithm to compute token-level provenance and changes for Wiki revisioned content. Tested at +95% accuracy for EN.Wikipedia.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 100.0%