
Framework of tools and libraries for building and running bots on Wikipedia

greencardamom/BotWikiAwk


BotWikiAwk is a framework of tools and libraries for creating and running bots on Wikipedia.

  • Bot management tools compatible with bots written in any language
  • .. and/or libraries for bots written in awk
  • No SQL database: data files are plain text
  • Manage batches of articles of any size: 50 for WP:BRFA, or 50k to unlimited for production runs
  • Runs using GNU parallel, making full use of multi-core CPUs
  • ..or on the Toolforge grid across 40+ distributed computers
  • Dry-run mode: diffs can be reviewed before uploading
  • Inline colorized diffs on the command-line
  • Re-run individual pages from a cached copy of the page (download the wiki source once, run the bot many times)
  • Installs in a single directory, easily removed
  • Includes complete example bots and skeleton bots
  • Includes a general awk library developed over years of writing bots
  • Includes a standalone command-line program to interface with the MediaWiki API
  • In development and private use since 2016. Public since June 2018

Example: run a 50k-article bot with three commands:

    makebot ~/BotWikiBot/bots/accdate
    project -c -p accdate20181102.00001-50000
    runbot accdate20181102.00001-50000 auth
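
The three commands above share a bot directory and a project ID, so they can be generated from two values. The following is a minimal sketch, not part of BotWikiAwk: `print_batch_commands` is a hypothetical helper that only prints the commands exactly as they appear in the example, so they can be inspected before anything runs.

```shell
# Hypothetical helper (not shipped with BotWikiAwk): print the three
# commands for one batch without executing them.
# Arguments: bot directory, project ID (values taken from the example above).
print_batch_commands() {
    botdir="$1"
    project="$2"
    printf 'makebot %s\n' "$botdir"
    printf 'project -c -p %s\n' "$project"
    printf 'runbot %s auth\n' "$project"
}

print_batch_commands "$HOME/BotWikiBot/bots/accdate" accdate20181102.00001-50000
```

Once the printed commands look right, they could be run by piping the output to `sh`.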

  • Install BotWikiAwk library

      cd ~
      git clone 'https://github.com/greencardamom/BotWikiAwk'
      export AWKPATH=.:/home/user/BotWikiAwk/lib:/usr/share/awk
      export PATH=$PATH:/home/user/BotWikiAwk/bin
      cd ~/BotWikiAwk
      ./setup.sh
      # Read SETUP for further instructions, e.g. setting up email
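
The two `export` lines above only last for the current shell session. As a minimal sketch, assuming a Bourne-compatible shell, the hypothetical helper `botwikiawk_env` below prints the same two lines for any install directory, so they can be appended to a startup file such as `~/.bashrc`. The helper name and the choice of startup file are assumptions, not part of BotWikiAwk.

```shell
# Hypothetical helper (not shipped with BotWikiAwk): print the two export
# lines from the install steps for a given install directory.
# Defaults to $HOME/BotWikiAwk when no argument is given.
botwikiawk_env() {
    dir="${1:-$HOME/BotWikiAwk}"
    printf 'export AWKPATH=.:%s/lib:/usr/share/awk\n' "$dir"
    # $PATH is left unexpanded on purpose so it is evaluated at login time.
    printf 'export PATH=$PATH:%s/bin\n' "$dir"
}

# Show the lines for the path used in the install steps.
botwikiawk_env /home/user/BotWikiAwk
```

To persist the settings, the output could be appended once with `botwikiawk_env >> ~/.bashrc`, after checking that the file does not already contain these lines.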
    

Documentation
