Wikimedia

In Pywikibot's download_dump.py file, handle cases when the dump file already exists

Pywikibot is a Python-based framework to write bots for MediaWiki (more information).

Thanks to work in Google Code-in, Pywikibot now has a script called download_dump.py. It downloads a Wikimedia database dump from http://dumps.wikimedia.org/ , and places the dump in a predicable directory for semi-automated use by other scripts and tests.

If the same file already exists in the folder :

  • If the filename doesn't contain latest, it shouldn't be downloaded again.
  • Endif, add the current date as a suffix to the name

You are expected to provide a patch in Wikimedia Gerrit. See https://www.mediawiki.org/wiki/Gerrit/Tutorial for how to set up Git and Gerrit.

Task tags

  • python
  • pywikibot

Students who completed this task

Rafid Aslam

Task type

  • code Code
close

2017