Apertium

write a scraper for a bible.is content

Write a script, preferably in python (but ruby or similar is okay too), that accepts the name of a bible translation at the bible.is website (such as "UIGUMK") and dumps all content into a plain text file formatted using a sane format. It should be able to handle most (if not all) text translations available on the site, so make sure to test it on multiple languages.

You should make sure to set up command line arguments for the script in a sane way (for translation name, output file name, etc.). Feel free to copy code from other scrapers we have around.

For further information and guidance on this task, you are encouraged to come to our IRC channel.

Task tags

  • python
  • bible
  • html
  • scraper

Students who completed this task

vigneshv

Task type

  • code Code
close

2015