GSoC/GCI Archive
Google Code-in 2012 Apertium

Bible scraper improvements

completed by: Daniel Huang

mentors: Francis Tyers, Jonathan

Add the following features to the Bible scraper developed in a previous task:

  • scrape the Kyrgyz Bible translation
  • scrape Bible translations that have only a few verses, such as Uzbek and Turkmen
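One way to handle translations with only a few verses is to treat a missing chapter as normal rather than as an error. The following is a minimal sketch of that idea, not the actual scraper: the markup (`<span class="verse" id="...">`), the `VerseParser` class, and the `scrape_available` function are all hypothetical, and a real site will need its own selectors and its own way of fetching pages.

```python
from html.parser import HTMLParser


class VerseParser(HTMLParser):
    """Collect verse text from <span class="verse" id="..."> tags.

    The markup is an assumption made for illustration only.
    """

    def __init__(self):
        super().__init__()
        self.verses = {}      # reference -> verse text
        self._current = None  # reference of the verse being read, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "span" and attrs.get("class") == "verse" and "id" in attrs:
            self._current = attrs["id"]
            self.verses[self._current] = ""

    def handle_endtag(self, tag):
        if tag == "span":
            self._current = None

    def handle_data(self, data):
        if self._current is not None:
            self.verses[self._current] += data


def scrape_available(pages):
    """Parse every page that exists and skip chapters the site lacks.

    `pages` maps a chapter name to its HTML, or to None when the
    translation has no such chapter (as in a partial translation).
    """
    verses = {}
    for chapter, html in pages.items():
        if html is None:
            continue  # partial translation: nothing to scrape here
        parser = VerseParser()
        parser.feed(html)
        parser.close()
        verses.update(parser.verses)
    return verses
```

With this shape, a full translation and a partial one go through the same code path; the partial one simply yields fewer verses.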

For this task, start by checking out the svn directory https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-tools/scrapers-misc/ and putting the old script there for the duration of development. This will make it easier to follow your progress. Make sure the script's name is unique to the website being scraped (e.g., biblescraper-website.py). To get commit access to our svn, you will need a SourceForge account.

And make sure the script still functions as before!  You don't want to lose any functionality when you add features.
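A simple way to check that nothing was lost is to save the script's output for an already-supported translation before making changes, then compare it with the output afterwards. The sketch below assumes only that the output can be captured as text; the `output_diff` helper is hypothetical and not part of the scraper.

```python
import difflib


def output_diff(before, after):
    """Return unified-diff lines between two saved outputs.

    An empty list means the two outputs are identical.
    """
    return list(
        difflib.unified_diff(
            before.splitlines(),
            after.splitlines(),
            fromfile="before",
            tofile="after",
            lineterm="",
        )
    )
```

An empty result for every previously supported translation suggests the new features did not change existing behaviour.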