GSoC/GCI Archive
Google Code-in 2012 Apertium

Create a corpus of Armenian from RFE/RL

completed by: Sushain Cherivirala

mentors: Francis Tyers, Jonathan

Write a plugin (=series of classes) for the apertium RFE/RL/etc. scraper (found at ) to parse the Armenian-language RFE/RL site (  You will also need to write a script to test it with, similar to the scrp-*.py scripts found with the scraper.  The test script should be demonstrated to work by scraping a month's worth of articles.