GSoC/GCI Archive
Google Code-in 2012 Apertium

Extract Armenian adverb translations from Wiktionary

completed by: Denis Nikolov

mentors: Francis Tyers, Jonathan

Wiktionary has many translations for Armenian adverbs. For example, consider the entry:



ագահաբար (agahabar)

  1. greedily, avidly
  2. eagerly




The idea of this task is to extract these translations into lttoolbox XML format as follows:

<e c=""><p><l>ագահաբար<s n="adv"/></l><r>greedily<s n="adv"/></r></p></e>
<e c=""><p><l>ագահաբար<s n="adv"/></l><r>avidly<s n="adv"/></r></p></e>
<e c=""><p><l>ագահաբար<s n="adv"/></l><r>eagerly<s n="adv"/></r></p></e>
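A minimal sketch of how the entries above could be generated, assuming the senses have already been pulled out of the page as plain strings. The function name `make_entries` and its parameters are hypothetical, chosen only for illustration; splitting a sense on commas is also an assumption, based on the "greedily, avidly" example.

```python
from xml.sax.saxutils import escape


def make_entries(headword, senses, pos="adv"):
    """Build one lttoolbox <e> line per translation of `headword`.

    `senses` is a list of definition strings; each string may hold
    several comma-separated translations (assumption based on the
    "greedily, avidly" example).
    """
    entries = []
    for sense in senses:
        for translation in sense.split(","):
            translation = translation.strip()
            if not translation:
                continue
            # Escape XML special characters, then write spaces inside
            # multiword translations as <b/>, as lttoolbox dictionaries do.
            left = escape(headword).replace(" ", "<b/>")
            right = escape(translation).replace(" ", "<b/>")
            entries.append(
                '<e c=""><p><l>{0}<s n="{2}"/></l>'
                '<r>{1}<s n="{2}"/></r></p></e>'.format(left, right, pos)
            )
    return entries


if __name__ == "__main__":
    for line in make_entries("ագահաբար", ["greedily, avidly", "eagerly"]):
        print(line)
```

Run as a script, this prints the three lines shown in the sample above.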


You will need to look out for the following:


* Only take translations from the "Adverb" section, not the "Adjective" section. Many Armenian adverbs can also be adjectives.
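The check above could be sketched roughly as follows. This assumes the page is available as raw wikitext in which the language is a level-2 heading (`==Armenian==`), the part of speech is a deeper heading (`===Adverb===`), and each definition line starts with `# `. The function name `adverb_senses` is hypothetical, and real pages may need extra handling for templates and other markup.

```python
import re

HEADING = re.compile(r"^(=+)\s*(.*?)\s*=+\s*$")
# [[target]] or [[target|label]] -> keep only the visible text
LINK = re.compile(r"\[\[(?:[^\]|]*\|)?([^\]]*)\]\]")


def adverb_senses(wikitext, language="Armenian"):
    """Return definition lines found under the Adverb heading only."""
    senses = []
    in_language = False
    in_adverb = False
    for line in wikitext.splitlines():
        heading = HEADING.match(line)
        if heading:
            level = len(heading.group(1))
            title = heading.group(2)
            if level == 2:
                # A new language section starts; reset the POS flag.
                in_language = (title == language)
                in_adverb = False
            else:
                # Any deeper heading switches the current section.
                in_adverb = in_language and title == "Adverb"
            continue
        # Only plain definition lines ("# ..."), not examples ("#: ...").
        if in_adverb and line.startswith("# "):
            senses.append(LINK.sub(r"\1", line[2:]).strip())
    return senses
```

Definitions under "Adjective", or under "Adverb" in another language's section, are skipped because the flags are reset at every heading.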



For further information about this task, join us on IRC: #apertium