GSoC/GCI Archive
Google Code-in 2012 Apertium

Fixing proper noun tags for macedonian pt. 5

completed by: Angel

mentors: Filip Petkovski

Notice: For this task you need to know macedonian. If you really thing you can do this task without knowing this language,  come to our IRC channel and we will see what can be done.


About 20.000 proper noun entries were automatically added to the Macedonian monolingual dictionary using resources like DBPedia and Wikipedia.

Many of the entries have the correct tags, but there is a substantial number of errors which need tobe fixed manually.

Download the xah and xai files from here:

and fix the incorrect tags.


Further instructions on IRC: #apertium