GSoC/GCI Archive
Google Code-in 2012 Apertium

FIXING PROPER NOUN TAGS FOR MACEDONIAN pt. 2

completed by: Андреј

mentors: Filip Petkovski

Notice: For this task you need to know bulgarian and macedonian. If you really thing you can do this task without knowing these two languages,  come at our IRC channel and we will see what can be done.

 

About 20.000 proper noun entries were automatically added to the Macedonian monolingual dictionary using resources like DBPedia and Wikipedia.

Many of the entries have the correct tags, but there is a substantial number of errors which need tobe fixed manually.

Download the xab and xac files from here:

https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mk-en/dev/gci/

and fix the incorrect tags.

 

Further instructions on IRC: irc.freenode.net #apertium