The Apertium project
Mailing List: apertium-stuff@lists.sourceforge.net
* The Apertium project develops a free/open-source platform for machine translation and language technology. We try to focus our efforts on lesser-resourced and marginalised languages, but also work with larger languages.
* The platform, which includes data for a large number of language pairs, a translation engine, and auxiliary tools, is being developed around the world, largely in universities and companies (e.g. Prompsit Language Engineering), though independent free-software developers also play a huge role.
* There are currently 25 published language pairs within the project (including a number of "firsts", for example Spanish--Occitan, Breton--French, and Basque--Spanish), and several more in development.
Completed Tasks
- Add 150 or so verbs to the Dutch morphological analyser
- Add 350 adjectives to the Dutch analyser
- Add compound word support to Dutch--Afrikaans
- Add genitives to paradigms in Dutch morphological analyser
- Add missing nouns to Dutch analyser
- Add separable verbs to the Dutch morphological analyser
- Add the top-100 missing words to the Afrikaans--Dutch translator
- Add words to dictionaries to give complete coverage over pending tests (Dutch-Afrikaans)
- Add ~300 verbs and some nouns to the Dutch morphological analyser
- Bulgarian and Russian noun dictionary
- Catalogue resources: Aromanian
- Catalogue resources: Friulian
- Catalogue resources: Kazakh
- Catalogue resources: Kyrgyz
- Categorise nouns and paradigms by genitive suffix needed
- Categorise translation errors in Afrikaans to Dutch MT
- Comparative evaluation Google Translate / Apertium
- Complete new language pair HOWTO: Bulgarian--Russian
- Complete Wikipedia article: Hindi
- Complete Wikipedia article: Romanian
- Contrastive analysis: Bulgarian and Greek
- Contrastive analysis: Bulgarian and Russian
- Contrastive analysis: Czech and Slovak
- Contrastive analysis: English and French
- Contrastive analysis: Hindi and Urdu
- Contrastive analysis: Italian and Sardinian
- Contrastive analysis: Latvian and Russian
- Contrastive analysis: Norwegian (Bokmål) and English
- Contrastive analysis: Polish and Lower Sorbian (dolnołużycki)
- Contrastive analysis: Polish and Slovak
- Contrastive analysis: Romanian and French
- Convert Czech--Polish MT system to Czech--Slovak
- Convert Java code for decomposing compound words into C++
- Create a PHP class to translate with Apertium
- Create a wordlist for Aromanian-Romanian
- Create disambiguation rules for Dutch and Afrikaans
- Describe the Dutch--Afrikaans MT system
- Document error categorisation in Dutch--Afrikaans MT
- Document frequent errors in the Afrikaans to Dutch translator
- English-French: find the top 400 missing words from a set of Wikipedia articles
- Evaluate efficacy of decompounding algorithm in Dutch--Afrikaans MT
- Expand Bulgarian and Russian dictionary
- Expand Bulgarian and Russian dictionary
- Expand Bulgarian and Russian dictionary: Continuation
- Expand Bulgarian and Russian dictionary: Continuation
- Expand coverage of the Dutch to Afrikaans system
- Expand Italian and Friulian dictionary
- Expand Wikipedia article: Bulgarian
- Expansion of Macedonian Wikipedia
- Extract Dutch inflection from Wiktionary
- Finish HOWTO Translation: Polish
- Finish translating HOWTO: Bulgarian
- Finish translating HOWTO: Russian
- Fix errors in Dutch->Afrikaans MT system
- Fix generation errors in Dutch->Afrikaans MT system
- Further evaluation: Dutch--Afrikaans MT
- Guide for Windows users
- Kazakh-English dictionary
- Morphological analyser: Dutch
- Morphological analyser: Dutch
- NSIS installer script for hfst in Windows
- NSIS installer script for language pairs in Windows
- NSIS installer script for Windows
- Proofread Bulgarian and Greek dictionary
- Proofread Friulian and Italian dictionary
- Proofread HOWTO translation: Bengali
- Report on release freshness
- Semi-automatic Romanian and French dictionary
- Test the genitive handling in Afrikaans to Dutch MT
- Train part-of-speech taggers for Dutch and Afrikaans
- Translate 500 verbs from Afrikaans to Dutch
- Translate 'Cross-model' Wiki page into Romanian
- Translate 300 verbs from Bulgarian to Russian
- Translate Afrikaans nouns and adjectives into Dutch
- Translate Afrikaans verbs, adverbs and closed categories into Dutch
- Translate Apertium Wikipedia article: Dutch
- Translate Apertium Wikipedia article: Italian
- Translate Apertium Wikipedia article: Macedonian
- Translate Apertium Wikipedia article: Polish
- Translate Apertium Wikipedia article: Punjabi
- Translate Apertium Wikipedia article: Slovak
- Translate Apertium Wikipedia article: Urdu
- Translate from Czech to Polish
- Translate HOWTO: Italian
- Translate HOWTO: Slovakian
- Translate prepositions, conjunctions and adverbs from Bulgarian to Russian
- Translate the Apertium New Language Pair HOWTO to Ukrainian
- Translate the HOWTO: Norwegian Bokmål
- Translate ~260 words from Czech to Polish
- Write an "Indirect Contributors Guide"
- Write an Apertium-aware wrapper for Hunspell's 'analyze'