Designing English-Kambaatissa Bilingual Electronic Dictionary Using Parallel Corpora


The main aim of this study is to design an English-Kambaatissata bilingual electronic dictionary using English-Kambaatissata parallel corpora. The literature on Kambaatissata phonology and morphosyntax was reviewed. Then, based on knowledge of Kambaatissata morphology, the study adopted a statistical machine translation approach. To do so, IBM Model 1 was used: a word alignment model that is widely used with parallel bilingual corpora and that implements the expectation-maximization algorithm.

In general, 1194 Kambaatissata and English sentences were used from parallel raw text. The raw texts were collected from the English-Kambaatissata Constitution of the Federal Democratic Republic of Ethiopia (1995), a training material for primary school mother tongue teachers, and a children's Bible story book. A database of word alignment probabilities was developed from the aligned sentences. These probabilities were used to select word translations in both directions, English to Kambaatissata and Kambaatissata to English. After the translation equivalents of the English and Kambaatissata words were obtained, the parts of speech and glosses of the English terms were extracted from the English WordNet.

The accuracy of the designed prototype was tested using 273 (20%) of the retrieved dictionary terms. Based on the manual evaluation, the result shows that 61.5% of the translations were correct.

Key Words: Bilingual electronic dictionary, parallel corpora, design, alignment and enhancement
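The alignment step described in the abstract can be illustrated with a minimal sketch of IBM Model 1 trained by expectation-maximization. This is not the study's implementation: the function names `train_ibm_model1` and `best_translation` are hypothetical, the NULL source word of the full model is omitted for brevity, and the target tokens (`k_the`, `k_house`, and so on) are placeholders, not real Kambaatissata words.

```python
from collections import defaultdict


def train_ibm_model1(pairs, iterations=10):
    """Estimate t(f | e) from (source_tokens, target_tokens) sentence pairs.

    Simplified IBM Model 1 (assumption: no NULL source word).
    Returns a dict keyed by (target_word, source_word).
    """
    target_vocab = {f for _, tgt in pairs for f in tgt}
    uniform = 1.0 / len(target_vocab)
    t = defaultdict(lambda: uniform)  # uniform initialisation

    for _ in range(iterations):
        count = defaultdict(float)  # expected count of (f, e) alignments
        total = defaultdict(float)  # expected count of e being aligned

        # E-step: distribute each target word over the source words
        # of its sentence, in proportion to the current t(f | e).
        for src, tgt in pairs:
            for f in tgt:
                norm = sum(t[(f, e)] for e in src)
                for e in src:
                    delta = t[(f, e)] / norm
                    count[(f, e)] += delta
                    total[e] += delta

        # M-step: re-normalise expected counts into probabilities.
        t = defaultdict(
            float, {(f, e): c / total[e] for (f, e), c in count.items()}
        )
    return t


def best_translation(t, source_word):
    """Pick the target word with the highest t(f | source_word)."""
    candidates = {f: p for (f, e), p in t.items() if e == source_word}
    return max(candidates, key=candidates.get) if candidates else None


# Toy parallel corpus with placeholder target tokens.
toy_pairs = [
    (["the", "house"], ["k_the", "k_house"]),
    (["the", "book"], ["k_the", "k_book"]),
    (["a", "book"], ["k_a", "k_book"]),
]
model = train_ibm_model1(toy_pairs)
print(best_translation(model, "book"))
```

A table like `model` corresponds roughly to the database of word alignment probabilities mentioned above; swapping the two sides of each sentence pair and retraining would give the reverse direction.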
