Optical Character Recognition Of Typewritten Amharic Text

Information Sciences Project Topics

Get the Complete Project Materials Now! ยป

Optical Character Recognition is an area of research where a system is made to accept arndocument image and convert it into ASCII code so that it will be easy for storage, retrieval,rnand filterer processing. OCR helps to convert a bulk of information available on paper tornelectronically processable format without human intervention -- saving time, money, andrnlabor Recently Optical Character Recognition for the Amharic Script has become an area ofrnresearch interest. Some developments have been made in recognizing characters withrnspecific type style, font, and font size. All the trials in this regard are on very high qualityrnlaser printouts on white papers. In reality, however, most Amharic typewritten documentsrnthat need to be converted into machine-readable format are typewritten and on non-whiternpaper III this study an attempt is made to explore the possibilities of developing an OCR system forrntypewritten Amharic text. To this end, features of the typewritten Amharic characters arernthoroughly studied. Some algorithms for noise removal and segmentation are reviewed. Thesernalgorithms are implemented to see their performance on typewritten Amharic text. Previousrnalgorithm implemented for recognition of Amharic characters is modified to incorporate thernspecific features of typewritten Amharic characters. The segmentation and the noise removalrnalgorithms are integrated with this algorithm. The result is tested on typewritten Amharicrndocuments, and test results are presented. Recommendations are also drawn to point outrnissues to be investigated filterer for the development of typewritten Amharic OCR system withrnbetter performance.

Get Full Work

Report copyright infringement or plagiarism

Be the First to Share On Social



1GB data
1GB data

RELATED TOPICS

1GB data
1GB data
Optical Character Recognition Of Typewritten Amharic Text

273