Automatic Stemming For Amharic Text An Experiment Using Successor Variety Approach

Information Sciences Project Topics

Get the Complete Project Materials Now! ยป

The extensive use of the World Wide Web and the increasing digital availability ofrninformation and documents accelerated the demand for technologies and tools forrnan online data retrieval and extraction application. The natural language research,rnwith the aim of quick and reliable online information searching and access, is onernmajor component of the current advanced information technology development. Inrnthis research , an indexing system was developed and programmed by using thernSuccessor Variety Stemming Algorithm to find stems for Amharic words. Thernresearch has set out to discover whether the Successor Variety StemmingrnAlgorithm technique with the peak and plateau, entropy and complete wordrnmethods can be used for the Amharic language or what the limitation would be. Inrnaddition, the peak and plateau method compared with the entropy and therncomplete words method. Stemming is typically used in the hope of improving thernaccuracy of the search reducing the size of the index. A corpus of 6270 words wasrnobtained form the Ethiopian News Agency (ENA) and Walta Information Centerrnand used to train and test the methods.rnThe experiment result showed that, the peak and plateau method had arnperformance of 71 .8% level of accuracy, but the performance of the entropy andrncomplete word methods are 63.95% and 57 .99% level of accuracy respectively.rnBased on the observation made from the experimentation result, the successorrnvariety algorithm with the peak and plateau method had a better performance thanrnsuccessor variety algorithm with the entropy method

Get Full Work

Report copyright infringement or plagiarism

Be the First to Share On Social



1GB data
1GB data

RELATED TOPICS

1GB data
1GB data
Automatic Stemming For Amharic Text An Experiment Using Successor Variety Approach

277