The Application Of Information Retrieval Techniques To Amharic Documents On The Web

Information Sciences Project Topics

Get the Complete Project Materials Now! »

The World Wide Web is an escalating mass of interconnected data that stretches from computerrnto computer across the world. Information retrieval systems on the Web provide users withrnrelevant information without human intervention, saving time, labor and money.rnThe Web contains documents of diverse content in different languages. Making those documentsrnaccessible to users has become a difficult task with the fast growth of the Web. Hencerndeveloping information retrieval systems to cope with inherent features of Web data has been arnresearch area of tile time in information science.rnIn this study an attempt is made to explore the possibilities of applying some information retrievalrntechniques for Amharic documents on the Web. To back tile research, literature review on relatedrnworks has been made. Different information retrieval techniques and algorithms used on otherrnlanguages have been reviewed to determine the possibilities of applying them to Amharicrndocuments on the Web.rnA database that stores Amharic Web page data, suffix list and index files has been designed.rnWeb page submission form was developed to allow the submission of Web page data into therndatabase. Designing an Amharic •query input interface was also part of the research.rnAutomatic indexing and searching techniques have been applied on a collection of 313 Webrnpages of Amharic documents taken from Walta Information Center news publications.rnWord and stem inverted index options were explored. An Amharic search interface was thenrncreated to handle Amharic data on the Web using ColdFusion Studio and ColdFusion Server 4.0rnon Windows NT 4.0 Operating System and Internet Information Server (liS).rnThe searching algorithm that was implemented is Expended Boolean model, which is a Booleanrnmodel with a vector functionality that allowed to rank retrieved documents.rnTo measure tile performance of the prototype system, retrieval experiments have beenrnconducted for twenty-two queries and an average recall-precision graph is drawn. Using termsrnwith suffixes and prefixes removed resulted in a better performance than using wordsrnFinally, conclusions are drawn based on the test results obtained and recommendations arernmade as 10 what further researches could be done for the development of Amharic informationrnretrieval systems on the Web.

Get Full Work

Report copyright infringement or plagiarism

Be the First to Share On Social



1GB data
1GB data

RELATED TOPICS

1GB data
1GB data
The  Application Of Information Retrieval Techniques To Amharic Documents On The Web

295