Archives and Documentation Center
Digital Archives

A linguistically motivated information retrieval system for Turkish

Show simple item record

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Say, Ahmet Celal Cem.
dc.contributor.author Pembe, Fatma Canan.
dc.date.accessioned 2023-03-16T10:02:10Z
dc.date.available 2023-03-16T10:02:10Z
dc.date.issued 2004.
dc.identifier.other CMPE 2004 P46
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12300
dc.description.abstract Information retrieval (IR) has become an important application in today's computer world because of the great increase in the amount of web-based documents and the widespread use of the Internet. However, the classical "bag of words" approach no longer meets user expectations adequately. In this context, the use of natural language processing (NLP) techniques comes into mind. In this thesis, we investigate the question of whether NLP techniques can improve the effectiveness of information retrieval in Turkish. We implemented a linguistically motivated information retrieval system, called TURNA (TUrkish information Retrieval engine based on Natural language Analysis). The system uses knowledge of three different levels of natural language processing in document and query processing: morphological, syntactical and lexico-semantical levels. Different combinations of these NLP techniques are tested on a set of Turkish documents and queries. The results are evaluated in terms of precision and recall. It is shown that natural language processing techniques, especially stemming and the use of syntactical head-modifier pairs, can improve information retrieval effectiveness in Turkish.
dc.format.extent 30cm.
dc.publisher Thesis (M.S.)-Bogazici University. Institute for Graduate Studies in Science and Engineering, 2004.
dc.relation Includes appendices.
dc.relation Includes appendices.
dc.subject.lcsh Information storage and retrieval systems.
dc.subject.lcsh Natural language processing (Computer science)
dc.subject.lcsh Morphology.
dc.subject.lcsh Turkish language -- Syntax.
dc.title A linguistically motivated information retrieval system for Turkish
dc.format.pages xiii, 66 leaves;


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Archive


Browse

My Account