dc.contributor |
Graduate Program in Computer Engineering. |
|
dc.contributor.advisor |
Say, Ahmet Celal Cem. |
|
dc.contributor.author |
Pembe, Fatma Canan. |
|
dc.date.accessioned |
2023-03-16T10:02:10Z |
|
dc.date.available |
2023-03-16T10:02:10Z |
|
dc.date.issued |
2004. |
|
dc.identifier.other |
CMPE 2004 P46 |
|
dc.identifier.uri |
http://digitalarchive.boun.edu.tr/handle/123456789/12300 |
|
dc.description.abstract |
Information retrieval (IR) has become an important application in today's computer world because of the great increase in the amount of web-based documents and the widespread use of the Internet. However, the classical "bag of words" approach no longer meets user expectations adequately. In this context, the use of natural language processing (NLP) techniques comes into mind. In this thesis, we investigate the question of whether NLP techniques can improve the effectiveness of information retrieval in Turkish. We implemented a linguistically motivated information retrieval system, called TURNA (TUrkish information Retrieval engine based on Natural language Analysis). The system uses knowledge of three different levels of natural language processing in document and query processing: morphological, syntactical and lexico-semantical levels. Different combinations of these NLP techniques are tested on a set of Turkish documents and queries. The results are evaluated in terms of precision and recall. It is shown that natural language processing techniques, especially stemming and the use of syntactical head-modifier pairs, can improve information retrieval effectiveness in Turkish. |
|
dc.format.extent |
30cm. |
|
dc.publisher |
Thesis (M.S.)-Bogazici University. Institute for Graduate Studies in Science and Engineering, 2004. |
|
dc.relation |
Includes appendices. |
|
dc.relation |
Includes appendices. |
|
dc.subject.lcsh |
Information storage and retrieval systems. |
|
dc.subject.lcsh |
Natural language processing (Computer science) |
|
dc.subject.lcsh |
Morphology. |
|
dc.subject.lcsh |
Turkish language -- Syntax. |
|
dc.title |
A linguistically motivated information retrieval system for Turkish |
|
dc.format.pages |
xiii, 66 leaves; |
|