Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus
Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Türk Kütüphaneciler Derneği (Turkish Librarians' Association)
2016-12-01
|
Series: | Türk Kütüphaneciliği |
Subjects: | |
Online Access: | http://tk.org.tr/index.php/TK/article/view/2731 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841563736398102528 |
---|---|
author | Çağdaş Çapkın |
author_facet | Çağdaş Çapkın |
author_sort | Çağdaş Çapkın |
collection | DOAJ |
description | Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate performance of this three systems, “precision - recall” and “normalized recall” tests were conducted. Experimental findings showed that there were no significant differences between MIR and FIR in mean average precision (MAP) performance. On the other hand, MAP performance of HIR was significantly higher in comparison to MIR and FIR. When information retrieval performance was evaluated as user-centered, the “normalized recall” performances of MIR and HIR were significantly higher than FIR. Additionally, there were no significant differences between the systems in retrieved relevant document means. Processing different types of contents such as metadata and full-text had some advantages and disadvantages for information retrieval systems in terms of term management. The advantages brought together in hybrid content processing (HIR) and information retrieval performance improved. |
format | Article |
id | doaj-art-f33e36c4e4a345f18df32ae39e8889fc |
institution | Kabale University |
issn | 1300-0039 2147-9682 |
language | English |
publishDate | 2016-12-01 |
publisher | Türk Kütüphaneciler Derneği (Turkish Librarians' Association) |
record_format | Article |
series | Türk Kütüphaneciliği |
spelling | doaj-art-f33e36c4e4a345f18df32ae39e8889fc2025-01-02T23:39:26ZengTürk Kütüphaneciler Derneği (Turkish Librarians' Association)Türk Kütüphaneciliği1300-00392147-96822016-12-013046787012647Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish CorpusÇağdaş ÇapkınInformation institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate performance of this three systems, “precision - recall” and “normalized recall” tests were conducted. Experimental findings showed that there were no significant differences between MIR and FIR in mean average precision (MAP) performance. On the other hand, MAP performance of HIR was significantly higher in comparison to MIR and FIR. When information retrieval performance was evaluated as user-centered, the “normalized recall” performances of MIR and HIR were significantly higher than FIR. Additionally, there were no significant differences between the systems in retrieved relevant document means. Processing different types of contents such as metadata and full-text had some advantages and disadvantages for information retrieval systems in terms of term management. The advantages brought together in hybrid content processing (HIR) and information retrieval performance improved.http://tk.org.tr/index.php/TK/article/view/2731Bilgi erişimdizinlemeotomatik dizinlemeüstveriperformans değerlendirmeTürk Kütüphaneciliği |
spellingShingle | Çağdaş Çapkın Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus Türk Kütüphaneciliği Bilgi erişim dizinleme otomatik dizinleme üstveri performans değerlendirme Türk Kütüphaneciliği |
title | Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus |
title_full | Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus |
title_fullStr | Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus |
title_full_unstemmed | Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus |
title_short | Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus |
title_sort | impact of metadata on full text information retrieval performance an experimental research on a small scale turkish corpus |
topic | Bilgi erişim dizinleme otomatik dizinleme üstveri performans değerlendirme Türk Kütüphaneciliği |
url | http://tk.org.tr/index.php/TK/article/view/2731 |
work_keys_str_mv | AT cagdascapkın impactofmetadataonfulltextinformationretrievalperformanceanexperimentalresearchonasmallscaleturkishcorpus |