Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus

Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata...

Full description

Saved in:
Bibliographic Details
Main Author: Çağdaş Çapkın
Format: Article
Language:English
Published: Türk Kütüphaneciler Derneği (Turkish Librarians' Association) 2016-12-01
Series:Türk Kütüphaneciliği
Subjects:
Online Access:http://tk.org.tr/index.php/TK/article/view/2731
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841563736398102528
author Çağdaş Çapkın
author_facet Çağdaş Çapkın
author_sort Çağdaş Çapkın
collection DOAJ
description Information institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate performance of this three systems, “precision - recall” and “normalized recall” tests were conducted. Experimental findings showed that there were no significant differences between MIR and FIR in mean average precision (MAP) performance. On the other hand, MAP performance of HIR was significantly higher in comparison to MIR and FIR. When information retrieval performance was evaluated as user-centered, the “normalized recall” performances of MIR and HIR were significantly higher than FIR. Additionally, there were no significant differences between the systems in retrieved relevant document means. Processing different types of contents such as metadata and full-text had some advantages and disadvantages for information retrieval systems in terms of term management. The advantages brought together in hybrid content processing (HIR) and information retrieval performance improved.
format Article
id doaj-art-f33e36c4e4a345f18df32ae39e8889fc
institution Kabale University
issn 1300-0039
2147-9682
language English
publishDate 2016-12-01
publisher Türk Kütüphaneciler Derneği (Turkish Librarians' Association)
record_format Article
series Türk Kütüphaneciliği
spelling doaj-art-f33e36c4e4a345f18df32ae39e8889fc2025-01-02T23:39:26ZengTürk Kütüphaneciler Derneği (Turkish Librarians' Association)Türk Kütüphaneciliği1300-00392147-96822016-12-013046787012647Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish CorpusÇağdaş ÇapkınInformation institutions use text-based information retrieval systems to store, index and retrieve metadata, full-text, or both metadata and full-text (hybrid) contents. The aim of this research was to evaluate impact of these contents on information retrieval performance. For this purpose, metadata (MIR), full-text (FIR) and hybrid (HIR) content information retrieval systems were developed with default Lucene information retrieval model for a small scale Turkish corpus. In order to evaluate performance of this three systems, “precision - recall” and “normalized recall” tests were conducted. Experimental findings showed that there were no significant differences between MIR and FIR in mean average precision (MAP) performance. On the other hand, MAP performance of HIR was significantly higher in comparison to MIR and FIR. When information retrieval performance was evaluated as user-centered, the “normalized recall” performances of MIR and HIR were significantly higher than FIR. Additionally, there were no significant differences between the systems in retrieved relevant document means. Processing different types of contents such as metadata and full-text had some advantages and disadvantages for information retrieval systems in terms of term management. The advantages brought together in hybrid content processing (HIR) and information retrieval performance improved.http://tk.org.tr/index.php/TK/article/view/2731Bilgi erişimdizinlemeotomatik dizinlemeüstveriperformans değerlendirmeTürk Kütüphaneciliği
spellingShingle Çağdaş Çapkın
Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus
Türk Kütüphaneciliği
Bilgi erişim
dizinleme
otomatik dizinleme
üstveri
performans değerlendirme
Türk Kütüphaneciliği
title Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus
title_full Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus
title_fullStr Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus
title_full_unstemmed Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus
title_short Impact of Metadata on Full-text Information Retrieval Performance: An Experimental Research on a Small Scale Turkish Corpus
title_sort impact of metadata on full text information retrieval performance an experimental research on a small scale turkish corpus
topic Bilgi erişim
dizinleme
otomatik dizinleme
üstveri
performans değerlendirme
Türk Kütüphaneciliği
url http://tk.org.tr/index.php/TK/article/view/2731
work_keys_str_mv AT cagdascapkın impactofmetadataonfulltextinformationretrievalperformanceanexperimentalresearchonasmallscaleturkishcorpus