Please use this identifier to cite or link to this item: https://hdl.handle.net/11108/315
Title: 

Descriptor-Invariant Fusion Architectures for Automatic Subject Indexing

Authors: 
Toepfer, Martin
Seifert, Christin
Year of Publication: 
2017
Citation: 
[Title:] Proceedings of Joint Conference on Digital Libraries, 2017 [ISBN:] 978-1-5386-3861-3 [Publisher:] IEEE [Place:] Piscataway [Pages:] 1-10
Abstract: 
Documents indexed with controlled vocabularies enable users of libraries to discover relevant documents, even across language barriers. Due to the rapid growth of scientific publications, digital libraries require automatic methods that index documents accurately, especially with regard to explicit or implicit concept drift, that is, with respect to new descriptor terms and new types of documents, respectively. This paper first analyzes architectures of related approaches on automatic indexing. We show that their design determines individual strengths and weaknesses and justify research on their fusion. In particular, systems benefit from statistical associative components as well as from lexical components applying dictionary matching, ranking, and binary classification. The analysis emphasizes the importance of descriptor-invariant learning, that is, learning based on features which can be transferred between different descriptors. Theoretic and experimental results on economic titles and author keywords underline the relevance of the fusion methodology in terms of overall accuracy and adaptability to dynamic domains. Experiments show that fusion strategies combining a binary relevance approach and a thesaurus-based system outperform all other strategies on the tested data set. Our findings can help researchers and practitioners in digital libraries to choose appropriate methods for automatic indexing.
Subjects: 
Indexing
Vocabulary
Thesauri
Computer Architecture
Economics
Libraries
Machine assisted indexing
Persistent Identifier of the first edition: 

Files in This Item:
There are no files associated with this item.





Items in ZBWPub are protected by copyright, with all rights reserved, unless otherwise indicated.