IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Lexical Granularity for Automatic Indexing and Means to Achieve It: The Case of Swedish MeSH®

Lexical Granularity for Automatic Indexing and Means to Achieve It: The Case of Swedish MeSH®
View Sample PDF
Author(s): Dimitrios Kokkinakis (University of Gothenburg, Sweden)
Copyright: 2009
Pages: 27
Source title: Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration
Source Author(s)/Editor(s): Violaine Prince (University Montpellier 2, France)and Mathieu Roche (University Montpellier 2, France)
DOI: 10.4018/978-1-60566-274-9.ch002

Purchase

View Lexical Granularity for Automatic Indexing and Means to Achieve It: The Case of Swedish MeSH® on the publisher's website for pricing and purchasing information.

Abstract

The identification and mapping of terminology from large repositories of life science data onto concept hierarchies constitute an important initial step for a deeper semantic exploration of unstructured textual content. Accurate and efficient mapping of this kind is likely to provide better means of enhancing indexing and retrieval of text, uncovering subtle differences, similarities and useful patterns, and hopefully new knowledge, among complex surface realisations, overlooked by shallow techniques based on various forms of lexicon look-up approaches. However, a finer-grained level of mapping between terms as they occur in natural language and domain concepts is a cumbersome enterprise that requires various levels of processing in order to make explicit relevant linguistic structures. This chapter highlights some of the challenges encountered in the process of bridging free text to controlled vocabularies and thesauri and vice versa. The author investigates how the extensive variability of lexical terms in authentic data can be efficiently projected to hierarchically structured codes, while means to increase the coverage of the underlying lexical resources are also investigated.

Related Content

Linkon Chowdhury, Md Sarwar Kamal, Shamim H. Ripon, Sazia Parvin, Omar Khadeer Hussain, Amira Ashour, Bristy Roy Chowdhury. © 2024. 20 pages.
Mousomi Roy. © 2024. 21 pages.
Nassima Dif, Zakaria Elberrichi. © 2024. 20 pages.
Pyingkodi Maran, Shanthi S., Thenmozhi K., Hemalatha D., Nanthini K.. © 2024. 16 pages.
Mohamed Nadjib Boufenara, Mahmoud Boufaida, Mohamed Lamine Berkane. © 2024. 16 pages.
Meroua Daoudi, Souham Meshoul, Samia Boucherkha. © 2024. 25 pages.
Zhongyu Lu, Qiang Xu, Murad Al-Rajab, Lamogha Chiazor. © 2024. 56 pages.
Body Bottom