
An Analysis of Tools for an Automatic Extraction of Concept in Documents for a Better Knowledge Management

Author(s): Rocio Abascal (INSA of Lyon - LISI, France), Béatrice Rumpler (INSA of Lyon - LISI, France) and Jean-Marie Pinon (INSA of Lyon - LISI, France)
Copyright: 2003
Pages: 4
Source title: Information Technology & Organizations: Trends, Issues, Challenges & Solutions
Source Editor(s): Mehdi Khosrow-Pour, D.B.A. (Information Resources Management Association, USA)
DOI: 10.4018/978-1-59140-066-0.ch055
ISBN13: 9781616921248
EISBN13: 9781466665330

Abstract

In our project on a digital library (DL) of scientific theses, we need to give users access to the most pertinent information. It is therefore important to extract the main concepts of each document in order to improve information retrieval in this area. This article presents an empirical evaluation of four tools for automatically extracting concepts from documents. We compared these tools using different document collections. For each document, we manually extracted a list of concepts tied to its main topics. The tools are evaluated according to the degree of similarity between the manually defined concepts and the concepts they extract automatically. The four evaluated tools are: (1) TerminologyExtractor of Chamblon Systems Inc., (2) Xerox Terminology Suite of Xerox, (3) Nomino of Nomino Technologies, and (4) Copernic Summarizer of NRC. This article presents the evaluation criteria, a comparative study of the tools, an evaluation of the results, and a proposed tool for annotating documents based on the extracted concepts.
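
The abstract does not state how the degree of similarity between manual and automatic concept lists is measured. As an illustration only, the following minimal Python sketch assumes a simple set-overlap comparison (precision, recall and F1 after lowercasing and whitespace normalization). The function names and the sample concept lists are hypothetical and are not taken from the article.

```python
def normalize(concept: str) -> str:
    """Lowercase a concept and collapse runs of whitespace (assumed normalization)."""
    return " ".join(concept.lower().split())


def overlap_scores(manual, extracted):
    """Compare a manual concept list with a tool's output using set overlap.

    Returns (precision, recall, f1). This is an assumed similarity measure,
    not necessarily the one used in the article.
    """
    gold = {normalize(c) for c in manual}
    found = {normalize(c) for c in extracted}
    matched = gold & found

    precision = len(matched) / len(found) if found else 0.0
    recall = len(matched) / len(gold) if gold else 0.0
    f1 = (2 * precision * recall / (precision + recall)) if matched else 0.0
    return precision, recall, f1


# Hypothetical example data, invented for illustration.
manual_concepts = [
    "digital library",
    "information retrieval",
    "concept extraction",
    "annotation",
]
tool_concepts = ["Digital  Library", "concept extraction", "scientific theses"]

precision, recall, f1 = overlap_scores(manual_concepts, tool_concepts)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Under this assumption, a tool scores higher when more of its extracted concepts appear in the manual list (precision) and when it recovers more of the manual list (recall). Exact string matching is a deliberately strict choice; a real evaluation might also credit partial or morphological matches.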
