IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities

Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities
Author(s)/Editor(s): Dimple Valayil Paul (Department of Computer Science, Dnyanprassarak Mandal's College and Research Centre, Goa University, Goa, India)
Copyright: ©2021
DOI: 10.4018/978-1-7998-3772-5
ISBN13: 9781799837725
ISBN10: 1799837726
EISBN13: 9781799837732

Purchase

View Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities on the publisher's website for pricing and purchasing information.


Description

The main problems that prevent fast and high-quality document processing in electronic document management systems are insufficient and unstructured information, information redundancy, and the presence of large amounts of undesirable user information. The human factor has a significant impact on the efficiency of document search. An average user is not aware of the advanced option of a query language and uses typical queries. Development of a specialized software toolkit intended for information systems and electronic document management systems can be an effective solution of the tasks listed above. Such toolkits should be based on the means and methods of automatic keyword extraction and text classification. The categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last 10 years due to the increased availability of documents in digital form and the ensuing need to organize them. Thus, research on keyword extraction, advancements in the field, and possible future solutions is of great importance in current times.

Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities presents an information extraction mechanism that can process many kinds of inputs, realize the type of text, and understand the percentage of the keywords that has to be stored. This mechanism then supports information extraction and information categorization mechanisms. This module is used to support a text summarization mechanism, which leads—with the help of the keyword extraction module—to text categorization. It employs lexical and information retrieval techniques to extract phrases from the document text that are likely to characterize it and determines the category of the retrieved text to present a summary to the users. This book is ideal for practitioners, stakeholders, researchers, academicians, and students who are interested in the development of a new keyword extractor and document classifier method.



Author's/Editor's Biography

Dimple Paul
Working as Asst. Professor in the Department of Computer Science since 20 years

More...
Less...

Body Bottom