IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Document Classification

Copyright: 2021
Pages: 5
Source title: Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities
Source Author(s)/Editor(s): Dimple Valayil Paul (Department of Computer Science, Dnyanprassarak Mandal's College and Research Centre, Goa University, Goa, India)
DOI: 10.4018/978-1-7998-3772-5.ch007

Abstract

Keywords can be used as attributes for mining rules or as a basis for measuring the similarity of new (unclassified) documents to existing (classified) ones. This chapter focuses on the problem of extracting keywords from a document collection so that they can serve as attributes for document classification. Document classification is an active research topic in machine learning. Typical approaches extract “features,” generally words, from each document and use the resulting feature vectors as input to a machine learning scheme that learns how to classify documents. This “bag of keywords” model neglects keyword order and contextual effects.
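
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the chapter's method: the stopword list, the cosine similarity measure, the nearest-document decision rule, and the function names (bag_of_keywords, cosine_similarity, classify) are all assumptions chosen for illustration. The sketch turns each document into a keyword-count vector, compares a new document with already classified ones, and shows that keyword order is ignored.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on"}


def bag_of_keywords(text):
    """Return keyword counts for a text; word order is discarded."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(t for t in tokens if t not in STOPWORDS)


def cosine_similarity(a, b):
    """Cosine similarity between two keyword-count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(new_text, labeled_docs):
    """Assign the label of the most similar classified document.

    labeled_docs is a list of (text, label) pairs. Nearest-document
    matching is an illustrative choice, not the chapter's classifier.
    """
    new_vec = bag_of_keywords(new_text)
    scored = (
        (label, cosine_similarity(new_vec, bag_of_keywords(text)))
        for text, label in labeled_docs
    )
    best_label, _ = max(scored, key=lambda pair: pair[1])
    return best_label


if __name__ == "__main__":
    classified = [
        ("the team scored a goal in the match", "sports"),
        ("the bank raised interest rates", "finance"),
    ]
    print(classify("the team won the match", classified))
    # Word order does not affect the representation:
    print(bag_of_keywords("dog bites man") == bag_of_keywords("man bites dog"))
```

Because the two sentences in the last line map to the same keyword counts, this representation cannot tell them apart. That is the loss of order and context the abstract describes.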
