IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Document Classification

Document Classification
View Sample PDF
Copyright: 2021
Pages: 5
Source title: Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities
Source Author(s)/Editor(s): Dimple Valayil Paul (Department of Computer Science, Dnyanprassarak Mandal's College and Research Centre, Goa University, Goa, India)
DOI: 10.4018/978-1-7998-3772-5.ch007

Purchase

View Document Classification on the publisher's website for pricing and purchasing information.

Abstract

Keywords can be used as attributes for mining rules or as a basis for measuring the similarity of new (unclassified) documents with existing (classified) ones. The focus is on the problem of extracting keywords from document collection in order to use them as attributes for document classification. Document classification is a hot topic in machine learning. Typical approaches extract “features,” generally words, from document, and use the feature vectors as input to a machine learning scheme that learns how to classify documents. This “bag of keywords” model neglects keyword order and contextual effects.

Related Content

M. Govindarajan. © 2022. 23 pages.
Rajab Ssemwogerere, Wamwoyo Faruk, Nambobi Mutwalibi. © 2022. 33 pages.
Surabhi Verma, Ankit Kumar Jain. © 2022. 34 pages.
Kriti Aggarwal, Sunil K. Singh, Muskaan Chopra, Sudhakar Kumar. © 2022. 25 pages.
Praneeth Gunti, Brij B. Gupta, Elhadj Benkhelifa. © 2022. 26 pages.
Yin-Chun Fung, Lap-Kei Lee, Kwok Tai Chui, Gary Hoi-Kit Cheung, Chak-Him Tang, Sze-Man Wong. © 2022. 13 pages.
Lap-Kei Lee, Kwok Tai Chui, Jingjing Wang, Yin-Chun Fung, Zhanhui Tan. © 2022. 16 pages.
Body Bottom