IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Feature Selection for Web Page Classification

Author(s): K. Selvakuberan (Tata Consultancy Services, India), M. Indra Devi (Thiagarajar College of Engineering, India), and R. Rajaram (Thiagarajar College of Engineering, India)
Copyright: 2009
Pages: 16
Source title: Social Implications of Data Mining and Information Privacy: Interdisciplinary Frameworks and Solutions
Source Author(s)/Editor(s): Ephrem Eyob (Virginia State University, USA)
DOI: 10.4018/978-1-60566-196-4.ch012

Abstract

The World Wide Web serves as a huge, widely distributed, global information service center for news, advertisements, customer information, financial management, education, government, e-commerce, and many other services. The Web contains a rich and dynamic collection of hyperlink information, and Web page access and usage information provides a rich source for data mining. Web pages are classified based on the content and/or contextual information embedded in them. Because Web pages contain many irrelevant, infrequent, and stop words that reduce the performance of the classifier, selecting relevant, representative features from a Web page is an essential preprocessing step. This provides secure access to the required information. Web access and usage information can be mined to predict the authenticity of the user accessing a Web page. This information may be used to personalize the information presented to users and to preserve their privacy by hiding personal details. The issue lies in selecting the features that represent the Web pages and in processing the details of the users who need that information. In this chapter we focus on feature selection, issues in feature selection, and the most important feature selection techniques described and used by researchers.
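As a rough illustration of the preprocessing step the abstract describes, the sketch below drops stop words and infrequent terms from page text before classification. It is not the chapter's method: the stop-word list, the `select_features` helper, the `min_doc_freq` threshold, and the sample pages are all assumptions made up for this example.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for illustration).
STOP_WORDS = {"the", "a", "an", "and", "or", "of", "to", "in", "is", "for", "on"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def select_features(pages, min_doc_freq=2):
    """Return sorted terms that are not stop words and that occur
    in at least `min_doc_freq` of the given pages."""
    doc_freq = Counter()
    for text in pages:
        # Count each term once per page (document frequency).
        doc_freq.update(set(tokenize(text)) - STOP_WORDS)
    return sorted(term for term, df in doc_freq.items() if df >= min_doc_freq)


# Made-up page texts, standing in for extracted Web page content.
pages = [
    "The price of the laptop is on sale",
    "Laptop sale: the best price for students",
    "Government news and education policy",
]
features = select_features(pages)
print(features)
```

With these sample pages, only terms shared by at least two pages survive the filter. Lowering `min_doc_freq` to 1 keeps every non-stop word, which shows how the threshold trades vocabulary size against noise.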
