Feature Selection Algorithms for Classification and Clustering in Bioinformatics

View Sample PDF

Author(s): Sujata Dash (Gandhi Institute for Technology, India)and Bichitrananda Patra (KMBB College of Engineering and Technology, India)
Copyright: 2014
Pages: 20
Source title: Global Trends in Intelligent Computing Research and Development
Source Author(s)/Editor(s): B.K. Tripathy (VIT University, India)and D. P. Acharjya (VIT University, India)
DOI: 10.4018/978-1-4666-4936-1.ch005

Keywords: Artificial Intelligence / Computational Intelligence / Computer Science & IT / Information Science Reference

Purchase

View Feature Selection Algorithms for Classification and Clustering in Bioinformatics on the publisher's website for pricing and purchasing information.

Abstract

This chapter discusses some important issues such as pre-processing of gene expression data, curse of dimensionality, feature extraction/selection, and measuring or estimating classifier performance. Although these concepts are relatively well understood among the technical people such as statisticians, electrical engineers, and computer scientists, they are relatively new to biologists and bioinformaticians. As such, it was observed that there are still some misconceptions about the use of classification methods. For instance, in most classifier design strategies, the gene or feature selection is an integral part of the classifier, and as such, it must be a part of the cross-validation process that is used to estimate the classifier prediction performance. Simon (2003) discussed several studies that appeared in prestigious journals where this important issue is overlooked, and optimistically biased prediction performances were reported. Furthermore, the authors have also discuss important properties such as generalizability or sensitivity to overtraining, built-in feature selection, ability to report prediction strength, and transparency of different approaches to provide a quick and concise reference. The classifier design and clustering methods are relatively well established; however, the complexity of the problems rooted in the microarray technology hinders the applicability of the classification methods as diagnostic and prognostic predictors or class-discovery tools in medicine.

The IRMA Community

Research IRM

Feature Selection Algorithms for Classification and Clustering in Bioinformatics

Purchase

Abstract

Related Content

IRMA Sponsors