IRMA-International.org: Creator of Knowledge
Information Resources Management Association

A Bayesian Framework for Improving Clustering Accuracy of Protein Sequences Based on Association Rules

Author(s): Peng-Yeng Yin (National Chi Nan University, Taiwan), Shyong-Jian Shyu (Ming Chuan University, Taiwan), Guan-Shieng Huang (National Chi Nan University, Taiwan) and Shuang-Te Liao (Ming Chuan University, Taiwan)
Copyright: 2009
Pages: 15
Source title: Medical Informatics: Concepts, Methodologies, Tools, and Applications
Source Author(s)/Editor(s): Joseph Tan (McMaster University, Canada)
DOI: 10.4018/978-1-60566-050-9.ch171

Abstract

With the advent of new sequencing technologies for biological data, the number of sequenced proteins stored in public databases has grown explosively. Structural, functional, and phylogenetic analyses of proteins would benefit from exploring these databases with data mining techniques. Clustering algorithms assign proteins to clusters such that proteins in the same cluster are more homologous to one another than to proteins in other clusters. This procedure not only simplifies the analysis task but also enhances the accuracy of the results. Most existing protein-clustering algorithms compute the similarity between proteins by one-to-one pairwise sequence alignment rather than multiple sequence alignment, because the latter is prohibitively expensive to compute. As a result, the accuracy of the clustering deteriorates. Furthermore, traditional clustering methods are ad hoc, and the resulting clustering often converges to a local optimum. This chapter presents a Bayesian framework for improving the clustering accuracy of protein sequences based on association rules. The experimental results show that the proposed framework can significantly improve the performance of traditional clustering methods.
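To make the baseline described in the abstract concrete, the following is a minimal sketch of clustering driven purely by pairwise similarity. It is not the chapter's Bayesian framework, which the abstract does not specify. Everything here is an assumption made for illustration: the function names `kmers`, `similarity`, and `cluster` are hypothetical, the Jaccard similarity of k-mer sets stands in for a real pairwise alignment score, and single-linkage grouping with a fixed threshold stands in for a traditional clustering method.

```python
from itertools import combinations


def kmers(seq, k=3):
    """Return the set of overlapping k-mers in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a, b, k=3):
    """Jaccard similarity of k-mer sets.

    Assumption: this is only a cheap stand-in for a pairwise
    alignment score, not the measure used in the chapter.
    """
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def cluster(seqs, threshold=0.3, k=3):
    """Single-linkage clustering over all sequence pairs.

    Two sequences end up in the same cluster when a chain of pairs,
    each with similarity >= threshold, connects them.
    Returns a sorted list of clusters, each a list of indices.
    """
    parent = list(range(len(seqs)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(seqs)), 2):
        if similarity(seqs[i], seqs[j], k) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(seqs)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    # Toy sequences invented for this example, not real proteins.
    toy = ["MKTAYIAKQR", "MKTAYIAKQK", "GGSLLPWWNE", "GGSLLPWWND"]
    print(cluster(toy))
```

Because every decision rests on one pair of sequences at a time, a single misleading pairwise score can merge or split clusters. That is the kind of accuracy loss the abstract attributes to methods that avoid multiple sequence alignment.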
