The IRMA Community
Newsletters
Research IRM
Click a keyword to search titles using our InfoSci-OnDemand powered search:
|
Best Practices of Feature Selection in Multi-Omics Data
|
Author(s): Funda Ipekten (Erciyes University, Turkey), Gözde Ertürk Zararsız (Erciyes University, Turkey), Halef Okan Doğan (Cumhuriyet University, Turkey), Vahap Eldem (Istanbul University, Turkey)and Gökmen Zararsız (Erciyes University, Turkey)
Copyright: 2023
Pages: 15
Source title:
Encyclopedia of Data Science and Machine Learning
Source Author(s)/Editor(s): John Wang (Montclair State University, USA)
DOI: 10.4018/978-1-7998-9220-5.ch122
Purchase
|
Abstract
With the recent advances in molecular biology techniques such as next-generation sequencing, mass-spectrometry, etc., a large omic data is produced. Using such data, the expression levels of thousands of molecular features (genes, proteins, metabolites, etc.) can be quantified and associated with diseases. The fact that multiple omics data contains different types of data and the number of analyzed variables increases the complexity of the models created with machine learning methods. In addition, due to many variables, the investigation of molecular variables associated with diseases is very costly. Therefore, selecting the informative and disease-related molecular features is applicable before model training and evaluation. This feature selection step is essential for obtaining accurate and generalizable models in minimum time with minimum cost. Some current methods used for feature selection are as follows: recursive feature elimination, information gain, minimum redundancy maximum relevance (mRMR), boruta, altmann, and lasso.
Related Content
Princy Pappachan, Sreerakuvandana, Mosiur Rahaman.
© 2024.
26 pages.
|
Winfred Yaokumah, Charity Y. M. Baidoo, Ebenezer Owusu.
© 2024.
23 pages.
|
Mario Casillo, Francesco Colace, Brij B. Gupta, Francesco Marongiu, Domenico Santaniello.
© 2024.
25 pages.
|
Suchismita Satapathy.
© 2024.
19 pages.
|
Xinyi Gao, Minh Nguyen, Wei Qi Yan.
© 2024.
13 pages.
|
Mario Casillo, Francesco Colace, Brij B. Gupta, Angelo Lorusso, Domenico Santaniello, Carmine Valentino.
© 2024.
30 pages.
|
Pratyay Das, Amit Kumar Shankar, Ahona Ghosh, Sriparna Saha.
© 2024.
32 pages.
|
|
|