Stability of feature selection methods : a study of metrics across different gene expression datasets

Mungloo-Dilmohamud, Zahra; Jaufeerally-Fakim, Yasmina; Peña-Reyes, Carlos Andrés

doi:10.1007/978-3-030-45385-5_59

2020

Abstract

Analysis of gene expression data often requires selecting a subset of genes (features), and many feature selection (FS) methods have been devised for this purpose. However, FS methods often generate different lists of features for the same dataset, and users then have to choose which list to use. One approach to support this choice is to apply stability metrics to the generated lists and select lists on that basis. The aim of this study is to investigate the behavior of stability metrics applied to feature subsets generated by FS methods. The experiments in this work explore a range of gene expression datasets, FS methods, and expected numbers of features in order to compare several stability metrics. The stability metrics have been used to compare five feature selection methods (SVM, SAM, ReliefF, RFE + RF and LIMMA) on gene expression datasets from the EBI repository. Results show that the studied stability metrics display a high amount of variability. The reason behind this is not yet clear and is being investigated further. The final objective of the research, namely to define how to select an FS method, is ongoing work whose partial findings are reported herein.
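As an illustration of what applying a stability metric to feature lists can look like, the sketch below computes the mean pairwise Jaccard similarity between the lists produced by several FS runs. The abstract does not name the metrics that were studied, so the choice of the Jaccard index is an assumption made here for illustration, and the function names and gene lists are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two feature lists, treated as sets."""
    a, b = set(a), set(b)
    if not a and not b:
        # Two empty selections are treated as identical (a convention, assumed here).
        return 1.0
    return len(a & b) / len(a | b)


def mean_pairwise_jaccard(feature_lists):
    """Average Jaccard similarity over all pairs of feature lists.

    Values near 1 mean the runs select nearly the same features;
    values near 0 mean the selections barely overlap.
    """
    pairs = list(combinations(feature_lists, 2))
    if not pairs:
        raise ValueError("need at least two feature lists")
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical gene lists, e.g. from one FS method run on three resamples.
runs = [
    ["TP53", "BRCA1", "EGFR", "MYC"],
    ["TP53", "BRCA1", "KRAS", "MYC"],
    ["TP53", "EGFR", "KRAS", "MYC"],
]
score = mean_pairwise_jaccard(runs)
print(f"mean pairwise Jaccard: {score:.2f}")
```

A score of this kind could be computed per FS method and per requested subset size, which is the sort of comparison the abstract describes; other set-based or rank-based metrics would slot into the same structure.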

Details

Title

Stability of feature selection methods : a study of metrics across different gene expression datasets

Author(s)

Mungloo-Dilmohamud, Zahra (University of Mauritius, Reduit, Mauritius)
Jaufeerally-Fakim, Yasmina (University of Mauritius, Reduit, Mauritius)
Peña-Reyes, Carlos Andrés (School of Engineering and Management Vaud, HES-SO, University of Applied Sciences and Arts Western Switzerland)

Date

2020-09

Published in

Proceedings of 8th International Work-Conference on Bioinformatics and Biomedical Engineering, IWBBIO 2020: Bioinformatics and Biomedical Engineering, 30 September-2nd October 2020, Granada, Spain

Volume

2020, pp. 659-669

Published by

Granada, Spain, 30 September-2nd October 2020

Pagination and equivalents

11 p.

Presented at

International Work-Conference on Bioinformatics and Biomedical Engineering (IWBBIO 2020) : Bioinformatics and Biomedical Engineering, Granada, Spain, 2020-09-30, 2020-10-02

ISBN

978-3-030-45384-8

DOI

https://doi.org/10.1007/978-3-030-45385-5_59

Series and no.

Lecture Notes in Computer Science (LNCS), vol. 12108

Keywords (free)

stability ; stability metrics ; FS methods ; gene expression data

Paper type

full paper

Field

Engineering and Architecture

School

HEIG-VD

Institute

IICT - Institut des Technologies de l'Information et de la Communication

The document appears in

Conference papers
Global
