Performance comparison of multi-label learning algorithms on clinical data for chronic diseases

Zufferey, Damien; Hofer, Thomas; Hennebert, Jean; Schumacher, Michael; Ingold, Rolf; Bromuri, Stefano

doi:10.1016/j.compbiomed.2015.07.017

Zufferey, Damien; Hofer, Thomas; Hennebert, Jean; Schumacher, Michael; Ingold, Rolf; Bromuri, Stefano

2015

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

We are motivated by the issue of classifying diseases of chronically ill patients to assist physicians in their everyday work. Our goal is to provide a performance comparison of state-of-the-art multi-label learning algorithms for the analysis of multivariate sequential clinical data from medical records of patients affected by chronic diseases. As a matter of fact, the multi-label learning approach appears to be a good candidate for modeling overlapped medical conditions, specific to chronically ill patients. With the availability of such comparison study, the evaluation of new algorithms should be enhanced. According to the method, we choose a summary statistics approach for the processing of the sequential clinical data, so that the extracted features maintain an interpretable link to their corresponding medical records. The publicly available MIMIC-II dataset, which contains more than 19,000 patients with chronic diseases, is used in this study. For the comparison we selected the following multi-label algorithms: ML-kNN, AdaBoostMH, binary relevance, classifier chains, HOMER and RAkEL. Regarding the results, binary relevance approaches, despite their elementary design and their independence assumption concerning the chronic illnesses, perform optimally in most scenarios, in particular for the detection of relevant diseases. In addition, binary relevance approaches scale up to large dataset and are easy to learn. However, the RAkEL algorithm, despite its scalability problems when it is confronted to large dataset, performs well in the scenario which consists of the ranking of the labels according to the dominant disease of the patient.

Détails

Titre

Performance comparison of multi-label learning algorithms on clinical data for chronic diseases

Auteur(s)/ trice(s)

Zufferey, Damien (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis))
Hofer, Thomas (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis))
Hennebert, Jean (DIVA Group, Department of Informatics, University of Fribourg)
Schumacher, Michael (Schumacher, Michael)
Ingold, Rolf (DIVA Group, Department of Informatics, University of Fribourg)
Bromuri, Stefano (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis))

Date

2015-10

Publié dans

Computers in biology and medicine

Volume

October 2015, vol. 65, pp. 34–43

DOI

https://doi.org/10.1016/j.compbiomed.2015.07.017

ISSN

0010-4825

Mots-clés (libres)

multi-label learning ; complex patient ; chronic disease ; clinical data ; summary statistics

Type d'article

scientifique

Domaine

Economie et Services
Ingénierie et Architecture

Ecole

HEIA-FR
HEG-VS

Institut

iCoSys- Institut d’intelligence artificielle et systèmes complexes
Institut Informatique de gestion

Note

HENNEBERT, Jean est chercheur à la HEIA-FR, HES-SO, depuis 2011.

Le document apparaît dans

Articles scientifiques
Global

Résumé

Détails

Actions

PDF