Abstract

This paper presents a robust method for classifying the types of medical images found in figures of the biomedical literature, using a fusion of visual and textual information. A deep convolutional network is trained to discriminate among 31 classes, including compound figures, diagnostic image types, and generic illustrations, while a shallow convolutional network analyzes the captions paired with the images. Various fusion methods and data augmentation approaches are evaluated. The proposed system is validated on the ImageCLEF 2013 classification task, substantially improving the previous best accuracy from 83.5% to 93.7%.
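The abstract mentions that several fusion methods are compared but does not specify them here. One common choice for combining a visual and a textual classifier is late (score-level) fusion, where the two networks' class probabilities are averaged. The sketch below illustrates that idea only; the function names and the `alpha` weight are illustrative assumptions, not the paper's actual method.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def late_fusion(visual_logits, text_logits, alpha=0.5):
    """Weighted average of the two branches' class probabilities.

    alpha weights the visual branch, (1 - alpha) the textual one.
    Both inputs are (n_samples, n_classes) arrays of raw scores.
    """
    p_visual = softmax(visual_logits)
    p_text = softmax(text_logits)
    return alpha * p_visual + (1 - alpha) * p_text

# Toy example: 2 figures, 31 classes (as in the ImageCLEF 2013 task).
rng = np.random.default_rng(0)
vis_scores = rng.normal(size=(2, 31))   # stand-in for the deep CNN's outputs
txt_scores = rng.normal(size=(2, 31))   # stand-in for the shallow text CNN's outputs
fused = late_fusion(vis_scores, txt_scores, alpha=0.6)
pred = fused.argmax(axis=1)             # fused class predictions
```

Because both branches emit proper probability distributions, the weighted average is itself a distribution, so the fused scores can be ranked or thresholded like any classifier output.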
