Effect of the named entity recognition and sliding window on the HONcode automated detection of HONcode criteria for mass health online content

Boyer, Celia; Dolamic, Ljiljana; Ruch, Patrick; Falquet, Gilles

doi:10.5220/0005644301510158

Effect of the named entity recognition and sliding window on the HONcode automated detection of HONcode criteria for mass health online content

Boyer, Celia; Dolamic, Ljiljana; Ruch, Patrick; Falquet, Gilles

2016

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Cite

Résumé

The Health On the Net’s Foundation (HON) Code of Conduct, HONcode, is the oldest and the most used ethical and trustworthy code for medical and health related information available on the Internet. Until recently, websites voluntarily applying for the HONcode seal were evaluated manually by an expert medical team according to 8 principles, referred to as criteria, and associated published guidelines. In the scope of the European project Kconnect, HON is developing an automated system to identify the 8 HONcode criteria within health webpages. When the research on the development of such a system evolved from simple algorithmic testing to a real full-content setting, it revealed a number of issues. The preceding study consisted in taking a set of 27 health-related websites and having them assessed for their compliance to each of the 8 HONcode criterion, first manually by senior HONcode experts, and then through supervised machine learning by the automated system. The results showed disc repancies mainly for two criteria: “submerged content” under the Complementarity criterion and “extremely low recall” under the Date Attribution criterion. In this article, the authors investigate different approaches to solve the problems related to each of these criteria, namely a customized Named Entity Recognition Model instead of a machine learning component for Date Attribution, and a sliding window instead of the whole document as a unit of detection for Complementarity. The results obtained show that the newly adapted automated system greatly improves accuracy: 74% vs. 41% for the Date Attribution criterion and 74% vs. 22% for the Complementarity criterion.

Détails

Titre

Effect of the named entity recognition and sliding window on the HONcode automated detection of HONcode criteria for mass health online content

Auteur(s)/ trice(s)

Boyer, Celia (Health On the Net Foundation, Switzerland)
Dolamic, Ljiljana (Health On the Net Foundation, Switzerland)
Ruch, Patrick (Haute école de gestion de Genève, HES-SO Haute Ecole Spécialisée de Suisse Occidentale)
Falquet, Gilles (University of Geneva, Switzerland)

Date

2016-02

Publié dans

Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies

Volume

2016

Publié par

Rome, Italy, 21-23 February 2016

Pagination & équivalents

8 p.

Présenté à

9th International Joint Conference on Biomedical Engineering Systems and Technologies, Rome, Italy, 21/02/2016 / 23/02/2016

DOI

https://doi.org/10.5220/0005644301510158

Collection et n°

HEALTHINF, vol. 5

Mots-clés (libres)

HONCODE ; automated detection ; manual detection ; machine learning ; named entity recognition

Type de papier

full paper

Domaine

Economie et Services

Ecole

HEG - Genève

Institut

CRAG - Centre de Recherche Appliquée en Gestion

Le document apparaît dans

Documents de conférences
Global

Ressource(s) externe(s)

https://www.scitepress.org/Papers/2016/56443/56443.pdf

Effect of the named entity recognition and sliding window on the HONcode automated detection of HONcode criteria for mass health online content

Résumé

Détails

Actions

PDF