Hybrid human-machine classification system for cultural heritage data

Shabani, Shaban; Sokhn, Maria; Schuldt, Heiko

doi:10.1145/3423323.3423413

Hybrid human-machine classification system for cultural heritage data

Shabani, Shaban; Sokhn, Maria; Schuldt, Heiko

2020

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

The advancement of digital technologies has helped cultural heritage organizations to digitize their data collections and improve the accessibility via online platforms. These platforms have enabled citizens to contribute to the process of digital preservation of cultural heritage by sharing documents and their knowledge. However, many historical datasets have problems due to incomplete metadata. To solve this issue, cultural heritage organizations heavily depend on domain experts. In this paper, we address the issue of completing the metadata of historical digital collections. For this, we introduce a new hybrid human-machine model. This model jointly integrates predictions of a deep multi-input model and inferred labels from multiple crowd judgements. The multi-input model uses visual features extracted from the images and textual features from the metadata, complemented with Wikipedia classes of concepts extracted in the text. On the crowd answer aggregation, our method considers the workers' reliability scores. This score is based on the performance of workers' task history and their performance in our task. We have applied our hybrid approach to a culture heritage platform and the evaluations show that it outperforms both deep learning and crowdsourcing when applied individually.

Détails

Titre

Hybrid human-machine classification system for cultural heritage data

Auteur(s)/ trice(s)

Shabani, Shaban (Haute école de gestion Arc, HES-SO Haute Ecole Spécialisée de Suisse Occidentale)
Sokhn, Maria (Haute école de gestion Arc, HES-SO Haute Ecole Spécialisée de Suisse Occidentale)
Schuldt, Heiko (Mathematics and computer science, University of Basel, Basel, Switzerland)

Date

2020-10

Publié dans

Proceedings of the 2nd Workshop on Structuring and Understanding of Multimedia heritAge Contents (SUMAC'2020)

Publié par

Seattle, USA, 12 October 2020

Pagination & équivalents

Pp. 49–56

Présenté à

2nd Workshop on Structuring and Understanding of Multimedia heritAge Contents, Seattle, USA, 2020-10-12, 2020-10-12

ISBN

9781450381550

DOI

https://doi.org/10.1145/3423323.3423413

Mots-clés (libres)

cultural heritage ; deep learning ; crowdsourcing ; hybrid human-machine information systems

Type de papier

published full paper

Domaine

Economie et Services

Ecole

HEG Arc

Institut

IDO - Institut de Digitalisation des organisations

Note

Due to the COVID-19 outbreak, the 2nd Workshop on Structuring and Understanding of Multimedia heritAge Contents conference venue in Seattle was cancelled. The proceedings of the online conference are however published according to the original schedule

Le document apparaît dans

Documents de conférences
Global

Hybrid human-machine classification system for cultural heritage data

Résumé

Détails

Actions

PDF