PROCESS data infrastructure and data services

Cushing, Reginald; Valkering, Onno; Belloum, Adam; Madougou, Souley; Bobak, Martin; Habala, Ondrej; Tran, Viet; Meizner, Jan; Nowakowski, Piotr; Graziani, Mara; Müller, Henning

doi:10.31577/cai_2020_4_724

2021

Abstract

Due to energy limitations and high operational costs, exascale computing is unlikely to be achieved by one or two datacentres; it will require many more. A simple calculation that aggregates the computing power of the 2017 Top500 supercomputers reaches only 418 petaflops. Companies like Rescale, which claims 1.4 exaflops of peak computing power, describe their infrastructure as composed of 8 million servers spread across 30 datacentres. Any proposed solution to exascale computing challenges has to take these facts into consideration and should, by design, support the use of geographically distributed and likely independent datacentres. It should also consider, whenever possible, the co-allocation of storage with computation, as it would take about 3 years to transfer 1 exabyte over a dedicated 100 Gb Ethernet connection. This means we have to be smart about managing data that are increasingly geographically dispersed and spread across different administrative domains. As the natural setting of the PROCESS project is to operate within the European Research Infrastructure and serve the European research communities facing exascale challenges, it is important that the PROCESS architecture and solutions are well positioned within the European computing and data management landscape, namely PRACE, EGI, and EUDAT. In this paper we propose a scalable and programmable data infrastructure that is easy to deploy and can be tuned to support various data-intensive scientific applications.
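
The transfer-time figure in the abstract can be checked with back-of-the-envelope arithmetic. The sketch below assumes decimal units (1 exabyte = 10^18 bytes, 100 Gb/s = 10^11 bits per second), a fully utilized link with no protocol overhead, and a 365-day year. These assumptions and the function name are illustrative and are not taken from the paper.

```python
# Rough estimate of how long it takes to move 1 exabyte over a 100 Gb/s link.
# Assumptions (not from the paper): decimal units, 100% link utilization,
# no protocol overhead, 365-day year.

SECONDS_PER_YEAR = 365 * 24 * 3600


def transfer_time_years(num_bytes: float, link_bits_per_second: float) -> float:
    """Return the ideal transfer time in years for the given payload and link rate."""
    seconds = num_bytes * 8 / link_bits_per_second
    return seconds / SECONDS_PER_YEAR


EXABYTE = 10**18            # bytes
LINK_100_GBPS = 100 * 10**9  # bits per second

years = transfer_time_years(EXABYTE, LINK_100_GBPS)
print(f"{years:.2f} years")  # prints "2.54 years"
```

Under these idealized assumptions the transfer takes roughly 2.5 years, which is consistent with the abstract's rounded figure of 3 years; sustained real-world throughput is normally below line rate, so the actual time would be longer.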

Details

Title

PROCESS data infrastructure and data services

Author(s)

Cushing, Reginald (University of Amsterdam, Netherlands)
Valkering, Onno (University of Amsterdam, Netherlands)
Belloum, Adam (University of Amsterdam, Netherlands; Netherlands eScience Center, Netherlands)
Madougou, Souley (Netherlands eScience Center, Netherlands)
Bobak, Martin (Slovak Academy of Sciences, Bratislava, Slovakia)
Habala, Ondrej (Slovak Academy of Sciences, Bratislava, Slovakia)
Tran, Viet (Slovak Academy of Sciences, Bratislava, Slovakia)
Meizner, Jan (AGH University of Science and Technology, Krakow, Poland)
Nowakowski, Piotr (AGH University of Science and Technology, Krakow, Poland)
Graziani, Mara (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis))
Müller, Henning (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis))

Date

2021-01

Published in

Computing and Informatics

Volume

2020, vol. 39, no. 4, pp. 724-756

Pagination & equivalents

25 p.

DOI

https://doi.org/10.31577/cai_2020_4_724

ISSN

1335-9150

Keywords

exascale data management ; distributed file systems ; microservice architecture

Article Type

scientific

Faculty

Economie et Services

School

HEG-VS

Institute

Institut Informatique de gestion

Record Appears in

Scientific Articles
Global
