Power efficient hardware acceleration of genomic algorithms

Wertenbroek, Rick

Power efficient hardware acceleration of genomic algorithms

Wertenbroek, Rick

2025

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

Genome sequence analysis plays an essential role in scientific and medical research, with applications spanning disease analysis, personalized medicine, epidemiology, forensics, evolutionary biology, and population genetics. Recent advancements in DNA sequencing technologies have led to an explosion in data generation, far outpacing the growth of computational power. As large-scale projects, such as the UK Biobank, which includes 500,000 sequenced individuals and associated biomedical data, become increasingly common, the computational burden intensifies, exacerbating existing bottlenecks and significantly raising energy consumption. Addressing these challenges is crucial to ensure that genomic research remains both scalable and sustainable. This thesis focuses on accelerating genomic data processing while reducing its overall energy footprint. Several strategies are explored to achieve this goal. First, we introduce a novel genotype compression format that reduces storage requirements and enhances computational efficiency by enabling faster data access and allowing direct processing of compressed data, a concept known as "compressive genomics". We then present a parallelized version of the positional Burrows-Wheeler transform and associated algorithms, designed to leverage modern multi-core processors and accelerate genetic applications such as haplotype estimation and population structure analysis. Additionally, we propose a cloud-distributed method capable of efficiently processing population-scale whole-genome sequencing data, improving the statistical phasing of hundreds of thousands of genomes at petabyte scale. Finally, we introduce innovative hardware in the form of computational storage devices, which not only store data but are also capable of processing it locally. We demonstrate their potential for acceleration and energy efficiency by designing a computational storage device specifically for genomics. This device integrates a complete genomic analysis pipeline, from DNA sequence alignment to variant calling, directly within the storage hardware. This integration minimizes data movement, reduces energy consumption, and provides acceleration opportunities. By combining advances in compression, algorithmic optimization, could-scale processing, and hardware architecture innovation, this work offers a comprehensive approach to accelerating genomic data analysis while improving energy efficiency. These contributions not only enable faster and deeper genomic research but also lay the foundation for sustainable, large-scale genomics studies.

Détails

Titre

Power efficient hardware acceleration of genomic algorithms

Auteur(s)/ trice(s)

Wertenbroek, Rick (Université de Lausanne, Lausanne, Switzerland)

Directeur(s)/ trice(s)

Xenarios, Ioannis director (University of Lausanne, Lausanne, Switzerland)
Thoma, Yann director (School of Engineering and Management Vaud, HES-SO University of Applied Sciences and Arts Western Switzerland)
Delaneau, Olivier director (Regeneron, Tarrytown, NY, USA)

Date

2025-06

Publié par

Lausanne, Switzerland, UNIL

Pagination & équivalents

200 p.

Mots-clés (libres)

genomics ; genetics ; acceleration ; energy efficiency ; hardware architecture ; genotype compression ; haplotype estimation ; cloud computing ; computational storage

Domaine

Ingénierie et Architecture

Ecole

HEIG-VD

Institut

ReDS - Reconfigurable & embedded Digital Systems

Le document apparaît dans

Masters et Doctorats
Global

Type de travail

Thèse

Ressource(s) externe(s)

UNIL

Power efficient hardware acceleration of genomic algorithms

Résumé

Détails

Actions

PDF