open access publication

Article, 2024

End-to-end volumetric segmentation of white matter hyperintensities using deep learning

Computer Methods and Programs in Biomedicine, ISSN 1872-7565, 0169-2607, Volume 245, Page 108008, 10.1016/j.cmpb.2024.108008

Contributors

Farkhani, Sadaf 0000-0002-6911-3586 (Corresponding author) [1] Demnitz, Naiara 0000-0002-1481-2398 [1] Boraxbekk, Carl-Johan 0000-0002-4458-6475 [1] [2] [3] [4] Lundell, Henrik M 0000-0002-7044-442X [1] [5] Siebner, Hartwig Roman 0000-0002-3756-9431 [1] [3] [4] Petersen, Esben Thade 0000-0001-7529-3432 [1] [5] Madsen, Kristoffer Hougaard 0000-0001-8606-7641 [1] [5]

Affiliations

  1. [1] Amager and Hvidovre Hospital
  2. [NORA names: Capital Region of Denmark; Hospital; Denmark; Europe, EU; Nordic; OECD];
  3. [2] Bispebjerg Hospital
  4. [NORA names: Capital Region of Denmark; Hospital; Denmark; Europe, EU; Nordic; OECD];
  5. [3] Copenhagen University Hospital
  6. [NORA names: Capital Region of Denmark; Hospital; Denmark; Europe, EU; Nordic; OECD];
  7. [4] University of Copenhagen
  8. [NORA names: KU University of Copenhagen; University; Denmark; Europe, EU; Nordic; OECD];
  9. [5] Technical University of Denmark
  10. [NORA names: DTU Technical University of Denmark; University; Denmark; Europe, EU; Nordic; OECD]

Abstract

BACKGROUND AND OBJECTIVES: Reliable detection of white matter hyperintensities (WMH) is crucial for studying the impact of diffuse white-matter pathology on brain health and monitoring changes in WMH load over time. However, manual annotation of 3D high-dimensional neuroimages is laborious and can be prone to biases and errors in the annotation procedure. In this study, we evaluate the performance of deep learning (DL) segmentation tools and propose a novel volumetric segmentation model incorporating self-attention via a transformer-based architecture. Ultimately, we aim to evaluate diverse factors that influence WMH segmentation, aiming for a comprehensive analysis of the state-of-the-art algorithms in a broader context. METHODS: We trained state-of-the-art DL algorithms, and incorporated advanced attention mechanisms, using structural fluid-attenuated inversion recovery (FLAIR) image acquisitions. The anatomical MRI data utilized for model training was obtained from healthy individuals aged 62-70 years in the Live active Successful Aging (LISA) project. Given the potential sparsity of lesion volume among healthy aging individuals, we explored the impact of incorporating a weighted loss function and ensemble models. To assess the generalizability of the studied DL models, we applied the trained algorithm to an independent subset of data sourced from the MICCAI WMH challenge (MWSC). Notably, this subset had vastly different acquisition parameters compared to the LISA dataset used for training. RESULTS: Consistently, DL approaches exhibited commendable segmentation performance, achieving the level of inter-rater agreement comparable to expert performance, ensuring superior quality segmentation outcomes. On the out of sample dataset, the ensemble models exhibited the most outstanding performance. CONCLUSIONS: DL methods generally surpassed conventional approaches in our study. While all DL methods performed comparably, incorporating attention mechanisms could prove advantageous in future applications with a wider availability of training data. As expected, our experiments indicate that the use of ensemble-based models enables the superior generalization in out-of-distribution settings. We believe that introducing DL methods in the WHM annotation workflow in heathy aging cohorts is promising, not only for reducing the annotation time required, but also for eventually improving accuracy and robustness via incorporating the automatic segmentations in the evaluation procedure.

Keywords

DL algorithms, DL approaches, DL methods, DL models, MICCAI, MRI data, MWSC, WHM, WMH load, WMH segmentation, accuracy, acquisition, acquisition parameters, age, age cohorts, aged individuals, agreement, algorithm, analysis, anatomical MRI data, annotation, annotation procedure, annotation time, annotation workflow, applications, approach, architecture, attention, attention mechanism, automatic segmentation, availability, availability of training data, background, bias, brain, brain health, challenges, changes, cohort, comprehensive analysis, context, conventional approaches, data, dataset, deep learning, detection, detection of white matter hyperintensities, diverse factors, end-to-end, ensemble, ensemble model, ensemble-based model, error, evaluation, evaluation procedure, experiments, expert performance, factors, fluid-attenuated inversion recovery, function, generalizability, generalization, health, healthy aged individuals, healthy individuals, high-dimensional neuroimaging, hyperintensities, image acquisition, images, impact, improved accuracy, individuals, inter-rater agreement, inversion recovery, learning, lesion volume, level of inter-rater agreement, levels, load, loss function, manual annotation, mechanism, method, model, model training, monitoring, monitoring changes, neuroimaging, outcomes, parameters, pathology, performance, performance of deep learning, potential sparsity, procedure, recovery, robustness, sample dataset, samples, segmentation model, segmentation of white matter hyperintensities, segmentation outcomes, segmentation performance, segments, self-attention, state-of-the-art, state-of-the-art DL algorithms, state-of-the-art algorithms, study, successful aging, superior generalization, time, training, training algorithm, training data, transformer-based architectures, volume, weighted loss function, white matter hyperintensities, white-matter pathology, workflow, years

Funders

  • Lundbeck Foundation
  • Capital Region of Denmark

Data Provider: Digital Science