open access publication

Article, 2024

The effect of harmonization on the variability of PET radiomic features extracted using various segmentation methods

Annals of Nuclear Medicine, ISSN 1864-6433, 0914-7187, Volume 38, 7, Pages 493-507, 10.1007/s12149-024-01923-7

Contributors

Hosseini, Seyyed Ali 0000-0001-7542-7541 [1] [2] Shiri, Isaac 0000-0002-5735-0736 [3] Ghaffarian, Pardis [4] Hajianfar, Ghasem 0000-0001-5359-2407 [3] Avval, Atlas Haddadi 0000-0002-3896-7810 [5] Seyfi, Milad 0000-0001-5908-6150 [6] Servaes, Stijn [1] [2] Rosa-Neto, Pedro Rosa 0000-0001-9116-1376 [1] [2] Zaidi, Habib 0000-0001-7559-5297 (Corresponding author) [3] [7] [8] [9] Ay, Mohammad Reza [6]

Affiliations

  1. [1] Douglas Mental Health University Institute
  2. [NORA names: Canada; America, North; OECD];
  3. [2] McGill University
  4. [NORA names: Canada; America, North; OECD];
  5. [3] University Hospital of Geneva
  6. [NORA names: Switzerland; Europe, Non-EU; OECD];
  7. [4] Shahid Beheshti University of Medical Sciences
  8. [NORA names: Iran; Asia, Middle East];
  9. [5] Mashhad University of Medical Sciences
  10. [NORA names: Iran; Asia, Middle East];

Abstract

PurposeThis study aimed to examine the robustness of positron emission tomography (PET) radiomic features extracted via different segmentation methods before and after ComBat harmonization in patients with non-small cell lung cancer (NSCLC).MethodsWe included 120 patients (positive recurrence = 46 and negative recurrence = 74) referred for PET scanning as a routine part of their care. All patients had a biopsy-proven NSCLC. Nine segmentation methods were applied to each image, including manual delineation, K-means (KM), watershed, fuzzy-C-mean, region-growing, local active contour (LAC), and iterative thresholding (IT) with 40, 45, and 50% thresholds. Diverse image discretizations, both without a filter and with different wavelet decompositions, were applied to PET images. Overall, 6741 radiomic features were extracted from each image (749 radiomic features from each segmented area). Non-parametric empirical Bayes (NPEB) ComBat harmonization was used to harmonize the features. Linear Support Vector Classifier (LinearSVC) with L1 regularization For feature selection and Support Vector Machine classifier (SVM) with fivefold nested cross-validation was performed using StratifiedKFold with ‘n_splits’ set to 5 to predict recurrence in NSCLC patients and assess the impact of ComBat harmonization on the outcome.ResultsFrom 749 extracted radiomic features, 206 (27%) and 389 (51%) features showed excellent reliability (ICC ≥ 0.90) against segmentation method variation before and after NPEB ComBat harmonization, respectively. Among all, 39 features demonstrated poor reliability, which declined to 10 after ComBat harmonization. The 64 fixed bin widths (without any filter) and wavelets (LLL)-based radiomic features set achieved the best performance in terms of robustness against diverse segmentation techniques before and after ComBat harmonization. The first-order and GLRLM and also first-order and NGTDM feature families showed the largest number of robust features before and after ComBat harmonization, respectively. In terms of predicting recurrence in NSCLC, our findings indicate that using ComBat harmonization can significantly enhance machine learning outcomes, particularly improving the accuracy of watershed segmentation, which initially had fewer reliable features than manual contouring. Following the application of ComBat harmonization, the majority of cases saw substantial increase in sensitivity and specificity.ConclusionRadiomic features are vulnerable to different segmentation methods. ComBat harmonization might be considered a solution to overcome the poor reliability of radiomic features.

Keywords

Bay, ComBat harmonization, ConclusionRadiomics features, Fuzzy-C-Means, GLRLM, K-means (KM, L1 regularization, Linear Support Vector Classifier, LinearSVC, MethodsWe, PET radiomic features, PurposeThis, PurposeThis study, SVM, accuracy, active contours, applications, bin width, biopsy-proven non-small cell lung cancer, cancer, care, cases, cell lung cancer, classifier, combat, contour, cross-validation, decomposition, delineation, discretization, effect, effect of harmonics, emission tomography, empirical Bayes, excellent reliability, family, feature family, features, filter, findings, first-order, fixed bin width, harmonics, image discretization, images, impact, increase, iterative thresholding, learning outcomes, local active contours, lung cancer, machine, machine learning outcomes, manual contouring, manual delineation, method, method variations, nested cross-validation, non-small cell lung cancer, non-small cell lung cancer patients, outcomes, parts, patients, performance, poor reliability, positron emission tomography, positron emission tomography imaging, positron emission tomography scan, predicting recurrence, radiomic features, recurrence, region-growing, regularization, reliability, reliability of radiomic features, robust features, robustness, routine part, scanning, segmentation method, segmentation techniques, segments, selection, sensitivity, solution, specificity, study, support vector classifier, technique, threshold, tomography, variables, variation, vector classifier, watershed, watershed segmentation, wavelet, wavelet decomposition, width

Funders

  • Swiss National Science Foundation

Data Provider: Digital Science