Article, 2024

A Pipeline for Evaluation of Machine Learning/Artificial Intelligence Models to Quantify Programmed Death Ligand 1 Immunohistochemistry

Laboratory Investigation, ISSN 1530-0307, 0023-6837, Volume 104, 6, Page 102070, 10.1016/j.labinv.2024.102070

Contributors

Knudsen, Beatrice S 0000-0002-7589-7591 (Corresponding author) [1] [2] Jadhav, Alok [3] Perry, Lindsey J [1] Thagaard, Jeppe 0000-0003-0201-2243 [4] Deftereos, Georgios [5] Ying, Jian [1] Brintz, Ben J [1] Zhang, Wei (Corresponding author) [1] [2]

Affiliations

  1. [1] University of Utah
  2. [NORA names: United States; America, North; OECD];
  3. [2] Huntsman Cancer Institute
  4. [NORA names: United States; America, North; OECD];
  5. [3] Arup Group (United States)
  6. [NORA names: United States; America, North; OECD];
  7. [4] Visiopharm A/S, Hoersholm, Denmark.
  8. [NORA names: Other Companies; Private Research; Denmark; Europe, EU; Nordic; OECD];
  9. [5] University of California, San Francisco
  10. [NORA names: United States; America, North; OECD]

Abstract

Immunohistochemistry (IHC) is used to guide treatment decisions in multiple cancer types. For treatment with checkpoint inhibitors, programmed death ligand 1 (PD-L1) IHC is used as a companion diagnostic. However, the scoring of PD-L1 is complicated by its expression in cancer and immune cells. Separation of cancer and noncancer regions is needed to calculate tumor proportion scores (TPS) of PD-L1, which is based on the percentage of PD-L1-positive cancer cells. Evaluation of PD-L1 expression requires highly experienced pathologists and is often challenging and time-consuming. Here, we used a multi-institutional cohort of 77 lung cancer cases stained centrally with the PD-L1 22C3 clone. We developed a 4-step pipeline for measuring TPS that includes the coregistration of hematoxylin and eosin, PD-L1, and negative control (NC) digital slides for exclusion of necrosis, segmentation of cancer regions, and quantification of PD-L1+ cells. As cancer segmentation is a challenging step for TPS generation, we trained DeepLab V3 in the Visiopharm software package to outline cancer regions in PD-L1 and NC images and evaluated the model performance by mean intersection over union (mIoU) against manual outlines. Only 14 cases were required to accomplish a mIoU of 0.82 for cancer segmentation in hematoxylin-stained NC cases. For PD-L1-stained slides, a model trained on PD-L1 tiles augmented by registered NC tiles achieved a mIoU of 0.79. In segmented cancer regions from whole slide images, the digital TPS achieved an accuracy of 75% against the manual TPS scores from the pathology report. Major reasons for algorithmic inaccuracies include the inclusion of immune cells in cancer outlines and poor nuclear segmentation of cancer cells. Our transparent and stepwise approach and performance metrics can be applied to any IHC assay to provide pathologists with important insights on when to apply and how to evaluate commercial automated IHC scoring systems.

Keywords

Deeplab v3, NC cases, NC images, PD-L1, PD-L1 expression, PD-L1 staining, PD-L1+ cells, Programmed death-ligand 1 immunohistochemistry, TPS scores, Union, Visiopharm, accuracy, algorithm inaccuracy, assay, cancer, cancer cases, cancer cells, cancer segmentation, cancer types, cancerous regions, cases, cells, checkpoint, checkpoint inhibitors, clones, cohort, companion, control, coregistration, death-ligand 1, decision, digital slides, eosin, evaluation, evaluation of PD-L1 expression, exclusion, experienced pathologists, expression, generation, hematoxylin and eosin, images, immune cells, immunohistochemistry, immunohistochemistry assay, inaccuracy, inclusion, inhibitors, intersection, ligand 1, lung, lung cancer cases, mIoU, manual outlining, metrics, model, model performance, multi-institutional cohort, multiple cancer types, necrosis, negative control, noncancer, outline, package, pathologists, pathology, pathology reports, percentage, performance, performance metrics, pipeline, programmed death-ligand 1, proportion score, quantification, region, reports, scores, segmentation of cancerous regions, segments, separation, slide images, slides, software package, tiles, time-consuming, treatment, treatment decisions, tumor proportion score, type, whole slide images

Funders

  • Huntsman Cancer Institute

Data Provider: Digital Science