Abstract

A machine learning algorithm based on multi-omics biomarkers for the detection of tumor microsatellite instability.

Author
person Kyle C Strickland Labcorp Oncology, Duke University Medical Center, Duke Cancer Institute, Department of Pathology, Durham, NC info_outline Kyle C Strickland, Sarabjot Pabla, Zachary D Wallen, Mary K Nesline, Heidi Chwan Ko, Rebecca A. Previs, Jeffrey M Conroy, Pratheesh Sathyan, Kamal S. Saini, Taylor J. Jensen, Brian Caveney, Marcia Eisenberg, Shakti H Ramkissoon, Eric A Severson
Full text
Authors person Kyle C Strickland Labcorp Oncology, Duke University Medical Center, Duke Cancer Institute, Department of Pathology, Durham, NC info_outline Kyle C Strickland, Sarabjot Pabla, Zachary D Wallen, Mary K Nesline, Heidi Chwan Ko, Rebecca A. Previs, Jeffrey M Conroy, Pratheesh Sathyan, Kamal S. Saini, Taylor J. Jensen, Brian Caveney, Marcia Eisenberg, Shakti H Ramkissoon, Eric A Severson Organizations Labcorp Oncology, Duke University Medical Center, Duke Cancer Institute, Department of Pathology, Durham, NC, Labcorp Oncology, Buffalo, NY, Labcorp Oncology, Durham, NC, Labcorp Oncology, Duke University Medical Center, Duke Cancer Institute, Department of Obstetrics & Gynecology, Division of Gynecologic Oncology, Durham, NC, Illumina Inc., San Diego, CA, Fortrea Inc., Durham, NC, Labcorp, Burlington, NC, Labcorp Oncology, Wake Forest Comprehensive Cancer Center and Department of Pathology, Wake Forest School of Medicine, Durham, NC Abstract Disclosures Research Funding No funding sources reported Background: Microsatellite instability (MSI) occurs via defects in mismatch repair (MMR) and is characterized by high tumor mutational burden (TMB) and a robust inflammatory response. Accurate MSI-high classification of immune-evasive tumors is vital due to the effectiveness of immune checkpoint inhibitor (ICI) therapy, such as pembrolizumab, which is FDA approved for all MSI-high tumors irrespective of anatomic origin. Detection methods of MSI-status vary widely, including immunohistochemistry (IHC), polymerase chain reaction (PCR), and next-generation sequencing (NGS). We developed a machine learning (ML) model to predict MSI status in patients with solid tumors using comprehensive genomic and immune profiling (CGIP), independent of direct sequencing data from microsatellite sites. Methods: We analyzed samples from 1,838 patients with colorectal cancer (CRC) by CGIP, which included DNA panel testing (523 genes) for pathogenic single nucleotide variants (pSNV) and determination of TMB (mut/Mb), and RNA sequencing (397 genes) for gene expression (GEX). The Boruta algorithm was used to select key genomic and GEX changes associated with MSI status. A distributed gradient boosting algorithm created a predictive model that was trained on 70% of the cohort. The model was tested on the remaining CRC cohort (Test), and in the PanCancer Atlas CRCs (TCGA), endometrial adenocarcinomas (EMCA), and CRC with indeterminate MSI testing by CGIP. Performance of the trained model was assessed using sensitivity, specificity, and positive (PPV) and negative (NPV) predictive value. Results: Feature selection identified 79 genes with pSNVs, 63 GEX changes, and TMB as informative for MSI prediction. The model showed high predictive accuracy in differentiating MSI-H tumors. Of the 39 cases that failed MSI component of CGIP (Indeterminate), 17 had MMR IHC results available, of which 1 demonstrated loss of MLH1/PMS2 and the other 16 showed intact (normal) expression. Testing of this cohort identified 3 cases as MSI-high, at least one of which was confirmed by IHC. No cases with intact MMR protein IHC were identified as MSI-high by the algorithm. Conclusions: Our ML-driven approach accurately assessed MSI status of CRC and EMCA using CGIP data. This approach also identified potential cases with MSI-high status where direct sequencing of microsatellites failed. This study highlights a method to identify patients with potential MSI-high status for orthogonal screening with the MSI component of a test fails. Cohort Sample Size Median Age (y) % Female % MSI-High by CGIP Sensitivity Specificity PPV NPV Training 1286 65 44% 6.5% n/a n/a n/a n/a Test 552 64.9 46% 7.6% 95.2 99.6 95.2 99.6 TCGA 367 68 47% 13.3% 98.1 92 92 98.1 EMCA 394 68.25 100% 15.9% 95.8 93.7 80.8 98.8 Indeterminate 39 65 60% n/a n/a n/a n/a n/a

7 organizations

1 drug

1 target

Organization
Labcorp Oncology
Organization
Illumina Inc.
Organization
Fortrea Inc.