Abstract

Precision medicine for papillary thyroid carcinoma (PTC) variants: A machine learning approach to prognosis and treatment guidance.

Author
person Sakhr Abdulsalam Alshwayyat Jordan University of Science and Technology, Irbid, Jordan info_outline Sakhr Abdulsalam Alshwayyat, Abdalwahab Alenezy, Wafa Asha, Mustafa Alshwayyat, Haya Kamal, Owais Ghammaz, Tala Abdulsalam Alshwayyat
Full text
Authors person Sakhr Abdulsalam Alshwayyat Jordan University of Science and Technology, Irbid, Jordan info_outline Sakhr Abdulsalam Alshwayyat, Abdalwahab Alenezy, Wafa Asha, Mustafa Alshwayyat, Haya Kamal, Owais Ghammaz, Tala Abdulsalam Alshwayyat Organizations Jordan University of Science and Technology, Irbid, Jordan, King Hussein Cancer Center, Amman, Jordan, Jordan University of Science and Technology, Aydoun-Irbid, Jordan, Jordan University of Science & Technology, Irbid, Jordan, Jordan University of Science and Technology, Jordan, Irbid, Jordan Abstract Disclosures Research Funding No funding sources reported Background: PTC is a common endocrine cancer with a good prognosis, but aggressive subtypes, such as Hürthle cell (HCC) and columnar cell variants (CCV), pose challenges due to their higher recurrence and metastasis rates. To provide personalized care, we applied machine learning to evaluate the treatment effectiveness and develop precise prognostic models for PTC variants. Methods: The Surveillance, Epidemiology, and End Results (SEER) database provided the data used for this study’s analysis (2000–2019). Patients who met any of the following criteria were excluded: diagnosis not confirmed by histology, previous history of cancer or with other concurrent malignancies, and unknown data. To identify prognostic variables, we conducted Cox regression analysis and constructed prognostic models using machine learning (ML) algorithms to predict the 5-year survival. Patient records were randomly divided into training (70 %) and validation (30 %) sets. A validation method incorporating the area under the curve (AUC) of the receiver operating characteristic curve was used to validate the accuracy and reliability of the ML models. Results: The study population comprised 3690 patients. Among them 3180 patients with CCV and 510 patients with HCC, respectively. Most patients (62.8%) were 45 years or older, with a median age of 52 years. A total of 56.9% of patients had a tumor size greater than 2 cm, with a median tumor size of 3.1 cm. The largest racial group was white, comprising 83.8% of the cases, and 11.8% of the cases were Asian. Most cases were regional (53.4%, n=1969), followed by localized (38.3%, n= 1413). Multivariate Cox regression analysis revealed that N1 negatively affected the survival of HCC patients. CCV has a favorable prognosis after surgery, radiotherapy, or total thyroidectomy. Poor prognosis in CCV is associated with black race, large tumor size, and T4 stage. Improved survival in the localized/regional stage and decreased survival with male sex, older age, distant metastasis, and advanced AJCC stage in both PTC subtypes. ML models revealed that the random forest classifier (RFC) and K-Nearest Neighbors (KNN) accurately predicted outcomes, followed by Logistic Regression (LR) models. The highest contributing factors were AJCC staging, tumor size, and T aspect of TNM staging. Conclusions: Our study offers a method for evaluating and treating patients with PTC variants. The machine learning model that we created serves as a useful and personalized resource to aid in clinical decision-making processes. Machine learning (ML) algorithms performance. ML Algorithm Accuracy Precision Recall F1 score AUC HCC LR 67.65% 62.50% 74.47% 67.96% 0.7097 RF 81.37% 78.00% 82.98% 80.41% 0.8832 KNN 75.49% 71.15% 78.72% 74.75% 0.7841 CCV LR 67.14% 64.31% 73.23% 68.48% 0.7277 RF 82.70% 80.67% 84.84% 82.70% 0.9073 KNN 76.42% 72.60% 82.90% 77.41% 0.8244

3 organizations