VOPROSY ONKOLOGII, cilt.7, sa.2, 2025 (Scopus)
Aim: Survival prediction in brain cancer is crucial
for guiding treatment decisions and improving patient outcomes. This
study aimed to develop a novel prognostic model for brain cancer patient
survival using advanced supervised machine learning classification approaches.
The model incorporated demographic, clinical, immunohistochemical, and genomic
characteristics.
Methods: A retrospective examination was conducted on a cohort of 149
patients with intracranial tumors who underwent operative procedures.
Systematic collection of demographic and clinical data was performed. Tumor and
adjacent healthy tissues underwent histopathological scrutiny and
immunohistochemical analysis for GST-P, GST-T, GST-M, CYP1A1, CYP1B1, MDR, and
p53 expression. Staining intensity was graded, and genomic DNA from tumor tissues
was analyzed for GSTM1, GSTT1, and p53 genotypes. Five categories of input
features, comprising demographic, clinical, protein expression profiles, and
genotypic information, were selected for model development. Totally 15 models
were constructed via decision tree, naïve bayes and SVM, using Python, and
compared for accuracy, precision, sensitivity, and F-measure metrics.
Results: The overall postoperative survival rate was 65%. Protein
expression analysis revealed significant differences between cancerous and
normal tissues for GST-P, GST-T, GST-M, CYP1A1, CYP1B1, MDR, and p53 genes.
Genotyping indicated that the GST-M1 null genotype might be associated with
brain tumor development. Model evaluation demonstrated a decision tree accuracy
of 84%, the highest among models incorporating demographic, clinical,
immunohistochemical, and genetic information. Precision and sensitivity varied,
but the decision tree showed acceptable values for accuracy, precision, and
sensitivity.
Conclusions: Decision trees may be suitable for predicting
brain cancer survival particularly in limited datasets, compared to SVM and
Naive Bayes when using demographic, clinical, immunohistochemical, and
genotypic variables.