Cancer Science & Research

Cancer Science & Research

Open Access
ISSN: 2639-8478
Original Research Article

Accurate Prediction of Breast Cancer Survival via PM Generative AI

Authors: Philip de Melo.

DOI: 10.33425/2639-8478.1121


Abstract

Accurate classification of breast cancer remains one of the most important challenges in healthcare artificial intelligence because of the complex biologic heterogeneity of tumor progression and the imbalance frequently observed between malignant and nonmalignant patient cohorts. Conventional machine learning algorithms trained on highly imbalanced clinical datasets often demonstrate strong overall accuracy while underperforming in detection of minority malignant cases, thereby limiting diagnostic sensitivity and early intervention capability. The present study proposes a novel augmentation-based breast cancer classification framework that combines machine learning with synthetic physiologic and diagnostic feature generation to improve classification performance and cohort balance. The proposed methodology utilizes real breast cancer patient records as biologic templates while generating clinically coherent synthetic cancer cases incorporating controlled variability in tumor-associated characteristics, including lesion size, cellular morphology, biomarker expression, lymphatic involvement, tumor stage, and physiologic progression indicators. The resulting balanced dataset was subsequently analyzed using Random Forest and XGBoost classifiers to evaluate improvements in malignant case discrimination and overall classification performance.

Comparative analysis demonstrated that conventional Random Forest classification was affected by cohort imbalance and preferentially classified majority nonmalignant cases. In contrast, the XGBoost boosting framework demonstrated improved sensitivity for malignant tumor detection through iterative optimization of previously misclassified observations. Further improvement was achieved following implementation of the proposed augmentation- based balancing strategy, which substantially enhanced minority malignant case representation during machine learning training. The augmented classifier demonstrated improved confusion matrix performance, enhanced Receiver Operating Characteristic (ROC) discrimination, and increased sensitivity for malignant breast cancer detection while preserving clinically realistic biologic variability within the synthetic cohort. The findings suggest that physiologically coherent synthetic augmentation may provide an effective strategy for improving machine learning classification performance in breast cancer analytics and may represent a promising direction for future healthcare artificial intelligence systems involving rare, heterogeneous, or clinically imbalanced disease populations.

View / Download PDF
Citation: Philip de Melo. Accurate Prediction of Breast Cancer Survival via PM Generative AI. Cancer Sci Res. 2026; 9(1). DOI: 10.33425/2639-8478.1121
Editor-in-Chief
Lee Guek Eng
Lee Guek Eng
Department of Medical Oncology | National University of Singapore

View full editorial board →
Journal Metrics
Impact Factor 2.4*
Acceptance Rate 75%
Time to first decision 6-10 Days
Submission to acceptance 12-15 Days