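Title: Overlapping group screening for binary cancer classification with TCGA high-dimensional genomic data
Speaker: Associate Professor 王价輝 (Department of Mathematics, National Chung Cheng University)
Time: November 2, 2023 (Thursday), 2:10 PM - 4:00 PM
Venue: B302A (Business and Management Building, Tamsui Campus)
Reception: November 2, 2023 (Thursday), 1:30 PM (Business and Management Building, B1102)
Abstract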
Precision medicine has become a global trend in medical development, and cancer diagnosis plays an important role in it. With accurate cancer diagnosis, patients can be given appropriate treatments that improve their survival. Since disease development involves a complex interplay among multiple factors, such as gene–gene interactions, cancer classification based on microarray gene expression profiling data is expected to be effective and has therefore attracted extensive attention in computational biology and medicine. However, building a diagnostic model from genomic data requires overcoming several problems, including the high-dimensional feature space and feature contamination. In this paper, we propose using the overlapping group screening (OGS) approach to build an accurate cancer diagnosis model and to predict, within the logistic regression framework, the probability that a patient falls into a given disease classification category. The proposal integrates gene pathway information into the procedure for identifying genes and gene–gene interactions associated with the classification of cancer outcome groups. We conduct a series of simulation studies comparing the predictive accuracy of the proposed method with that of several existing machine learning methods, and find that the proposed method performs better. We apply the proposed method to genomic data from The Cancer Genome Atlas (TCGA) on lung adenocarcinoma (LUAD), liver hepatocellular carcinoma (LIHC), and thyroid carcinoma (THCA) to establish accurate cancer diagnosis models. (This is joint work with Prof. Chen from the Institute of Statistical Science at Academia Sinica.)
Keywords: Cancer diagnosis; gene–gene interaction; logistic regression; overlapping group screening; precision medicine; TCGA.
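
Illustrative note: the abstract describes a logistic regression model whose predictors are genes and gene–gene interactions, with candidate interactions guided by pathway membership. The sketch below is not the OGS procedure itself, which the abstract does not specify; it only shows, under assumed toy inputs, how overlapping pathway groups can define candidate gene pairs and how a logistic model with interaction terms yields a class probability. All names (`pathway_pairs`, `predict_probability`, the gene and pathway labels, and the coefficients) are hypothetical.

```python
import math
from itertools import combinations


def pathway_pairs(pathways):
    """Collect gene pairs that share at least one pathway.

    Pathways may overlap (a gene can belong to several), so a set is
    used to avoid counting the same pair twice.
    """
    pairs = set()
    for genes in pathways.values():
        for a, b in combinations(sorted(genes), 2):
            pairs.add((a, b))
    return sorted(pairs)


def predict_probability(x, intercept, main, inter):
    """Logistic probability with main effects and gene-gene interactions.

    x     : dict mapping gene -> expression value
    main  : dict mapping gene -> main-effect coefficient
    inter : dict mapping (gene_a, gene_b) -> interaction coefficient
    """
    eta = intercept
    eta += sum(beta * x[g] for g, beta in main.items())
    eta += sum(gamma * x[a] * x[b] for (a, b), gamma in inter.items())
    return 1.0 / (1.0 + math.exp(-eta))


# Hypothetical toy inputs (not taken from TCGA or from the talk).
pathways = {"P1": {"g1", "g2", "g3"}, "P2": {"g3", "g4"}}  # g3 is shared
pairs = pathway_pairs(pathways)

x = {"g1": 0.5, "g2": -1.0, "g3": 2.0, "g4": 0.0}
main = {"g1": 0.8, "g3": -0.4}   # assumed selected main effects
inter = {("g1", "g3"): 0.6}      # assumed selected within-pathway interaction
p = predict_probability(x, 0.1, main, inter)
```

Restricting candidate interactions to gene pairs within a shared pathway keeps the number of interaction terms far below the count of all possible pairs, which is one plausible way pathway information can ease the high-dimensionality problem mentioned in the abstract.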