Abstract: This paper investigates how physiological information obtained in clinical investigation can be used when selecting informative genes from a large number of colon tumor gene expression profiles. First, the ReCorre feature selection algorithm is applied to build a primary selection model of informative genes based on the maximum posterior probability criterion. Then, the posterior probability and the Bhattacharyya distance are fused with the physiological information in a new selection model, from which new informative genes are selected. Finally, 2000 colon tumor gene expression samples are analyzed, and the selected informative genes are checked by leave-one-out cross-validation (LOOCV) based on a support vector machine (SVM). The results show that using physiological information is feasible and effective for obtaining the best informative genes.
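
To illustrate how the Bhattacharyya distance can act as a gene-ranking criterion, the following minimal sketch scores each gene by the distance between its expression values in two classes. It is not the fused selection model of this paper. It assumes that each class is univariate Gaussian per gene, and the function names `bhattacharyya_distance` and `rank_genes`, the gene labels, and the toy values are all hypothetical.

```python
import math
from statistics import mean, variance


def bhattacharyya_distance(class_a, class_b, eps=1e-12):
    """Bhattacharyya distance between two classes for one gene.

    Assumption: each class is modeled as a univariate Gaussian.
    eps guards against zero variance (illustrative choice).
    """
    mu_a, mu_b = mean(class_a), mean(class_b)
    var_a = variance(class_a) + eps
    var_b = variance(class_b) + eps
    # Term driven by the separation of the class means.
    mean_term = 0.25 * (mu_a - mu_b) ** 2 / (var_a + var_b)
    # Term driven by the mismatch of the class variances.
    var_term = 0.5 * math.log((var_a + var_b) / (2.0 * math.sqrt(var_a * var_b)))
    return mean_term + var_term


def rank_genes(expression):
    """Return gene names sorted by decreasing Bhattacharyya distance.

    `expression` maps a gene name to a pair of lists:
    (values in class A, values in class B).
    """
    scores = {
        gene: bhattacharyya_distance(a, b) for gene, (a, b) in expression.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


if __name__ == "__main__":
    # Hypothetical toy data: (tumor values, normal values) per gene.
    toy = {
        "gene_A": ([5.0, 5.5, 6.0], [1.0, 1.5, 2.0]),
        "gene_B": ([2.0, 2.5, 3.0], [2.1, 2.4, 3.1]),
    }
    print(rank_genes(toy))
```

A gene whose two classes are well separated receives a larger distance and is ranked earlier. In a full pipeline, such a score would be combined with the posterior probability and the physiological information, and the resulting gene subset would be checked by LOOCV with an SVM.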