Table 1 DNA MICROARRAY DATASET SPECIFICATIONS [4] The preprocessing process corrects problems that arise in the data processing. Such as data with too many features and the high difference in range values in each feature. These problems can cause the results of data processing to be less good or not optimal. The data used in this final project is DNA microarray data from Kent Ridge Biomedical Dataset [4]. The DNA microarray data consists of ovarian cancer, lung cancer, breast cancer, and colon cancer. The dataset specifications used can be seen in Table I.