Comparative analysis of reliefF-SVM and CFS-SVM for microarray data classification

Mochamad Agusta Naofal Hakim, Adiwijaya Adiwijaya, Widi Astuti

Abstract


Cancer is one of the main causes of death in the world where the World Health Organization (WHO) recognized cancer as among the top causes of death in 2018. Thus, detecting cancer symptoms is paramount in order to cure and subsequently reduce the casualties due to cancer disease. Many studies have been developed data mining approaches to detect symptoms of cancer through a classifying human gene data expression. One popular approach is using microarray data based on DNA. However, DNA microarray data has many dimensions that can have a detrimental effect on the accuracy of classification. Therefore, before performing classification, a feature selection technique must be used to eliminate features that do not have important information to support the classification process. The feature selection techniques used were ReliefF and Correlation-based Feature Selection (CFS) and a classification technique used in this study is Support Vector Machine (SVM). Several testing schemes were applied in this analysis to compare the performance of ReliefF and Correlation-based Feature Selection (CFS) with Support Vector Machine (SVM). It showed that the ReliefF outperformed compared with CFS as microarray data classification approach.

Keywords


classification; correlation-based feature selection (CFS); feature selection; microarray data; reliefF; support vector machine (SVM);



DOI: http://doi.org/10.11591/ijece.v11i4.pp%25p
Total views : 0 times


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

ISSN 2088-8708, e-ISSN 2722-2578