Please use this identifier to cite or link to this item: http://repository.kln.ac.lk/handle/123456789/15558
Title: New Feature Selection Method for High Dimensional Gene Data
Authors: Fajila, M.N.F.
Nawarathna, R.D.
Keywords: Classification
Dimensionality reduction
Feature selection
Gene selection
Microarray experiment
Issue Date: 2016
Publisher: Department of Statistics & Computer Science, University of Kelaniya, Sri Lanka
Citation: Fajila, M.N.F. and Nawarathna, R.D. 2016. New Feature Selection Method for High Dimensional Gene Data. Symposium on Statistical & Computational Modelling with Applications (SymSCMA – 2016), Department of Statistics & Computer Science, University of Kelaniya, Sri Lanka. p 66-69.
Abstract: Dimensionality reduction (i.e., feature selection) is an essential technique in data science when handling high dimensional data such as cancer microarray samples. Cancer microarray experiments normally provide a large number of data which is assumed to contain many features, called, genes. However, genes can be either redundant or irrelevant, and thus be removed without incurring much loss of information. A small number of samples with a large number of genes is the major problem in microarray data analysis. In this study, a new machine learning method, namely, hybrid wrapper – filter feature selection is proposed for gene selection. This approach combines the genes selected by both filter and wrapper feature selection methods. Further, it uses a least priority feature elimination procedure where the genes with the lowest priority are eliminated. The propsoed technique is validated and evaluated on two microarray data sets namely, leukemia and colon cancer data sets. With gene selection performed by the proposed method, it helps to classify the leukemia microarray samples with perfect classification (100%) and to classify the colon cancer data set only with two misclassifications giving an accuracy of 90.5%. Results show that the proposed technique is extremely efficient in terms of the computational time too.
URI: http://repository.kln.ac.lk/handle/123456789/15558
Appears in Collections:SymSCMA – 2016

Files in This Item:
File Description SizeFormat 
66-69.pdf477.35 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.