Lasso based gene selection for linear classifiers

Abstract

Selecting a subset of genes with strong discriminative power is a very important step in classification problems based on gene expression data. Lasso is known to have automatic variable selection ability in linear regression analysis. This paper uses Lasso to select most informative genes to represent the class label as a linear function of the gene expression data. The selected genes are further used to fit linear classifiers for tumor classification. The proposed approach (gene selection and linear classification) was applied to 5 publicly available cancer datasets. Compared to other methods in literature, the proposed method achieves similar or higher classification accuracy with fewer genes.

Department(s)

Mathematics

Document Type

Conference Proceeding

DOI

https://doi.org/10.1109/BIBMW.2009.5332127

Keywords

Cross validation, Lasso, Leave-one-out, Linear classifier, Variable selection

Publication Date

12-1-2009

Journal Title

Proceedings - 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009

Share

COinS