Presentation + Paper
24 April 2020 Gene expression classification using L1 norm PCA
Zachary Walker, Colleen P. Bailey, Hae Jin Kim
Author Affiliations +
Abstract
Gene microarray data generally includes high dimension, small sample datasets prone to noise. Analyzing this data using supervised and non-supervised learning algorithms is extremely useful for gene characterization, disease diagnosis, and genetic therapy in the medical field. For many years, principal component analysis (PCA) has been used as a tool in algorithms for gene expression classification. Previous solutions utilize L2 norm based PCA, however with its superior resistance to outlier data, L1 norm PCA offers improved results. Both methods are compared using support vector machines (SVM) to classify genetic mutations and co-regulation in several publicly available datasets. Methods utilizing L1 PCA result in improved accuracy compared to L2 PCA when used as a pre-processing step to SVM classification for gene microarray data.
Conference Presentation
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zachary Walker, Colleen P. Bailey, and Hae Jin Kim "Gene expression classification using L1 norm PCA", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950I (24 April 2020); https://doi.org/10.1117/12.2560883
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Principal component analysis

Leukemia

Yeast

Alzheimer's disease

Machine learning

Genetics

Visualization

Back to Top