Clustering-based multi-class classification of complex disease
Conference proceedings article
Authors/Editors
Strategic Research Themes
No matching items found.
Publication Details
Author list: Phongwattana T., Engchuan W., Chan J.H.
Publisher: Hindawi
Publication year: 2015
Start page: 25
End page: 29
Number of pages: 5
ISBN: 9781479960491
ISSN: 0146-9428
eISSN: 1745-4557
Languages: English-Great Britain (EN-GB)
Abstract
Pathway activity data transformed from gene expression profiles may be used to identify tumors, complex diseases progression, and cellular response to stimuli, and so on. Previous researches utilized data mining techniques on pathway activity data to distinguish subjects or to predict the phenotype outcome of subject directly. However, in the multi-class classification, learning those data mixing with population from different groups may result in contaminated model as excessive information is presented. This research, we use a two-stage approach applying clustering to homogenize training data before building the classification model. Hierarchical Clustering is used as a clustering method and Random Forest is used as classifier for evaluating the performance of the proposed method. The results are promising and show that using a clustering technique before classifying improves classification performance in general. ฉ 2015 IEEE.
Keywords
DNA Microarray, Hierarchical Clustering, Pathway Activities, Two-stage Multi-class Analysis