Unsupervised algorithms for population classification and ancestry informative marker selection

Conference proceedings article


Authors/Editors


Strategic Research Themes

No matching items found.


Publication Details

Author listRodpan A., Wangkumhang P., Assawamakin A., Prom-On S., Tongsima S.

PublisherSpringer Verlag (Germany): Computer Proceedings

Publication year2010

Volume number115 CCIS

Start page208

End page216

Number of pages9

ISBN3642167497; 9783642167492

ISSN1865-0929

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-78649524423&doi=10.1007%2f978-3-642-16750-8_18&partnerID=40&md5=675157a8e4d6141fedf117e81ef8f232

LanguagesEnglish-Great Britain (EN-GB)


View in Web of Science | View on publisher site | View citing articles in Web of Science


Abstract

Single Nucleotide Polymorphisms (SNPs) can be used to identify the differences among populations. However, for high-level organisms, there are numerous number of SNPs distributed throughout entire of the genomes. Animal breeders can make use of these genetic markers to different subpopulations. For economical purpose, finding a minimum number of SNPs that can accurately identify different breeds is needed. In this paper, given a set of SNP genotyping samples, without knowing what breed a sample belong to (unlabeled samples), we developed a framework to classify these samples into different animal groups (breeds) based on their genotyping profiles. The proposed framework further identifies a small set of SNPs, called ancestry informative markers (AIMs) that can accurately classify these samples to these groups. The proposed framework adopted the Principal Component Analysis (PCA) technique, and Student's t-test, to cluster unlabeled genotype data and determine AIMs, respectively. This unsupervised approach can avoid potential ascertainment biases due to mistakenly label some samples or having unlabeled data to be classified. ฉ 2010 Springer-Verlag Berlin Heidelberg.


Keywords

AIMsancestry informative markerspopulation structureStudent's t-test


Last updated on 2023-17-10 at 07:35