Cluster Validity Index for Big Data Based on Density Discriminant Analysis
Conference proceedings article
Authors/Editors
Strategic Research Themes
Publication Details
Author list: Weerapong T., Sathapornvajana S., Padungweang P., Krathu W.
Publisher: Hindawi
Publication year: 2020
Start page: 1
End page: 4
Number of pages: 4
ISBN: 9781728181066
ISSN: 0146-9428
eISSN: 1745-4557
Languages: English-Great Britain (EN-GB)
Abstract
The important factor for clustering unsupervised data is the Cluster Validity Index indicating appropriate number of clusters. The paper proposes the application of the unsupervised density discriminant analysis algorithm for cluster validation in the context of Big Data. In particular, the experiment was conducted to perform clustering tasks on big dataset by using centroid based clustering algorithm and apply unsupervised density discriminant analysis algorithm to find the most appropriate number of clusters. The performance evaluation was performed by means of processing time. The result shows that the time used to perform the clustering task depends on number of features and clusters. © 2020 IEEE.
Keywords
Apache Spark