Cluster Validity Index for Big Data Based on Density Discriminant Analysis

Conference proceedings article


Authors/Editors


Strategic Research Themes


Publication Details

Author listWeerapong T., Sathapornvajana S., Padungweang P., Krathu W.

PublisherHindawi

Publication year2020

Start page1

End page4

Number of pages4

ISBN9781728181066

ISSN0146-9428

eISSN1745-4557

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85097348238&doi=10.1109%2fIBDAP50342.2020.9245612&partnerID=40&md5=1161ba69d8859b5847d535dcf07efd70

LanguagesEnglish-Great Britain (EN-GB)


View on publisher site


Abstract

The important factor for clustering unsupervised data is the Cluster Validity Index indicating appropriate number of clusters. The paper proposes the application of the unsupervised density discriminant analysis algorithm for cluster validation in the context of Big Data. In particular, the experiment was conducted to perform clustering tasks on big dataset by using centroid based clustering algorithm and apply unsupervised density discriminant analysis algorithm to find the most appropriate number of clusters. The performance evaluation was performed by means of processing time. The result shows that the time used to perform the clustering task depends on number of features and clusters. © 2020 IEEE.


Keywords

Apache Spark


Last updated on 2023-26-09 at 07:36