TY - GEN
T1 - Data clustering approaches survey and analysis
AU - Ahalya, G.
AU - Pandey, Hari Mohan
PY - 2015/7/13
Y1 - 2015/7/13
N2 - In the current world, there is a need to analyze and extract information from data. Clustering is one such analytical method which involves the distribution of data into groups of identical objects. Every group is known as a cluster, which consists of objects that have affinity within the cluster and disparity with the objects in other groups. This paper is intended to examine and evaluate various data clustering algorithms. The two major categories of clustering approaches are partition and hierarchical clustering. The algorithms which are dealt here are: k-means clustering algorithm, hierarchical clustering algorithm, density based clustering algorithm, self-organizing map algorithm, and expectation maximization clustering algorithm. All the mentioned algorithms are explained and analyzed based on the factors like the size of the dataset, type of the data set, number of clusters created, quality, accuracy and performance. This paper also provides the information about the tools which are used to implement the clustering approaches. The purpose of discussing the various software/tools is to make the beginners and new researchers to understand the working, which will help them to come up with new product and approaches for the improvement.
AB - In the current world, there is a need to analyze and extract information from data. Clustering is one such analytical method which involves the distribution of data into groups of identical objects. Every group is known as a cluster, which consists of objects that have affinity within the cluster and disparity with the objects in other groups. This paper is intended to examine and evaluate various data clustering algorithms. The two major categories of clustering approaches are partition and hierarchical clustering. The algorithms which are dealt here are: k-means clustering algorithm, hierarchical clustering algorithm, density based clustering algorithm, self-organizing map algorithm, and expectation maximization clustering algorithm. All the mentioned algorithms are explained and analyzed based on the factors like the size of the dataset, type of the data set, number of clusters created, quality, accuracy and performance. This paper also provides the information about the tools which are used to implement the clustering approaches. The purpose of discussing the various software/tools is to make the beginners and new researchers to understand the working, which will help them to come up with new product and approaches for the improvement.
KW - Clustering
KW - density based clustering algorithm
KW - Expectation maximization clustering algorithm
KW - Hierarchical clustering
KW - K-means clustering algorithm
KW - Self-organization maps algorithm
UR - http://www.scopus.com/inward/record.url?scp=84941254808&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84941254808&partnerID=8YFLogxK
UR - http://www.mendeley.com/research/data-clustering-approaches-survey-analysis
U2 - 10.1109/ABLAZE.2015.7154919
DO - 10.1109/ABLAZE.2015.7154919
M3 - Conference proceeding (ISBN)
AN - SCOPUS:84941254808
SN - 9781479984336
T3 - 2015 1st International Conference on Futuristic Trends in Computational Analysis and Knowledge Management, ABLAZE 2015
SP - 532
EP - 537
BT - 2015 1st International Conference on Futuristic Trends in Computational Analysis and Knowledge Management, ABLAZE 2015
A2 - Kumar, Bhawna
A2 - Singh, Gurinder
A2 - Jassi, J.S.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2015 1st International Conference On Futuristic Trends in Computational Analysis and Knowledge Management, ABLAZE 2015
Y2 - 25 February 2015 through 27 February 2015
ER -