Header menu link for other important links
X
Adaptive K-Means Clustering to Handle Heterogeneous Data Using Basic Rough Set Theory
, Ghosh A, Panda G.K.
Published in Springer Berlin Heidelberg
2012
Volume: 84
   
Issue: PART 1
Pages: 193 - 202
Abstract
Several cluster analysis techniques have been developed till the present to group objects having similar property or similar characteristics and K-means clustering is one of the most popular statistical clustering techniques proposed by Macqueen [12] in 1967. But this algorithm is unable to handle the categorical data and unable to handle uncertainty as well. But after proposing the rough set theory by Pawlak [15], we have an alternative way of representing sets whose exact boundary cannot be described due to incomplete information. As rough set has been widely used for knowledge representation, hence it can also be applied in classification and very helpful in clustering too. In real life data mining applications we do not have the crisp boundaries for clusters. So, in 2007 and 2009 Parmar et al [14] and Tripathy et al [16] proposed two algorithms MMR and MMeR using rough set theory but these two algorithms have the stability problem due to multiple runs and higher time complexity. In this paper we are proposing a new approach of k-means algorithm using rough set which can handle heterogeneous data and uncertainty as well. © Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2012.