International Academic Journal of Science and Engineering

  • ISSN 2454-3896

Clustering Algorithms in Big data

Shirin Abbasi and Babak Vaziri

Abstract: Massive data processing and management have received considerable attention in recent years; dedicated frameworks and tools have been introduced for this purpose, and data analysis is studied as part of massive data management. Data mining and machine learning techniques are used in many organizations and industries to improve business, forecasting, and growth. Given the growth of enterprise data and its high-volume, high-speed generation, data mining techniques for massive data play an important role in these developments. Some data management operations serve as preprocessing, performed to prepare the data for the main operations. In cloud environments, moreover, the distributed nature of the environment makes data grouping and sorting especially important. In this paper we study the main clustering algorithm, K-Means, and, based on this study, suggest some approaches that take the main parameters of cloud environments into account and improve massive data clustering.

Keywords: Clustering – massive data – massive data management – cloud storage – cloud computing – K-Means

Pages: 182-192

Volume 2, Issue 2, 2015