|
|
Bimonthly Since 1986 |
ISSN 1004-9037
|
|
|
|
|
Publication Details |
Edited by: Editorial Board of Journal of Data Acquisition and Processing
P.O. Box 2704, Beijing 100190, P.R. China
Sponsored by: Institute of Computing Technology, CAS & China Computer Federation
Undertaken by: Institute of Computing Technology, CAS
Published by: SCIENCE PRESS, BEIJING, CHINA
Distributed by:
China: All Local Post Offices
|
|
|
|
|
|
|
|
|
|
Abstract
Data mining refers to the preset procedures and algorithms used to extract these valuable patterns. The research aims to improve partition-based clustering algorithmswith advanced features of efficient data analysis and automatically generate an appropriate number of clusters.The efficiency of K-Means clustering is further challenged by real-world datasets with high dimensionality. As a result, the algorithm becomes too expensive to implement. With an increase in size comes a decrease in cluster quality. This study proposes a K-Modes algorithm-based technique for working efficiently with large dimension datasets.Improvements can be made to this approach by eliminating non-significant features from the clustering process, which reduces the dimensionality of the clusters created and improves their accuracy. However, this number can be used as an input depending on user requirements if it has a significance value greater than or equal to 60% of the maximum significance value in the proposed algorithm.
Keyword
Data Mining, K-Means, Clustering, IoT
PDF Download (click here)
|
|
|
|
|