Bimonthly    Since 1986
ISSN 1004-9037
Publication Details
Edited by: Editorial Board of Journal of Data Acquisition and Processing
P.O. Box 2704, Beijing 100190, P.R. China
Sponsored by: Institute of Computing Technology, CAS & China Computer Federation
Undertaken by: Institute of Computing Technology, CAS
Published by: SCIENCE PRESS, BEIJING, CHINA
Distributed by:
China: All Local Post Offices
 
   
      02 June 2023, Volume 38 Issue 3
    Article

    DEEP ADVERSARIAL REGULARIZED AUTOENCODER TECHNIQUE FOR HIGH DIMENSIONAL DATA CLUSTERING AND DATA PRIVACY PRESERVING PARADIGM IN BIG DATA CLOUD ENVIRONMENT
    Kiruthika B1, Dr. B. Srinivasan2, Dr. P. Prabhusundhar3
    Journal of Data Acquisition and Processing, 2023, 38 (3): 3365-3376 . 

    Abstract

    High dimensional data clustering have gained significant attention recently due to increased utilization of the high dimensional data streams across the large distributed cloud. High dimensional data clustering approaches has been developed in the conventional work using both machine learning and deep learning architectures. Despite of many advantageous, it is inefficient in handling the curse of dimensionality and data sparsity issues. Further learning model leads to high computational complexity and data disclosure attacks to various transaction queries to cloud servers. In order to mitigate those challenges, high scalable secure regularized model has been designed which is entitled as Deep Adversarial Regularized Autoencoder Technique. Autoencoder is a popular mechanism to accomplish dimensionality reduction. Proposed model deeply explore the latent structure of the data and computes the associations of the data points to construct the spatial and temporal cluster structures to high dimensional data as new clustering perspective. It discriminate the data points efficiently. Further evolving data streams are approximated using the variational autoencoder to preserve the cluster structures. Euclidean distance is employed in embedding function of the autoencoder to generate the efficient data clusters with minimized intra cluster similarity and inter cluster variation in the feature space. Hyper parameter tuning using RMSProp has been enabled in the output layer to make the data instance in the cluster to be close to each other by determining the affinity of the data on new representation. Experimental analysis has been performed on benchmarks datasets such as Twitter and Forest Cover to compute the proposed model performance with conventional approaches. The performance outcome represents that good scalability and effectiveness on high dimensional data has been reached.

    Keyword

    High Dimensional Data clustering, Big Data Cloud, Privacy Preserving, Variational Autoencoder, Advanced Networking


    PDF Download (click here)

SCImago Journal & Country Rank

ISSN 1004-9037

         

Home
Editorial Board
Author Guidelines
Subscription
Journal of Data Acquisition and Processing
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
E-mail: info@sjcjycl.cn
 
  Copyright ©2015 JCST, All Rights Reserved