Bimonthly    Since 1986
ISSN 1004-9037
Publication Details
Edited by: Editorial Board of Journal of Data Acquisition and Processing
P.O. Box 2704, Beijing 100190, P.R. China
Sponsored by: Institute of Computing Technology, CAS & China Computer Federation
Undertaken by: Institute of Computing Technology, CAS
Published by: SCIENCE PRESS, BEIJING, CHINA
Distributed by:
China: All Local Post Offices
 
   
      30 Dec 2022, Volume 37 Issue 5   
    Article

    DATA MINING USING SUPERVISED INSTANCE SELECTION (SIS) FOR BETTER CLASSIFICATION ACCURACY IN ARTIFICIAL NEURAL NETWORKS
    S. Srinivas Reddy and Dr. Rajeev G. Vishwkarma
    Journal of Data Acquisition and Processing, 2022, 37 (5): 1385-1393 . 

    Abstract

    The semi-supervised learning techniques use abundant unlabeled data for helping to learn a better classifier if the number of instances is very less. A basic technique is to choose and label the unlabeled instances that the present classifier has higher classification confidence for enlarging the labeled training set and the updating the classifier, which is mostly utilized in two different paradigms of semi-supervised learning namely: co-training and self-training. But the actual labeled instances will be more reliable compared to self-labeled instances which would be labeled by a classifier. If unlabeled instances are assigned to wrong labels then the classification accuracy of classifier might be jeopardized. In this paper, a new instance selection technique is presented based on real labeled data. This will consider present classifier performance on unlabeled data as well as its performance only on real labeled data. In every iteration, this utilizes the accuracy changes in newly learned classifier over original labeled data as a criteria for deciding either the chosen most confident unlabeled instances would be accepted by further iteration or not. The experiments will be conducted in co-training as well as self-training while using Naïve Bayes (NB) as a base classifier. The results show that, SIS will significantly improve accuracy and classification of self-training and co-training. From results it can observe that it will improve the accuracy, Precision, Recall and F1 score compared with semi-supervised classification method.

    Keyword

    Supervised Instance Selection (SIS), Data mining, Meta-learning, Algorithm selection.


    PDF Download (click here)

SCImago Journal & Country Rank

ISSN 1004-9037

         

Home
Editorial Board
Author Guidelines
Subscription
Journal of Data Acquisition and Processing
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
E-mail: info@sjcjycl.cn
 
  Copyright ©2015 JCST, All Rights Reserved