Bimonthly    Since 1986
ISSN 1004-9037
Publication Details
Edited by: Editorial Board of Journal of Data Acquisition and Processing
P.O. Box 2704, Beijing 100190, P.R. China
Sponsored by: Institute of Computing Technology, CAS & China Computer Federation
Undertaken by: Institute of Computing Technology, CAS
Published by: SCIENCE PRESS, BEIJING, CHINA
Distributed by:
China: All Local Post Offices
 
   
      1 Jan 2023, Volume 38 Issue 1   
    Article

    1. AFAAN OROMO PRONOMINAL ANAPHORA RESOLUTION USING CONDITIONAL RANDOM FIELDS
    Bikiltu Guteta1, Teklu Urgessa2, T.Gopi Krishna3,Shubhashish Bhakta4, Sobha Lalitha Devi5
    Journal of Data Acquisition and Processing, 2023, 38 (1): 1184-1201 . 

    Abstract

    Anaphora resolution is the process of determining the antecedent of an anaphor (AR). AR is a challenging and complex issue in the field of Natural Language Processing (NLP). Very few works have been done for anaphora resolution in the Afaan Oromo language and this is due to lack of resources. Most researchers have applied a rule-based approach for anaphora resolution in Afaan Oromo. This paper proposed the pronominal anaphora resolution model for the Afaan Oromo language developed using a conditional random field (CRF), which deals with both Intra-sentential and inter-sentential kind of anaphora. CRF++ 0.58 tool has been used to develop the model. Afaan Oromo texts are collected from different sources such as Afaan Oromo holly bible, BBC Afaan Oromo news, Afaan Oromo grade 9 and 11 student textbooks to evaluate the performance of this model. Totally 1330 sentences with 12571 tokens were collected for both independent and hidden anaphors. From this collected and prepared dataset, 80 % of the dataset was used for training and the remaining 20% is for testing data. The precision of 78.87 percent, recall of 91.80 percent, and F-measure of 84.85 percent for resolution of independent anaphors were obtained using the CRF approach for Afaan Oromo pronominal anaphora resolution. The precision of 80.41 percent, recall of 95.12 percent, and F-measure of 87.15 percent were obtained for the resolution of hidden anaphors.

    Keyword

    Natural Language Processing, Anaphora resolution, Afaan Oromo, Machine Learning, Conditional Random Fields.


    PDF Download (click here)

SCImago Journal & Country Rank

ISSN 1004-9037

         

Home
Editorial Board
Author Guidelines
Subscription
Journal of Data Acquisition and Processing
Institute of Computing Technology, Chinese Academy of Sciences
P.O. Box 2704, Beijing 100190 P.R. China
E-mail: info@sjcjycl.cn
 
  Copyright ©2015 JCST, All Rights Reserved