IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

A Web Semantic-Based Text Analysis Approach for Enhancing Named Entity Recognition Using PU-Learning and Negative Sampling

A Web Semantic-Based Text Analysis Approach for Enhancing Named Entity Recognition Using PU-Learning and Negative Sampling
View Sample PDF
Author(s): Shunqin Zhang (School of Mathematics Sciences, University of Chinese Academy of Sciences, Beijing, China), Sanguo Zhang (School of Mathematics Sciences, University of Chinese Academy of Sciences, Beijing, China), Wenduo He (Institute for Network Sciences and Cyberspace (INSC), Tsinghua University, Beijing, China)and Xuan Zhang (Tsinghua University, China)
Copyright: 2024
Volume: 20
Issue: 1
Pages: 23
Source title: International Journal on Semantic Web and Information Systems (IJSWIS)
Editor(s)-in-Chief: Brij Gupta (Asia University, Taichung City, Taiwan)
DOI: 10.4018/IJSWIS.335113

Purchase


Abstract

The NER task is largely developed based on well-annotated data. However, in many scenarios, the entities may not be fully annotated, leading to serious performance degradation. To address this issue, the authors propose a robust NER approach that combines a novel PU-learning algorithm and negative sampling. Unlike many existing studies, the proposed method adopts a two-step procedure for handling unlabeled entities, thereby enhancing its capability to mitigate the impact of such entities. Moreover, this algorithm demonstrates high versatility and can be integrated into any token-level NER model with ease. The effectiveness of the proposed method is verified on several classic NER models and datasets, demonstrating its strong ability to handle unlabeled entities. Finally, the authors achieve competitive performances on synthetic and real-world datasets.

Related Content

Zhou Li, Gengming Xie, Varsha Arya, Kwok Tai Chui. © 2024. 10 pages.
Ming-Te Chen, Yi Yang Chang, Ta Jen Wu. © 2024. 22 pages.
Mingrui Zhao, Chunjing Shi, Yixiao Yuan. © 2024. 30 pages.
Qi Zhou, Zhoupu Wang. © 2024. 20 pages.
Shunqin Zhang, Sanguo Zhang, Wenduo He, Xuan Zhang. © 2024. 23 pages.
Qi Zhou, Chun Shi. © 2024. 28 pages.
Hao Ma, Zhiyi Gai. © 2024. 27 pages.
Body Bottom