Enhanced Bootstrapping Algorithm for Automatic Annotation of Tweets
Author(s): Mudasir Mohd (University of Kashmir, Srinagar, India), Rafiya Jan (Central University of Kashmir, Srinagar, India) and Nida Hakak (Mahareshi Dayanand University, Haryana, India)
Copyright: 2020
Volume: 14
Issue: 2
Pages: 26
Source title: International Journal of Cognitive Informatics and Natural Intelligence (IJCINI)
Editor(s)-in-Chief: Kangshun Li (South China Agricultural University, China)
DOI: 10.4018/IJCINI.2020040103

Abstract

Annotations are critical in text mining tasks such as opinion mining, sentiment analysis, and word sense disambiguation. Supervised learning algorithms begin by training a classifier and therefore require manually annotated datasets. However, manual annotations are often subjective and biased, and they are onerous to develop; hence the need for automatic annotation. Automatic annotators label data to create the training set for a supervised classifier, but they lack subjectivity and ignore the semantics of the underlying textual structures. The objective of this research is to develop a scalable and semantically rich automatic annotation system that incorporates domain-dependent characteristics of the annotation process. The authors devised an enhanced bootstrapping algorithm for the automatic annotation of Tweets and augmented it with distributional semantic models (LSA and Word2Vec). Tested on 12,000 crowd-sourced annotated Tweets, the proposed algorithm achieved an accuracy of 68.56%, which is higher than the baseline accuracy.
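
To make the general idea concrete, the following is a minimal sketch of seed-based bootstrapping with similarity-driven lexicon expansion. It is not the authors' algorithm, which the abstract does not specify. The toy two-dimensional vectors in VECTORS, the seed words, the 0.9 threshold, and the function names expand_lexicon, bootstrap, and annotate are all hypothetical; in practice the vectors would come from a trained distributional model such as LSA or Word2Vec.

```python
import math

# Hypothetical toy word vectors standing in for LSA/Word2Vec embeddings.
VECTORS = {
    "good": (1.0, 0.1),
    "great": (0.9, 0.2),
    "awesome": (0.8, 0.3),
    "bad": (-1.0, 0.1),
    "awful": (-0.9, 0.2),
    "terrible": (-0.8, 0.3),
    "phone": (0.0, 1.0),
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_lexicon(lexicon, threshold):
    """Label each unlabeled word with the polarity of its most similar
    labeled word, provided that similarity reaches the threshold."""
    added = {}
    for word, vec in VECTORS.items():
        if word in lexicon:
            continue
        best_label, best_sim = None, threshold
        for known, label in lexicon.items():
            sim = cosine(vec, VECTORS[known])
            if sim >= best_sim:
                best_label, best_sim = label, sim
        if best_label is not None:
            added[word] = best_label
    return added


def bootstrap(seeds, max_rounds=5, threshold=0.9):
    """Grow the seed lexicon round by round until no new word qualifies."""
    lexicon = dict(seeds)
    for _ in range(max_rounds):
        new_words = expand_lexicon(lexicon, threshold)
        if not new_words:
            break
        lexicon.update(new_words)
    return lexicon


def annotate(tweet, lexicon):
    """Assign a label from the summed polarity of known tokens."""
    score = sum(lexicon.get(token, 0) for token in tweet.lower().split())
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


lexicon = bootstrap({"good": 1, "bad": -1})
print(annotate("This phone is awesome", lexicon))
```

The threshold controls the usual bootstrapping trade-off: a lower value labels more words per round but raises the risk that an early mistake propagates into later rounds.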