Information Resources Management Association

BHA2: Bio-Inspired Algorithm and Automatic Summarisation for Detecting Different Types of Plagiarism

Author(s): Hadj Ahmed Bouarara (Tahar Moulay University of Saida, Algeria), Reda Mohamed Hamou (Department of Computer Science, Tahar Moulay University of Saida, Algeria), and Amine Rahmani (GeCoDe Laboratory, Department of Computer Sciences, Dr. Tahar Moulay University of Saida, Algeria)
Copyright: 2019
Pages: 27
Source title: Scholarly Ethics and Publishing: Breakthroughs in Research and Practice
Source Author(s)/Editor(s): Information Resources Management Association (USA)
DOI: 10.4018/978-1-5225-8057-7.ch018

Abstract

Over the last decade, plagiarism cases have increased and become a topical problem in the modern scientific world, driven by the quantity of textual information available online and offline. The authors' work deals with the development of a new plagiarism detection system called BHA2, which takes as input the suspicious text (to be analysed) and the original texts (the learning basis). It can detect different forms of plagiarism based on: the Google API to detect plagiarism by translation; text summarisation to detect plagiarism of ideas; conceptual transformation to detect plagiarism by synonymy; bag of phrases to detect paraphrase plagiarism; and the social worker bees algorithm, inspired by the lifestyle of social worker bees (forager, guardian, and cleaner), to select the source documents of the plagiarism. The output of the authors' system is the plagiarised passages (the parts copied from the original texts) and the plagiarism percentage for each suspicious text. Their experiments were performed on the PAN 09 dataset using the validation measures (recall, precision, accuracy, error, F-measure, entropy, FPR, FNR, W-accuracy, ROC, and TCR) in order to show the benefit of this approach compared to the results of classical systems in the literature. A comparative study in terms of services was carried out between their system and other commercial systems such as (check, Turnitin, and a machine learning system). Finally, a visualisation step was carried out to present the outcome in graphical form (3D cube and cobweb) with more realism, using zooming and rotation functionalities.
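
Several of the validation measures listed in the abstract have standard definitions in terms of confusion-matrix counts. The following is a minimal sketch of those definitions only, not the authors' evaluation code. The function name `validation_measures` and the example counts are hypothetical, and the sketch assumes that "plagiarised" is the positive class. Entropy, W-accuracy, ROC, and TCR are left out because the abstract does not give the parameters they would need.

```python
def _ratio(numerator: float, denominator: float) -> float:
    """Divide safely, returning 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def validation_measures(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Standard measures from confusion-matrix counts.

    Assumption (not stated in the source): the positive class is
    "plagiarised", so tp counts passages correctly flagged as plagiarised.
    """
    total = tp + fp + tn + fn
    precision = _ratio(tp, tp + fp)
    recall = _ratio(tp, tp + fn)
    return {
        "precision": precision,
        "recall": recall,
        # Harmonic mean of precision and recall.
        "f_measure": _ratio(2 * precision * recall, precision + recall),
        "accuracy": _ratio(tp + tn, total),
        "error": _ratio(fp + fn, total),
        # False positive rate: originals wrongly flagged as plagiarised.
        "fpr": _ratio(fp, fp + tn),
        # False negative rate: plagiarised passages that were missed.
        "fnr": _ratio(fn, fn + tp),
    }


# Hypothetical counts, for illustration only.
measures = validation_measures(tp=8, fp=2, tn=85, fn=5)
```

With these definitions, recall and FNR always sum to 1 when at least one positive example exists, and likewise accuracy and error when the dataset is non-empty, which gives a quick sanity check on any reported results.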
