Weighted semantic plagiarism detection approach based on AHP decision model

The increasing rate of academic plagiarism is a social problem that engages institutions and publishers. Plagiarists try to mislead the plagiarism detection system using synonyms and inverted word order. Numerous algorithms tried to overcome these problems using structural and semantic detection. However, most of them focus on overcoming some challenges. Moreover, all of them consider the same significant degree for all terms of the documents. On the other hand, the time complexity is an essential parameter that must be considered. This paper presents an effective way to detect structural and semantic similarity degrees among two papers only using some part of the paper's content instead of all content, decreasing the time complexity. The similarity is calculated using a set of impressive terms and various combinations to augment plagiarism detection ability even if the word order is changed. Different weight is assigned to the word according to its position in various sections of the paper. Finally, an AHP (Analytical Hierarchy Process) model uses to calculate a weighted similarity. The results indicated that the proposed approach has more ability to detect semantic academic plagiarism, and the runtime is reduced compared to similar ones.

Medienart:

E-Artikel

Erscheinungsjahr:

2022

Erschienen:

2022

Enthalten in:

Zur Gesamtaufnahme - volume:29

Enthalten in:

Accountability in research - 29(2022), 4 vom: 01. Mai, Seite 203-223

Sprache:

Englisch

Beteiligte Personen:

JavadiMoghaddam, SeyyedMohammad [VerfasserIn]
Roosta, Fatemeh [VerfasserIn]
Noroozi, Asadolla [VerfasserIn]

Links:

Volltext

Themen:

AHP model
Journal Article
Plagiarism detection
Research Support, Non-U.S. Gov't
Semantic plagiarism
Text similarity
WordNet

Anmerkungen:

Date Completed 02.05.2022

Date Revised 05.06.2022

published: Print-Electronic

Citation Status MEDLINE

doi:

10.1080/08989621.2021.1911654

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM323480594