Exploring the Need to Use “Plagiarism” Detection Software Rationally

Universities and journals increasingly rely on software tools for detecting textual overlap of a scientific text with the previously published literature to detect potential plagiarism. Although software outputs need to be carefully reviewed by competent humans to verify the existence of plagiarism,...

Full description

Saved in:
Bibliographic Details
Main Authors: Petar Milovanovic, Tatjana Pekmezovic, Marija Djuric
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Publications
Subjects:
Online Access:https://www.mdpi.com/2304-6775/13/1/1
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Universities and journals increasingly rely on software tools for detecting textual overlap of a scientific text with the previously published literature to detect potential plagiarism. Although software outputs need to be carefully reviewed by competent humans to verify the existence of plagiarism, university and journal staff, for various reasons, often erroneously interpret the degree of plagiarism based on the percentage of textual overlap shown in the similarity report. This is often accompanied by explicit recommendations to the author(s) to paraphrase the text to achieve an “acceptable” percentage of overlap. Here, based on the available literature and real-world examples from similarity reports, we provide a classification with extensive examples of phrases that falsely inflate the similarity index and argue the futility and dangers of rephrasing such statements just for the sake of reducing the similarity index. The examples provided in this paper call for a more reasonable assessment of text similarity. To fully endorse the principles of academic integrity and prevent loss of clarity of the scientific literature, we believe it is important to shift from pure bureaucratic and quantificational view on the originality of scientific texts to human-centered qualitative assessment of the manuscripts, including the software outputs.
ISSN:2304-6775