Similar text positioning method based on slope-density cluster

Similar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were an...

Full description

Saved in:
Bibliographic Details
Main Authors: Du ZOU, Wen-jun TANG, Wei-jiang LONG, Ling ZHANG
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2013-09-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.Z2.030/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Similar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were analyzed,and a cluster method based on slope density for similar text positioning was proposed,which converts the text merge problem into dense sample points clustering problem,and improves the efficiency and accuracy of the positioning.Through the experiment on the PAN public corpus,the result shows it performs better than the PAN10 top three.This method has been used in the South China University of Technology 's feature professional teaching platform to detect the plagiarism of homework.
ISSN:1000-436X