Similar text positioning method based on slope-density cluster

Similar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were an...

Full description

Saved in:
Bibliographic Details
Main Authors: Du ZOU, Wen-jun TANG, Wei-jiang LONG, Ling ZHANG
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2013-09-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.Z2.030/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841539724164988928
author Du ZOU
Wen-jun TANG
Wei-jiang LONG
Ling ZHANG
author_facet Du ZOU
Wen-jun TANG
Wei-jiang LONG
Ling ZHANG
author_sort Du ZOU
collection DOAJ
description Similar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were analyzed,and a cluster method based on slope density for similar text positioning was proposed,which converts the text merge problem into dense sample points clustering problem,and improves the efficiency and accuracy of the positioning.Through the experiment on the PAN public corpus,the result shows it performs better than the PAN10 top three.This method has been used in the South China University of Technology 's feature professional teaching platform to detect the plagiarism of homework.
format Article
id doaj-art-472d5b44f73d46c18098431cd5eb6deb
institution Kabale University
issn 1000-436X
language zho
publishDate 2013-09-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
spelling doaj-art-472d5b44f73d46c18098431cd5eb6deb2025-01-14T06:42:21ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2013-09-013415716259678430Similar text positioning method based on slope-density clusterDu ZOUWen-jun TANGWei-jiang LONGLing ZHANGSimilar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were analyzed,and a cluster method based on slope density for similar text positioning was proposed,which converts the text merge problem into dense sample points clustering problem,and improves the efficiency and accuracy of the positioning.Through the experiment on the PAN public corpus,the result shows it performs better than the PAN10 top three.This method has been used in the South China University of Technology 's feature professional teaching platform to detect the plagiarism of homework.http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.Z2.030/plagiarism detectionsimilar text positioningclusterfingerprint
spellingShingle Du ZOU
Wen-jun TANG
Wei-jiang LONG
Ling ZHANG
Similar text positioning method based on slope-density cluster
Tongxin xuebao
plagiarism detection
similar text positioning
cluster
fingerprint
title Similar text positioning method based on slope-density cluster
title_full Similar text positioning method based on slope-density cluster
title_fullStr Similar text positioning method based on slope-density cluster
title_full_unstemmed Similar text positioning method based on slope-density cluster
title_short Similar text positioning method based on slope-density cluster
title_sort similar text positioning method based on slope density cluster
topic plagiarism detection
similar text positioning
cluster
fingerprint
url http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.Z2.030/
work_keys_str_mv AT duzou similartextpositioningmethodbasedonslopedensitycluster
AT wenjuntang similartextpositioningmethodbasedonslopedensitycluster
AT weijianglong similartextpositioningmethodbasedonslopedensitycluster
AT lingzhang similartextpositioningmethodbasedonslopedensitycluster