Similar text positioning method based on slope-density cluster
Similar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were an...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2013-09-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.Z2.030/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841539724164988928 |
---|---|
author | Du ZOU Wen-jun TANG Wei-jiang LONG Ling ZHANG |
author_facet | Du ZOU Wen-jun TANG Wei-jiang LONG Ling ZHANG |
author_sort | Du ZOU |
collection | DOAJ |
description | Similar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were analyzed,and a cluster method based on slope density for similar text positioning was proposed,which converts the text merge problem into dense sample points clustering problem,and improves the efficiency and accuracy of the positioning.Through the experiment on the PAN public corpus,the result shows it performs better than the PAN10 top three.This method has been used in the South China University of Technology 's feature professional teaching platform to detect the plagiarism of homework. |
format | Article |
id | doaj-art-472d5b44f73d46c18098431cd5eb6deb |
institution | Kabale University |
issn | 1000-436X |
language | zho |
publishDate | 2013-09-01 |
publisher | Editorial Department of Journal on Communications |
record_format | Article |
series | Tongxin xuebao |
spelling | doaj-art-472d5b44f73d46c18098431cd5eb6deb2025-01-14T06:42:21ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2013-09-013415716259678430Similar text positioning method based on slope-density clusterDu ZOUWen-jun TANGWei-jiang LONGLing ZHANGSimilar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were analyzed,and a cluster method based on slope density for similar text positioning was proposed,which converts the text merge problem into dense sample points clustering problem,and improves the efficiency and accuracy of the positioning.Through the experiment on the PAN public corpus,the result shows it performs better than the PAN10 top three.This method has been used in the South China University of Technology 's feature professional teaching platform to detect the plagiarism of homework.http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.Z2.030/plagiarism detectionsimilar text positioningclusterfingerprint |
spellingShingle | Du ZOU Wen-jun TANG Wei-jiang LONG Ling ZHANG Similar text positioning method based on slope-density cluster Tongxin xuebao plagiarism detection similar text positioning cluster fingerprint |
title | Similar text positioning method based on slope-density cluster |
title_full | Similar text positioning method based on slope-density cluster |
title_fullStr | Similar text positioning method based on slope-density cluster |
title_full_unstemmed | Similar text positioning method based on slope-density cluster |
title_short | Similar text positioning method based on slope-density cluster |
title_sort | similar text positioning method based on slope density cluster |
topic | plagiarism detection similar text positioning cluster fingerprint |
url | http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2013.Z2.030/ |
work_keys_str_mv | AT duzou similartextpositioningmethodbasedonslopedensitycluster AT wenjuntang similartextpositioningmethodbasedonslopedensitycluster AT weijianglong similartextpositioningmethodbasedonslopedensitycluster AT lingzhang similartextpositioningmethodbasedonslopedensitycluster |