Large-scale duplicate image retrieval technical research for the internet

For the typical social media application on the internet,a large-scale distributed duplicate image retrieval ap-proach based on random projection and the block DCT coefficients was proposed.On the basis of Hadoop,this approach exploited image signatures generated by random projection mapping to retr...

Full description

Saved in:
Bibliographic Details
Main Authors: Shu-peng WANG, Ming CHEN, Guang-jun WU
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2014-12-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2014.12.023/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841539736359927808
author Shu-peng WANG
Ming CHEN
Guang-jun WU
author_facet Shu-peng WANG
Ming CHEN
Guang-jun WU
author_sort Shu-peng WANG
collection DOAJ
description For the typical social media application on the internet,a large-scale distributed duplicate image retrieval ap-proach based on random projection and the block DCT coefficients was proposed.On the basis of Hadoop,this approach exploited image signatures generated by random projection mapping to retrieve HBase efficiently.And candidate images with high-recall were achieved.Then in order to improve the retrieval precision,the block DCT coefficients were used to further filter candidate images.For 12 million images,experimental results showed that with our approach the recall ratio reached 98%,the precision ratio reached 93.2%,and the average retrieval time was 6.7s when H=2 and T=150.
format Article
id doaj-art-6caa2c4097244e7cbc1edd342cf19a4a
institution Kabale University
issn 1000-436X
language zho
publishDate 2014-12-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
spelling doaj-art-6caa2c4097244e7cbc1edd342cf19a4a2025-01-14T06:44:42ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2014-12-013519620259687772Large-scale duplicate image retrieval technical research for the internetShu-peng WANGMing CHENGuang-jun WUFor the typical social media application on the internet,a large-scale distributed duplicate image retrieval ap-proach based on random projection and the block DCT coefficients was proposed.On the basis of Hadoop,this approach exploited image signatures generated by random projection mapping to retrieve HBase efficiently.And candidate images with high-recall were achieved.Then in order to improve the retrieval precision,the block DCT coefficients were used to further filter candidate images.For 12 million images,experimental results showed that with our approach the recall ratio reached 98%,the precision ratio reached 93.2%,and the average retrieval time was 6.7s when H=2 and T=150.http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2014.12.023/social mediarandom projection mappingimage signatureblock DCT coefficientsHadoop cluster
spellingShingle Shu-peng WANG
Ming CHEN
Guang-jun WU
Large-scale duplicate image retrieval technical research for the internet
Tongxin xuebao
social media
random projection mapping
image signature
block DCT coefficients
Hadoop cluster
title Large-scale duplicate image retrieval technical research for the internet
title_full Large-scale duplicate image retrieval technical research for the internet
title_fullStr Large-scale duplicate image retrieval technical research for the internet
title_full_unstemmed Large-scale duplicate image retrieval technical research for the internet
title_short Large-scale duplicate image retrieval technical research for the internet
title_sort large scale duplicate image retrieval technical research for the internet
topic social media
random projection mapping
image signature
block DCT coefficients
Hadoop cluster
url http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2014.12.023/
work_keys_str_mv AT shupengwang largescaleduplicateimageretrievaltechnicalresearchfortheinternet
AT mingchen largescaleduplicateimageretrievaltechnicalresearchfortheinternet
AT guangjunwu largescaleduplicateimageretrievaltechnicalresearchfortheinternet