An AkNN Algorithm for High-Dimensional Big Data

A new variant of k nearest neighbor queries,which called as all k-nearest neighbor queries(AkNN),is a process to search the k nearest neighbors of each object in a data set.An AkNN query algorithm for high-dimensional big data on the Hadoop system was proposed.Using the banding technique and the p-s...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhongwei Wang, Yefang Chen, Siyou Xiao, Jiangbo Qian
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2015-07-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2015171/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:A new variant of k nearest neighbor queries,which called as all k-nearest neighbor queries(AkNN),is a process to search the k nearest neighbors of each object in a data set.An AkNN query algorithm for high-dimensional big data on the Hadoop system was proposed.Using the banding technique and the p-stable LSH algorithm,dimensionality reduction was performed,then the data was embeded in a Z-order curve.The preprocessed data were continued to be treated on a MapReduce framework in a distributed parallel manner.Experimental results show that the proposed algorithm can efficiently handle AkNN queries for large-scale high-dimensional data.
ISSN:1000-0801