SLSB-forest:approximate k nearest neighbors searching on high dimensional data

The study of approximate k nearest neighbors query has attracted broad attention.Local sensitive hash is one of the mainstream ways to solve this problem.Local sensitive hash and its varients have noted the following problems:the uneven distribution of hashed data in the buckets,it cannot calculate...

Full description

Saved in:
Bibliographic Details
Main Authors: Tu QIAN, Jiangbo QIAN, Yihong DONG, Huahui CHEN
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2017-09-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017193/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The study of approximate k nearest neighbors query has attracted broad attention.Local sensitive hash is one of the mainstream ways to solve this problem.Local sensitive hash and its varients have noted the following problems:the uneven distribution of hashed data in the buckets,it cannot calculate the appropriate query range (for the parameter k) to build index.To tackle the above problem,a new data struct which called SLSB-forest was presented.The SLSB-forest combined the LSH and the B-tree to maintain the amount of bucket’s data in reasonable range.Two query algorithms were proposed:fast and accurate priority search.Theory and experimental results prove that query range can dynamic change during searching approximate k nearest neighbors.
ISSN:1000-0801