Network log analysis with SQL-on-Hadoop

With the rapid expansion of network bandwidth,devices and applications,log management is facing the challenge of exploding data volumes.Log analysis platform built on SQL-on-Hadoop is capable of storing and querying hundreds of billions of log entries effectively.Columnar and compressed data formats...

Full description

Saved in:
Bibliographic Details
Main Authors: Si-yu ZHANG, Kai-da JIANG, Jian-wen WEI, Xuan LUO, Hai-yang WANG
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2014-10-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.3969/j.issn.1000-436x.2014.z1.004/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:With the rapid expansion of network bandwidth,devices and applications,log management is facing the challenge of exploding data volumes.Log analysis platform built on SQL-on-Hadoop is capable of storing and querying hundreds of billions of log entries effectively.Columnar and compressed data formats for Hadoop are benchmarked with real-world multi-TB dataset.Conditional and statistical querying efficiency of Hive and Impala is tested.With gzipped parquet format,log data can be compressed by 80%,and querying with impala is 5 times faster.On this platform,six security incident analysis and detection applications are already deployed.
ISSN:1000-436X