Efficient segment pattern based method for malicious URL detection

An efficient segment based method for detecting malicious URL was proposed.Firstly it analyzed the annotated malicious URLs in terms of three semantic segments,i.e.,domain segment,path segment and file segment.Secondly it quickly calculated the common pattern of each semantic segment exploiting the...

Full description

Saved in:
Bibliographic Details
Main Authors: Hai-lun LIN, Yan LI, Wei-ping WANG, Yin-liang YUE, Zheng LIN
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2015-11-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2015293/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841539553863663616
author Hai-lun LIN
Yan LI
Wei-ping WANG
Yin-liang YUE
Zheng LIN
author_facet Hai-lun LIN
Yan LI
Wei-ping WANG
Yin-liang YUE
Zheng LIN
author_sort Hai-lun LIN
collection DOAJ
description An efficient segment based method for detecting malicious URL was proposed.Firstly it analyzed the annotated malicious URLs in terms of three semantic segments,i.e.,domain segment,path segment and file segment.Secondly it quickly calculated the common pattern of each semantic segment exploiting the tri-gram model based inverted index.Finally it decided whether a given URL was malicious based on the segment patterns returned by searching the inverted index.Moreover,this method also supported the Jaccard based random domain name identification technique for deciding malicious URLs with random domain name.Experimental results show that proposed method outperforms the state-of-the-art baseline methods,and can achieve good efficiency and scalability on malicious URL detection.
format Article
id doaj-art-0d286de6e5324274877d2a4c9faf5653
institution Kabale University
issn 1000-436X
language zho
publishDate 2015-11-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
spelling doaj-art-0d286de6e5324274877d2a4c9faf56532025-01-14T06:54:22ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2015-11-013614114859697777Efficient segment pattern based method for malicious URL detectionHai-lun LINYan LIWei-ping WANGYin-liang YUEZheng LINAn efficient segment based method for detecting malicious URL was proposed.Firstly it analyzed the annotated malicious URLs in terms of three semantic segments,i.e.,domain segment,path segment and file segment.Secondly it quickly calculated the common pattern of each semantic segment exploiting the tri-gram model based inverted index.Finally it decided whether a given URL was malicious based on the segment patterns returned by searching the inverted index.Moreover,this method also supported the Jaccard based random domain name identification technique for deciding malicious URLs with random domain name.Experimental results show that proposed method outperforms the state-of-the-art baseline methods,and can achieve good efficiency and scalability on malicious URL detection.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2015293/malicious URLsegment patterntri-graminverted indexrandom name
spellingShingle Hai-lun LIN
Yan LI
Wei-ping WANG
Yin-liang YUE
Zheng LIN
Efficient segment pattern based method for malicious URL detection
Tongxin xuebao
malicious URL
segment pattern
tri-gram
inverted index
random name
title Efficient segment pattern based method for malicious URL detection
title_full Efficient segment pattern based method for malicious URL detection
title_fullStr Efficient segment pattern based method for malicious URL detection
title_full_unstemmed Efficient segment pattern based method for malicious URL detection
title_short Efficient segment pattern based method for malicious URL detection
title_sort efficient segment pattern based method for malicious url detection
topic malicious URL
segment pattern
tri-gram
inverted index
random name
url http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2015293/
work_keys_str_mv AT hailunlin efficientsegmentpatternbasedmethodformaliciousurldetection
AT yanli efficientsegmentpatternbasedmethodformaliciousurldetection
AT weipingwang efficientsegmentpatternbasedmethodformaliciousurldetection
AT yinliangyue efficientsegmentpatternbasedmethodformaliciousurldetection
AT zhenglin efficientsegmentpatternbasedmethodformaliciousurldetection