A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting

Website fingerprinting (WF) refers to the identification of target websites accessed by users in anonymous communication scenarios, playing a critical role in cybercrime investigation and forensics. In recent years, deep learning-based WF has become a research focus in this field. Although these met...

Full description

Saved in:
Bibliographic Details
Main Authors: Yaojun Ding, Bingxuan Hu
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10816323/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841563320224579584
author Yaojun Ding
Bingxuan Hu
author_facet Yaojun Ding
Bingxuan Hu
author_sort Yaojun Ding
collection DOAJ
description Website fingerprinting (WF) refers to the identification of target websites accessed by users in anonymous communication scenarios, playing a critical role in cybercrime investigation and forensics. In recent years, deep learning-based WF has become a research focus in this field. Although these methods can achieve high accuracy, their performance tends to decline over time. This degradation results from the evolution of network protocol versions and the ongoing development of obfuscation techniques, a phenomenon known as concept drift. To address the problem of concept drift, this paper presents a multi-granularity features representation and dimensionality reduction network for Website Fingerprinting, referred to as LRCT. The LRCT network effectively leverages the temporal learning advantages of Local Recurrent Networks (Local RNN) and the spatial learning strengths of Convolutional Neural Network (CNN) by designing the local feature extraction block (denoted as LRC Block), which extracts fine-grained local features from 2000-dimensional original sequences and reduces the dimensionality to 125. The network then uses a Transformer Encoder to capture more robust global features from the low-dimensional data. Experimental results comparing LRCT with state-of-the-art WF methods demonstrate that the proposed LRCT achieves 99.34% accuracy in a closed-world scenario, outperforming other benchmark models. Compared to the benchmark WF-Transformer method, the LRCT network reduces the parameter count by half and decreases the average training time per epoch to approximately 20% of the benchmark. Even under concept drift conditions, LRCT maintains an accuracy of 89.9%. In open-world scenarios, LRCT also has better precision and recall.
format Article
id doaj-art-bce88489d8e74aa88d51a7e86eaf7ad8
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-bce88489d8e74aa88d51a7e86eaf7ad82025-01-03T00:01:44ZengIEEEIEEE Access2169-35362025-01-011357458710.1109/ACCESS.2024.352289210816323A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website FingerprintingYaojun Ding0https://orcid.org/0009-0007-9169-6056Bingxuan Hu1https://orcid.org/0009-0001-8810-7560School of Artificial Intelligence, Gansu University of Political Science and Law, Lanzhou, ChinaSchool of Cyberspace Security, Gansu University of Political Science and Law, Lanzhou, ChinaWebsite fingerprinting (WF) refers to the identification of target websites accessed by users in anonymous communication scenarios, playing a critical role in cybercrime investigation and forensics. In recent years, deep learning-based WF has become a research focus in this field. Although these methods can achieve high accuracy, their performance tends to decline over time. This degradation results from the evolution of network protocol versions and the ongoing development of obfuscation techniques, a phenomenon known as concept drift. To address the problem of concept drift, this paper presents a multi-granularity features representation and dimensionality reduction network for Website Fingerprinting, referred to as LRCT. The LRCT network effectively leverages the temporal learning advantages of Local Recurrent Networks (Local RNN) and the spatial learning strengths of Convolutional Neural Network (CNN) by designing the local feature extraction block (denoted as LRC Block), which extracts fine-grained local features from 2000-dimensional original sequences and reduces the dimensionality to 125. The network then uses a Transformer Encoder to capture more robust global features from the low-dimensional data. Experimental results comparing LRCT with state-of-the-art WF methods demonstrate that the proposed LRCT achieves 99.34% accuracy in a closed-world scenario, outperforming other benchmark models. Compared to the benchmark WF-Transformer method, the LRCT network reduces the parameter count by half and decreases the average training time per epoch to approximately 20% of the benchmark. Even under concept drift conditions, LRCT maintains an accuracy of 89.9%. In open-world scenarios, LRCT also has better precision and recall.https://ieeexplore.ieee.org/document/10816323/Website fingerprintinglocal recurrent neural networktransformer encodertraffic analysis
spellingShingle Yaojun Ding
Bingxuan Hu
A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting
IEEE Access
Website fingerprinting
local recurrent neural network
transformer encoder
traffic analysis
title A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting
title_full A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting
title_fullStr A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting
title_full_unstemmed A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting
title_short A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting
title_sort multi granularity features representation and dimensionality reduction network for website fingerprinting
topic Website fingerprinting
local recurrent neural network
transformer encoder
traffic analysis
url https://ieeexplore.ieee.org/document/10816323/
work_keys_str_mv AT yaojunding amultigranularityfeaturesrepresentationanddimensionalityreductionnetworkforwebsitefingerprinting
AT bingxuanhu amultigranularityfeaturesrepresentationanddimensionalityreductionnetworkforwebsitefingerprinting
AT yaojunding multigranularityfeaturesrepresentationanddimensionalityreductionnetworkforwebsitefingerprinting
AT bingxuanhu multigranularityfeaturesrepresentationanddimensionalityreductionnetworkforwebsitefingerprinting