A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting
Website fingerprinting (WF) refers to the identification of target websites accessed by users in anonymous communication scenarios, playing a critical role in cybercrime investigation and forensics. In recent years, deep learning-based WF has become a research focus in this field. Although these met...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2025-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10816323/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841563320224579584 |
---|---|
author | Yaojun Ding Bingxuan Hu |
author_facet | Yaojun Ding Bingxuan Hu |
author_sort | Yaojun Ding |
collection | DOAJ |
description | Website fingerprinting (WF) refers to the identification of target websites accessed by users in anonymous communication scenarios, playing a critical role in cybercrime investigation and forensics. In recent years, deep learning-based WF has become a research focus in this field. Although these methods can achieve high accuracy, their performance tends to decline over time. This degradation results from the evolution of network protocol versions and the ongoing development of obfuscation techniques, a phenomenon known as concept drift. To address the problem of concept drift, this paper presents a multi-granularity features representation and dimensionality reduction network for Website Fingerprinting, referred to as LRCT. The LRCT network effectively leverages the temporal learning advantages of Local Recurrent Networks (Local RNN) and the spatial learning strengths of Convolutional Neural Network (CNN) by designing the local feature extraction block (denoted as LRC Block), which extracts fine-grained local features from 2000-dimensional original sequences and reduces the dimensionality to 125. The network then uses a Transformer Encoder to capture more robust global features from the low-dimensional data. Experimental results comparing LRCT with state-of-the-art WF methods demonstrate that the proposed LRCT achieves 99.34% accuracy in a closed-world scenario, outperforming other benchmark models. Compared to the benchmark WF-Transformer method, the LRCT network reduces the parameter count by half and decreases the average training time per epoch to approximately 20% of the benchmark. Even under concept drift conditions, LRCT maintains an accuracy of 89.9%. In open-world scenarios, LRCT also has better precision and recall. |
format | Article |
id | doaj-art-bce88489d8e74aa88d51a7e86eaf7ad8 |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2025-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-bce88489d8e74aa88d51a7e86eaf7ad82025-01-03T00:01:44ZengIEEEIEEE Access2169-35362025-01-011357458710.1109/ACCESS.2024.352289210816323A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website FingerprintingYaojun Ding0https://orcid.org/0009-0007-9169-6056Bingxuan Hu1https://orcid.org/0009-0001-8810-7560School of Artificial Intelligence, Gansu University of Political Science and Law, Lanzhou, ChinaSchool of Cyberspace Security, Gansu University of Political Science and Law, Lanzhou, ChinaWebsite fingerprinting (WF) refers to the identification of target websites accessed by users in anonymous communication scenarios, playing a critical role in cybercrime investigation and forensics. In recent years, deep learning-based WF has become a research focus in this field. Although these methods can achieve high accuracy, their performance tends to decline over time. This degradation results from the evolution of network protocol versions and the ongoing development of obfuscation techniques, a phenomenon known as concept drift. To address the problem of concept drift, this paper presents a multi-granularity features representation and dimensionality reduction network for Website Fingerprinting, referred to as LRCT. The LRCT network effectively leverages the temporal learning advantages of Local Recurrent Networks (Local RNN) and the spatial learning strengths of Convolutional Neural Network (CNN) by designing the local feature extraction block (denoted as LRC Block), which extracts fine-grained local features from 2000-dimensional original sequences and reduces the dimensionality to 125. The network then uses a Transformer Encoder to capture more robust global features from the low-dimensional data. Experimental results comparing LRCT with state-of-the-art WF methods demonstrate that the proposed LRCT achieves 99.34% accuracy in a closed-world scenario, outperforming other benchmark models. Compared to the benchmark WF-Transformer method, the LRCT network reduces the parameter count by half and decreases the average training time per epoch to approximately 20% of the benchmark. Even under concept drift conditions, LRCT maintains an accuracy of 89.9%. In open-world scenarios, LRCT also has better precision and recall.https://ieeexplore.ieee.org/document/10816323/Website fingerprintinglocal recurrent neural networktransformer encodertraffic analysis |
spellingShingle | Yaojun Ding Bingxuan Hu A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting IEEE Access Website fingerprinting local recurrent neural network transformer encoder traffic analysis |
title | A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting |
title_full | A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting |
title_fullStr | A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting |
title_full_unstemmed | A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting |
title_short | A Multi-Granularity Features Representation and Dimensionality Reduction Network for Website Fingerprinting |
title_sort | multi granularity features representation and dimensionality reduction network for website fingerprinting |
topic | Website fingerprinting local recurrent neural network transformer encoder traffic analysis |
url | https://ieeexplore.ieee.org/document/10816323/ |
work_keys_str_mv | AT yaojunding amultigranularityfeaturesrepresentationanddimensionalityreductionnetworkforwebsitefingerprinting AT bingxuanhu amultigranularityfeaturesrepresentationanddimensionalityreductionnetworkforwebsitefingerprinting AT yaojunding multigranularityfeaturesrepresentationanddimensionalityreductionnetworkforwebsitefingerprinting AT bingxuanhu multigranularityfeaturesrepresentationanddimensionalityreductionnetworkforwebsitefingerprinting |