To address the limitations of insufficient feature representation, large model size, and high deployment cost in network traffic classification, a lightweight classification framework based on multi-teacher knowledge distillation is proposed. The framework consists of two heterogeneous teacher networks and a compact student network to enable end-to-end traffic classification under constrained computational resources. The teacher networks incorporate complementary spatio-temporal modeling strategies, including a bidirectional temporal convolutional network (BiTCN) enhanced with attention mechanisms and convolutional neural network (CNN), and a parallel spatio-temporal fusion architecture integrating bidirectional long short-term memory (BiLSTM) and CNN. Knowledge from the teacher ensemble is distilled into a lightweight CNN-based student network through soft-target supervision, leading to improved generalization capability with significantly reduced model complexity. Experimental results demonstrate that effective knowledge transfer is achieved while reducing model parameters by more than 80%, and performance gains of about 1–3% are obtained compared with baseline methods. These results indicate strong potential for practical deployment in resource-constrained network environments.
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Tue,) studied this question.
synapsesocial.com/papers/69d893896c1944d70ce047e3 — DOI: https://doi.org/10.3390/fi18040197
Zhiyuan Li
Beijing Normal University - Hong Kong Baptist University United International College
Yonghao Feng
Jiangsu University
Future Internet
Jiangsu University
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: