To address the limitations of insufficient feature representation, large model size, and high deployment cost in network traffic classification, a lightweight classification framework based on multi-teacher knowledge distillation is proposed. The framework consists of two heterogeneous teacher networks and a compact student network to enable end-to-end traffic classification under constrained computational resources. The teacher networks incorporate complementary spatio-temporal modeling strategies, including a bidirectional temporal convolutional network (BiTCN) enhanced with attention mechanisms and convolutional neural network (CNN), and a parallel spatio-temporal fusion architecture integrating bidirectional long short-term memory (BiLSTM) and CNN. Knowledge from the teacher ensemble is distilled into a lightweight CNN-based student network through soft-target supervision, leading to improved generalization capability with significantly reduced model complexity. Experimental results demonstrate that effective knowledge transfer is achieved while reducing model parameters by more than 80%, and performance gains of about 1–3% are obtained compared with baseline methods. These results indicate strong potential for practical deployment in resource-constrained network environments.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhiyuan Li
Beijing Normal University - Hong Kong Baptist University United International College
Yonghao Feng
Jiangsu University
Future Internet
Jiangsu University
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Tue,) studied this question.
synapsesocial.com/papers/69d893896c1944d70ce047e3 — DOI: https://doi.org/10.3390/fi18040197
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: