What does this research mean for the field?

An interpretable knowledge distillation framework utilizing multi-granular semantic alignment, attention-gated distillation, and concept activation preservation achieves a 13.7x compression ratio while maintaining high accuracy retention and preserving human-interpretable decision logic. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The study aims to develop a knowledge distillation framework that maintains accuracy, efficiency, and interpretability for resource-constrained environments.

May 28, 2026Open Access

Interpretability-preserving knowledge distillation via multi-granular feature alignment for resource-efficient CNNs

Key Points

The study aims to develop a knowledge distillation framework that maintains accuracy, efficiency, and interpretability for resource-constrained environments.
Implemented multi-granular semantic alignment for hierarchical feature preservation.
Employed attention-gated distillation to ensure spatial reasoning consistency.
Preserved concept activation for enhancing human interpretability in decision logic.
Achieved 92% accuracy and 99.57% retention on CIFAR-10, with 0.576 average saliency similarity.
Obtained 86.94% accuracy and 89.54% retention on CUB-200-2011, while achieving a 13.7× compression ratio.

Abstract

The application of deep learning models in resource-constrained environments requires a trade-off among accuracy, efficiency, and interpretability a trilemma frequently neglected in conventional knowledge distillation (KD) approaches. This paper introduces an interpretable knowledge distillation framework that transcends these goals through three innovations: (1) multi-granular semantic alignment for hierarchical feature structure preservation, (2) attention-gated distillation to impose spatial reasoning consistency, and (3) concept activation preservation for human-interpretable decision logic. Measured on CIFAR-10 and CUB-200-2011 benchmark datasets, our method obtains 92% accuracy (99. 57% retention) and 86. 94% accuracy (89. 54% retention), respectively, with 0. 576 average saliency similarity to the teacher model. By lowering the complexity of the model by 13. 7 compression ratio without forgoing interpretability, our method facilitates the deployment of interpretable, resource-efficient models in safety-critical settings like medical diagnosis and ecological surveillance. This paper closes the performance explainability gap, pushing the frontiers of reliable AI for edge computing. This framework maintains competitive performance relative to dominant baselines, whilst also optimizing all three aspects of the trilemma, something which prior work has not tackled.

Ask AI

Mark Helpful

Bookmark

Relay

View Full Paper