What question did this study set out to answer?

The aim is to develop a watermarking method that protects deep learning models by activating marks only after updates.

May 27, 2026Open Access

Watermarking for Model Ownership Verification:Invisible at Deployment, Activated by Updates

Key Points

The aim is to develop a watermarking method that protects deep learning models by activating marks only after updates.
Introduced DormMark, a framework utilizing delayed-activation watermarks.
Employed a three-stage training process: embedding, masking, and activation using triggered samples.
Tested on architectures like VGG19 and ResNet with datasets such as CIFAR-10 and GTSRB.
Achieved 100% watermark success rates across tested models and datasets.
Demonstrated high imperceptibility with PSNR > 38 dB and SSIM = 0.99.
Maintained negligible accuracy loss (< 0.04%) and robustness against 80% parameter pruning.

Abstract

Deep neural networks for image classification require protection against unauthorized use and redistribution. Existing watermarking methods suffer from a critical vulnerability: watermarks are always active and detectable, allowing adversaries to identify and remove them before deployment. We propose DormMark, a novel framework for image classification models that introduces delayed-activation watermarks which remain dormant and hidden under deployment-time black-box query auditing during initial deployment, but automatically activate upon fine-tuning. Our approach employs a three-stage training paradigm: (1) embedding watermarks using triggered samples, (2) masking to suppress watermark functionality while preserving its latent presence, and (3) activation through standard fine-tuning without owner intervention. This mechanism exploits neural networks’ forgetting-remembering behaviors during continued training, creating a fragile equilibrium that behaves similarly to a clean model under deployment-time black-box auditing but reliably manifests ownership indicators after modification. We consider a private-key black-box verification setting in which the owner keeps the concrete trigger instances secret. Experiments across multiple architectures (VGG19, ResNet-18/56, DenseNet-121, WideResNet-34) and datasets (CIFAR-10, CIFAR-100, GTSRB) demonstrate 100% watermark success rates, high imperceptibility (PSNR > 38 dB, SSIM = 0.99), negligible accuracy loss (< 0.04%), and robustness against 80% parameter pruning. DormMark represents a paradigm shift from static to conditionally-activated ownership verification, providing a more robust framework for intellectual property protection.

Watermarking for Model Ownership Verification:Invisible at Deployment, Activated by Updates

Key Points

Abstract

Cite This Study