What question did this study set out to answer?

This work investigates the relationship between model extraction attacks and copyright violations, highlighting their interconnection.

February 27, 2026Open Access

Model Extraction as a Dual Technical–Legal Attack Surface: Transitive Copyright Harm from Surrogate Models

Key Points

This work investigates the relationship between model extraction attacks and copyright violations, highlighting their interconnection.
Conducted a structural analysis of model extraction and copyright infringement pathways.
Examined the repercussions of surrogate models on original copyright protections.
Discussed implications for governance and standards development in AI.
Model extraction can lead to the unauthorized reproduction of copyrighted content.
Identified a linkage between technical breaches and legal repercussions for original content creators.
Proposed changes needed in governance and liability frameworks to address these risks.

Abstract

Model extraction attacks are typically framed as threats to model confidentiality, competitive advantage, and safety-layer integrity. Separately, memorization in large models is treated as a privacy or safety concern, while copyright litigation focuses on training-data provenance and output-level infringement. This paper argues that these domains cannot be treated independently. When an adversary extracts or approximates a deployed model, the resulting surrogate can reproduce memorized training data—including copyrighted text, code, and other creative works—without the guardrails present in the original deployment context. This creates a unified attack surface in which a technical breach (extraction) induces downstream legal harm (copyright infringement) to third-party authors who were never part of the original threat model. We present a structural analysis of this cross-domain propagation pathway and outline the implications for governance, liability, and standards development.

Model Extraction as a Dual Technical–Legal Attack Surface: Transitive Copyright Harm from Surrogate Models

Key Points

Abstract

Cite This Study