March 3, 2026

Adversarial Imitation Learning with General Function Approximation: Theoretical Analysis and Practical Algorithms

Key Points

Online adversarial imitation learning (AIL) can provably learn near-expert policies under general function approximation, extending theory beyond the tabular and linear settings covered by earlier analyses.
Both the model-free and the model-based variants of optimization-based AIL (OPT-AIL) achieve polynomial expert sample complexity and interaction complexity.
OPT-AIL simplifies practical implementation by requiring only the approximate optimization of two objectives.
Empirical studies indicate that OPT-AIL outperforms previous state-of-the-art deep AIL methods across several challenging tasks.

Abstract

Adversarial imitation learning (AIL), a prominent approach in imitation learning, has achieved significant practical success powered by neural network approximation. However, existing theoretical analyses of AIL are primarily confined to simplified settings, such as tabular and linear function approximation, and involve complex algorithmic designs that impede practical implementation. This creates a substantial gap between theory and practice. This paper bridges this gap by exploring the theoretical underpinnings of online AIL with general function approximation. We introduce a novel framework called optimization-based AIL (OPT-AIL), which performs online optimization for reward learning coupled with optimism-regularized optimization for policy learning. Within this framework, we develop two concrete methods: model-free OPT-AIL and model-based OPT-AIL. Our theoretical analysis demonstrates that both variants achieve polynomial expert sample complexity and interaction complexity for learning near-expert policies. To the best of our knowledge, they represent the first provably efficient AIL methods under general function approximation. From a practical standpoint, OPT-AIL requires only the approximate optimization of two objectives, thereby facilitating practical implementation. Empirical studies demonstrate that OPT-AIL outperforms previous state-of-the-art deep AIL methods across several challenging tasks.
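
To make the phrase "online optimization for reward learning" concrete, the following is a minimal sketch of one projected online gradient step for an adversarially learned reward. It is an illustration under strong assumptions and not the paper's objective: the reward is taken to be linear in hypothetical features phi(s, a), whereas the paper targets general function approximation, and the names `project_l2_ball` and `reward_update` are invented for this example. The optimism-regularized policy step is not shown.

```python
import numpy as np


def project_l2_ball(w, radius=1.0):
    """Project w onto the L2 ball of the given radius (keeps the reward bounded)."""
    norm = np.linalg.norm(w)
    return w if norm <= radius else w * (radius / norm)


def reward_update(w, expert_features, policy_features, step_size):
    """One online gradient step for a linear reward r_w(s, a) = w . phi(s, a).

    Assumption for illustration: the loss is the learner's average reward
    minus the expert's average reward, so descending it raises the reward on
    expert-visited features and lowers it on the learner's features.
    Each input array has shape (num_samples, feature_dim).
    """
    grad = policy_features.mean(axis=0) - expert_features.mean(axis=0)
    return project_l2_ball(w - step_size * grad)


# Toy data: the expert always visits feature 0, the learner feature 1.
expert_phi = np.array([[1.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
policy_phi = np.array([[0.0, 1.0, 0.0], [0.0, 1.0, 0.0]])

w_new = reward_update(np.zeros(3), expert_phi, policy_phi, step_size=0.5)
```

In this toy case the updated reward scores the expert's feature higher than the learner's, which is the signal a policy-learning step would then try to close.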
