March 3, 2026Open Access

When Pigs Fly: Evaluating Semantic Overconfidence in Deep Neural Network Classifiers

Key Points

Semantic overconfidence affects the predictive probabilities of deep neural networks, leading to inaccurate classifications despite irrelevant features present.
Experiments with generative models revealed how this phenomenon persists across various datasets of factual and counterfactual image pairs.
Neural networks displayed a notable challenge in adapting to irrelevant features, maintaining constant output probabilities regardless of semantic relevance.
Application of Bayesian methods appears to offer a promising avenue for addressing this semantic overconfidence, suggesting future directions for model improvement.

Abstract

We introduce semantic overconfidence as the phenomenon where a model’s output probability remains invariant regardless of the presence or absence of a semantically strong but class-irrelevant features in the image. We adopt generative models to introduce such types of features and create three datasets of factual and counterfactual pairs to study model predictive probabilities. Our experiments indicate that neural networks indeed suffer from this type of semantic challenge. We also provide empirical evidence suggesting that Bayesian methods have the potential to alleviate this problem.

When Pigs Fly: Evaluating Semantic Overconfidence in Deep Neural Network Classifiers

Key Points

Abstract

Cite This Study