Existing post hoc XAI (Explainable Artificial Intelligence) methods produce numerical attributions without symbolic structure (SHAP, LIME), low-coverage local rules (Anchors), or crisp tree surrogates without an interpretable rule-level uncertainty proxy. We present a fuzzy rule-based explanation framework for tabular black-box classifiers, extracting global IF–THEN rules with linguistic labels. This was validated on a 13-dataset benchmark with four model families (Wilcoxon, Friedman, TOST equivalence): (i) prediction-boundary-aware fuzzy partitioning raises mean fidelity from a vanilla Wang–Mendel baseline of 0.736 to 0.893 (+10.4 pp excluding the Breast Cancer outlier; +15.7 pp aggregate, both transparently reported); (ii) fired-rule consequent entropy provides a zero-cost rule-level uncertainty proxy (Spearman ρ = 0.420 with model prediction entropy, significant on 11/12 datasets—moderate by Cohen’s convention, with a 4/12 weak-correlation tail; complementary to probability-entropy and margin baselines). Fidelity is statistically equivalent to tree surrogates on classification (TOST p = 0.002, δ = 0.05) at ≈100% coverage. SHAP/LIME are excluded from the formal stability ranking because the perturbation metric measures the wrapped black-box rather than the attribution vector; cross-explainer comparison is reported in grouped form (full-coverage surrogates vs. local-coverage methods). On continuous regression (California Housing fidelity 0.422 vs. TreeSurrogate 0.840) and XOR-type multi-feature interactions, the framework is structurally weaker, addressed by a planned TSK extension.
A. Tekín (Thu,) studied this question.