What question did this study set out to answer?

This research aims to develop a model-free method for estimating p-values in genetic association studies more efficiently.

May 13, 2026 · Open Access

Kernel-smoothed permutation for extreme p-value estimation in genetic association studies

Key Points

The study aims to develop a model-free method that estimates p-values in genetic association studies more efficiently.
The authors propose Kernel-smoothed permutation, a new approach that forms null distributions using a kurtosis-driven transformation followed by kernel density estimation.
They compared Kernel-smoothed permutation with Naïve permutation across three test statistics: the t-test, the sequence kernel association test, and the chi-squared test.
They applied the method to a real-world genome-wide association study of a Crohn’s disease cohort.
Kernel-smoothed permutation substantially reduced the number of required permutations while maintaining similar or higher accuracy than Naïve permutation.
The method makes multiple-testing correction more efficient for the extremely small p-values typical of genome-wide association studies.

Abstract

In genetic association studies, permutation tests serve as a cornerstone for estimating p-values. This is because researchers may design new test statistics without a known closed-form distribution, or because the assumptions of a well-established test may not hold. However, the number of permutations required grows in inverse proportion to the actual p-value: the smaller the p-value, the more permutations are needed to resolve it. In genome-wide association studies, where multiple-testing corrections are routinely conducted, the actual p-values are extremely small, so the required number of permutations may exceed the available computational resources. Existing models that reduce the required number of permutations all assume a specific form of the test statistic in order to exploit its statistical properties. We propose Kernel-smoothed permutation, a model-free method that is applicable to any statistic. Our tool forms the null distribution of the test statistic using a kurtosis-driven transformation followed by kernel density estimation (KDE). We compared Kernel-smoothed permutation with Naïve permutation using statistics with known closed-form null distributions. Based on three test statistics frequently used in association studies, i.e., the t-test, the sequence kernel association test (SKAT), and the chi-squared test, we demonstrated that our model reduced the required number of permutations by an order of magnitude with similar or higher accuracy. In a real-world genome-wide association study (GWAS) analysis of a Crohn’s disease cohort, we further confirmed that our model substantially outperforms Naïve permutation.
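To make the permutation bottleneck concrete, the following is a minimal Python sketch rather than the authors' implementation. It shows why a naïve permutation p-value cannot fall below 1/(B+1) for B permutations, and how smoothing the permuted statistics with a kernel density estimate can, in principle, give a tail probability below that floor. The function names, the Welch t statistic, the Gaussian kernel, and the rule-of-thumb bandwidth are all assumptions chosen for illustration. The kurtosis-driven transformation described in the abstract is not specified here, so it is deliberately left out.

```python
import math
import random
import statistics


def t_statistic(x, y):
    """Welch two-sample t statistic (illustrative choice of test statistic)."""
    vx, vy = statistics.variance(x), statistics.variance(y)
    diff = statistics.fmean(x) - statistics.fmean(y)
    return diff / math.sqrt(vx / len(x) + vy / len(y))


def permutation_null(x, y, n_perm, rng):
    """Absolute t statistics recomputed on label-shuffled data."""
    pooled = list(x) + list(y)
    null = []
    for _ in range(n_perm):
        rng.shuffle(pooled)
        null.append(abs(t_statistic(pooled[:len(x)], pooled[len(x):])))
    return null


def naive_p_value(observed, null):
    """Empirical p-value with the +1 correction; never below 1/(B+1)."""
    exceed = sum(1 for s in null if s >= observed)
    return (exceed + 1) / (len(null) + 1)


def kde_tail_p_value(observed, null, bandwidth=None):
    """Upper-tail probability under a Gaussian KDE fitted to the null sample.

    Assumption: plain Gaussian kernel with a normal-reference bandwidth.
    The paper's kurtosis-driven transformation is NOT applied here.
    """
    n = len(null)
    if bandwidth is None:
        bandwidth = 1.06 * statistics.stdev(null) * n ** (-1 / 5)
    # Each kernel contributes its own survival function P(Z > (observed - s) / h).
    scale = bandwidth * math.sqrt(2)
    return sum(0.5 * math.erfc((observed - s) / scale) for s in null) / n


# Hypothetical simulated data: two groups with a clear mean difference.
rng = random.Random(0)
x = [rng.gauss(1.5, 1.0) for _ in range(30)]
y = [rng.gauss(0.0, 1.0) for _ in range(30)]

observed = abs(t_statistic(x, y))
null = permutation_null(x, y, 500, rng)

p_naive = naive_p_value(observed, null)   # bounded below by 1/501
p_kde = kde_tail_p_value(observed, null)  # smoothed tail, not bounded by 1/501
print(p_naive, p_kde)
```

With 500 permutations the naïve estimate can resolve nothing smaller than 1/501, which is far from the thresholds used after genome-wide multiple-testing correction. The smoothed estimate removes that floor, but its value in the far tail depends heavily on the kernel and bandwidth, which is presumably why the method pairs KDE with a transformation of the statistics before smoothing.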
