The validity of classical hypothesis testing requires the significance level α be fixed before any statistical analysis takes place. This is a stringent requirement. For instance, it prohibits updating α during (or after) an experiment due to changing concern about the cost of false positives, or to reflect unexpectedly strong evidence against the null. Perhaps most disturbingly, witnessing a p-value p ≪ α vs p=α−ϵ for tiny ϵ ' 0 has no (statistical) relevance for any downstream decision-making. Following recent work of Grünwald [1], we develop a theory of post-hoc hypothesis testing, enabling α to be chosen after seeing and analyzing the data. To study “good” post-hoc tests we introduce Γ-admissibility, where Γ is a set of adversaries which map the data to a significance level. We classify the set of Γ-admissible rules for various sets Γ, showing they must be based on e-values, and recover the Neyman-Pearson lemma when Γ is the constant map.

, , ,
doi.org/10.1016/j.ijar.2026.109634
International Journal of Approximate Reasoning
Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands

Chugg, B., Lardy, T., Ramdas, A., & Grünwald, P. (2026). On admissibility in post-hoc hypothesis testing. International Journal of Approximate Reasoning, 191, 109634:1–109634:33. doi:10.1016/j.ijar.2026.109634