Beyond Neyman-Pearson: E-values enable hypothesis testing with a data-driven alpha

Grünwald, Peter

A standard practice in statistical hypothesis testing is to mention the p-value alongside the accept/reject decision. We show the advantages of mentioning an e-value instead. With p-values, it is not clear how to use an extreme observation (e.g. p ≪α) for getting better frequentist decisions. With e-values it is straightforward, since they provide Type-I risk control in a generalized Neyman-Pearson setting with the decision task (a general loss function) determined post-hoc, after observation of the data -- thereby providing a handle on `roving α's'. When Type-II risks are taken into consideration, the only admissible decision rules in the post-hoc setting turn out to be e-value-based. Similarly, if the loss incurred when specifying a faulty confidence interval is not fixed in advance, standard confidence intervals and distributions may fail whereas e-confidence sets and e-posteriors still provide valid risk guarantees. Sufficiently powerful e-values have by now been developed for a range of classical testing problems. We discuss the main challenges for wider development and deployment.

Additional Metadata
Organisation	Machine Learning
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Grünwald, P. (2024). Beyond Neyman-Pearson: E-values enable hypothesis testing with a data-driven alpha.

View at arXiv

Free Full Text ( Final Version , 396kb )

See Also
article Beyond Neyman-Pearson: E-values enable hypothesis testing with a data-driven alpha P.D. Grünwald (Peter)

Beyond Neyman-Pearson: E-values enable hypothesis testing with a data-driven alpha

Publication

Publication

article
Beyond Neyman-Pearson: E-values enable hypothesis testing with a data-driven alpha

Address

CWI researchers

Questions or comments?

Beyond Neyman-Pearson: E-values enable hypothesis testing with a data-driven alpha

Publication

Publication

article Beyond Neyman-Pearson: E-values enable hypothesis testing with a data-driven alpha

Workflow

Workflow

Add Content

article
Beyond Neyman-Pearson: E-values enable hypothesis testing with a data-driven alpha