We demonstrate how e-values simplify both experimental design and the inference process. With e-values researchers can perform anytime-valid tests and construct confidence intervals that maintain type I error control regardless of the sample size. This enables real-time monitoring of evidence as data are collected, permitting early termination of experiments without intolerably inflating the risk of false discoveries. Early stopping not only conserves resources, but also mitigates risk for participants in clinical settings. Anytime-valid tests allow for optional continuation, that is, the extension of an experiment, for instance if more funds become available, or even if the evidence looks promising and the funding agency, a reviewer, or an editor urges the experimenter to collect more data. Analogously, a researcher can be assured that a 95% anytime-valid confidence interval will, with at least 95% probability, cover the true effect size regardless of how, or even if, data collection is stopped. We use the free and open-source software package safestats implemented in R to illustrate the practical benefits of this novel inference framework.

, , , ,
doi.org/10.31234/osf.io/h5vae_v3
Machine Learning

Böhm, U., Ly, A., Grünwald, P., Ramdas, A., & van Ravenzwaaij, D. (2025). A tutorial on safe anytime-valid inference: Practical maximally flexible sampling designs for experiments based on e-values. doi:10.31234/osf.io/h5vae_v3