Two-sample tests that are safe under optional stopping

Turner, Rosanne; Ly, Alexander; Grünwald, Peter

We develop E variables for testing whether two data streams come from the same source or not, and more generally, whether the difference between the sources is larger than some minimal effect size. These E variables lead to tests that remain safe, i.e. keep their Type-I error guarantees, under flexible sampling scenarios such as optional stopping and continuation. In special cases our E variables also have an optimal 'growth' property under the alternative. We illustrate the generic construction through the special case of 2×2 contingency tables, where we also allow for the incorporation of different restrictions on a composite alternative. Comparisons to p-value analysis in simulations and a real-world example show that E variables, through their flexibility, often allow for early stopping of data collection while retaining power similar to that of classical methods.
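The idea of an E variable that stays safe under optional stopping can be sketched for two Bernoulli streams. The following is a minimal illustration, not the authors' implementation: it assumes data arrive in blocks of one observation per stream, that the alternative is a fixed pair (theta_a, theta_b) chosen in advance, and that the null is represented by the midpoint of the two. All function names are hypothetical. With these choices the block E variable has expectation at most 1 under every common success probability, so the running product can be monitored after each block and the null rejected as soon as it reaches 1/alpha.

```python
from itertools import product


def bernoulli_pmf(y, theta):
    """Probability of outcome y (0 or 1) under success probability theta."""
    return theta if y == 1 else 1.0 - theta


def block_e_value(ya, yb, theta_a, theta_b):
    """E-value for one block: one observation from stream a and one from stream b.

    Assumption for this sketch: the alternative (theta_a, theta_b) is fixed in
    advance, and the null is represented by the midpoint of the two.
    """
    theta_0 = 0.5 * (theta_a + theta_b)
    numerator = bernoulli_pmf(ya, theta_a) * bernoulli_pmf(yb, theta_b)
    denominator = bernoulli_pmf(ya, theta_0) * bernoulli_pmf(yb, theta_0)
    return numerator / denominator


def null_expectation(p, theta_a, theta_b):
    """Exact expectation of the block e-value when both streams are Bernoulli(p)."""
    return sum(
        bernoulli_pmf(ya, p) * bernoulli_pmf(yb, p)
        * block_e_value(ya, yb, theta_a, theta_b)
        for ya, yb in product((0, 1), repeat=2)
    )


def safe_two_sample_test(blocks, theta_a, theta_b, alpha=0.05):
    """Multiply block e-values and stop as soon as the product reaches 1/alpha.

    Returns (rejected, number_of_blocks_used, final_e_value).
    """
    e_process = 1.0
    n_blocks = 0
    for n_blocks, (ya, yb) in enumerate(blocks, start=1):
        e_process *= block_e_value(ya, yb, theta_a, theta_b)
        if e_process >= 1.0 / alpha:
            return True, n_blocks, e_process
    return False, n_blocks, e_process
```

For example, calling safe_two_sample_test on a list of (ya, yb) pairs with theta_a=0.3 and theta_b=0.7 rejects once the accumulated evidence reaches 20 at alpha = 0.05, and otherwise allows data collection to continue without inflating the Type-I error.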

Additional Metadata
Organisation	Machine Learning
Citation	Turner, R., Ly, A., & Grünwald, P. (2021). Two-sample tests that are safe under optional stopping.

View at arXiv


See Also
software|data	safestats: Safe Anytime-Valid Inference. R.J. Turner (Rosanne), A. Ly (Alexander), M.F. Pérez (Muriel), J.A. ter Schure (Judith) and P.D. Grünwald (Peter)
