A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Grünwald, Peter; Mehta, Nishant

We present a novel notion of complexity that interpolates between and generalizes some classic existing complexity notions in learning theory: for estimators like empirical risk minimization (ERM) with arbitrary bounded losses, it is upper bounded in terms of data-independent Rademacher complexity; for generalized Bayesian estimators, it is upper bounded by the data-dependent information complexity (also known as stochastic or PAC-Bayesian, KL(posterior∥prior) complexity. For (penalized) ERM, the new complexity reduces to (generalized) normalized maximum likelihood (NML) complexity, i.e. a minimax log-loss individual-sequence regret. Our first main result bounds excess risk in terms of the new complexity. Our second main result links the new complexity via Rademacher complexity to L2(P) entropy, thereby generalizing earlier results of Opper, Haussler, Lugosi, and Cesa-Bianchi who did the log-loss case with L∞. Together, these results recover optimal bounds for VC- and large (polynomial entropy) classes, replacing localized Rademacher complexity by a simpler analysis which almost completely separates the two aspects that determine the achievable rates: 'easiness' (Bernstein) conditions and model complexity.

Additional Metadata
Project	Safe Bayesian Inference: A Theory of Misspecification based on Statistical Learning
Conference	Algorithmic Learning Theory
Grant	This work was funded by the The Netherlands Organisation for Scientific Research (NWO); grant id nwo/617.001.651 - Safe Bayesian Inference: A Theory of Misspecification based on Statistical Learning
Organisation	Machine Learning
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Grünwald, P., & Mehta, N. (2019). A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity. In Proceedings ALT (Algorithmic Learning Theory) (pp. 433–465).

Free Full Text ( Final Version , 1mb )

See Also
techReport A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity P.D. Grünwald (Peter) and N.A. Mehta (Nishant)

A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Publication

Publication

techReport
A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Address

CWI researchers

Questions or comments?

A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Publication

Publication

techReport A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Workflow

Workflow

Add Content

techReport
A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity