A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Grünwald, Peter; Mehta, Nishant

We present a novel notion of complexity that interpolates between and generalizes some classic existing complexity notions in learning theory: for estimators like empirical risk minimization (ERM) with arbitrary bounded losses, it is upper bounded in terms of data-independent Rademacher complexity; for generalized Bayesian estimators, it is upper bounded by the data-dependent information complexity (also known as stochastic or PAC-Bayesian, KL(posterior∥prior) complexity. For (penalized) ERM, the new complexity reduces to (generalized) normalized maximum likelihood (NML) complexity, i.e. a minimax log-loss individual-sequence regret. Our first main result bounds excess risk in terms of the new complexity. Our second main result links the new complexity via Rademacher complexity to L₂(P) entropy, thereby generalizing earlier results of Opper, Haussler, Lugosi, and Cesa-Bianchi who did the log-loss case with L_∞. Together, these results recover optimal bounds for VC- and large (polynomial entropy) classes, replacing localized Rademacher complexity by a simpler analysis which almost completely separates the two aspects that determine the achievable rates: 'easiness' (Bernstein) conditions and model complexity.

Additional Metadata
Organisation	Machine Learning
Citation APA APA Style APA-ALL Style AAA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Grünwald, P.& Mehta, N. (2017). A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity.

View at arXiv

Free Full Text ( Final Version , 933kb )

See Also
inProceedings A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity P.D. Grünwald (Peter) and N.A. Mehta (Nishant)

A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Publication

Publication

inProceedings
A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Address

CWI researchers

Questions or comments?

A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Publication

Publication

inProceedings A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

Workflow

Workflow

Add Content

inProceedings
A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity