Generalization in fully-connected neural networks for time series forecasting

Borovykh, Anastasia; Oosterlee, Kees; Bohte, Sander

doi:10.1016/j.jocs.2019.07.007

A.I. Borovykh (Anastasia), C.W. Oosterlee (Kees) and S.M. Bohte (Sander)

2019-09-01

Generalization in fully-connected neural networks for time series forecasting

Journal of Computational Science , Volume 36 p. 101020:1- 101020:15

In this paper we study the generalization capabilities of fully-connected neural networks trained in the context of time series forecasting. Time series do not satisfy the typical assumption in statistical learning theory of the data being i.i.d. samples from some data-generating distribution. We use the input and weight Hessians, that is the smoothness of the learned function with respect to the input and the width of the minimum in weight space, to quantify a network's ability to generalize to unseen data. While such generalization metrics have been studied extensively in the i.i.d. setting of for example image recognition, here we empirically validate their use in the task of time series forecasting. Furthermore we discuss how one can control the generalization capability of the network by means of the training process using the learning rate, batch size and the number of training iterations as controls. Using these hyperparameters one can efficiently control the complexity of the output function without imposing explicit constraints.

Additional Metadata
Keywords	Deep learning, Forecasting, Generalization, Neural networks, Time series
Persistent URL	doi.org/10.1016/j.jocs.2019.07.007
Journal	Journal of Computational Science
Organisation	Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Borovykh, A., Oosterlee, K., & Bohte, S. (2019). Generalization in fully-connected neural networks for time series forecasting. Journal of Computational Science, 36, 101020:1–101020:15. doi:10.1016/j.jocs.2019.07.007

View at Publisher

Full Text ( Final Version , 3mb )

Generalization in fully-connected neural networks for time series forecasting

Publication

Publication

Address

Publishing at CWI

Questions or comments?

Generalization in fully-connected neural networks for time series forecasting

Publication

Publication

Workflow

Workflow

Add Content