Symbolic regression is NP-hard

Virgolin, Marco; Pissis, Solon

Symbolic regression (SR) is the task of learning a model of data in the form of a mathematical expression. By their nature, SR models have the potential to be accurate and human-interpretable at the same time. Unfortunately, finding such models, i.e., performing SR, appears to be a computationally intensive task. Historically, SR has been tackled with heuristics such as greedy or genetic algorithms and, while some works have hinted at the possible hardness of SR, no proof has yet been given that SR is, in fact, NP-hard. This begs the question: Is there an exact polynomial-time algorithm to compute SR models? We provide evidence suggesting that the answer is probably negative by showing that SR is NP-hard.

Additional Metadata
Journal	Transactions on Machine Learning Research
Organisation	Evolutionary Intelligence
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Virgolin, M., & Pissis, S. (2022). Symbolic regression is NP-hard. Transactions on Machine Learning Research, 10, 1–11.

Free Full Text ( Final Version , 439kb )

Symbolic regression is NP-hard

Publication

Publication

Address

CWI researchers

Questions or comments?

Symbolic regression is NP-hard

Publication

Publication

Workflow

Workflow

Add Content