Genetic programming (GP) approaches are among the state-of-the-art for symbolic regression, the task of constructing symbolic expressions that fit well with data. To find highly accurate symbolic expressions, both the expression structure and any contained real-valued constants, are important. GP-GOMEA, a modern model-based evolutionary algorithm, is one of the leading algorithms for finding accurate, yet compact expressions. Yet, GP-GOMEA does not perform dedicated constant optimization, but rather uses ephemeral random constants. Hence, the accuracy of GP-GOMEA may well still be improved upon by the incorporation of a constant optimization mechanism. Existing research into mixed discrete-continuous optimization with EAs has shown that a simultaneous and well-integrated approach to optimizing both discrete and continuous parts, leads to the best results on a variety of problems, especially when there are interactions between these parts. In this paper, we therefore propose a novel approach where constants in expressions are optimized at the same time as the expression structure by merging the real-valued variant of GOMEA with GP-GOMEA. The proposed approach is compared to other forms of handling constants in GP-GOMEA, and in the context of other commonly used techniques such as linear scaling, restarts, and constant tuning after GP optimization. Our results indicate that our novel approach generally performs best and confirms the importance of simultaneous constant optimization during evolution.

, , ,
doi.org/10.1007/978-3-031-70055-2_15
Lecture Notes in Computer Science , International Conference on Parallel Problem Solving from Nature
18th International Conference, PPSN 2024
Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands

Koch, J., Alderliesten, T., & Bosman, P. (2024). Simultaneous model-based evolution of constants and expression structure in GP-GOMEA for symbolic regression. In Proceedings of PPSN 2024 (pp. 238–255). doi:10.1007/978-3-031-70055-2_15