Learning a formula of interpretability to learn interpretable formulas

Virgolin, Marco; De Lorenzo, Andrea; Medvet, Eric; Randone, Francesca

doi:10.1007/978-3-030-58115-2_6

2020

Presented at the 16th International Conference, PPSN 2020 (September 2020), Leiden, The Netherlands

Many risk-sensitive applications require Machine Learning (ML) models to be interpretable. Attempts to obtain interpretable models typically rely on tuning, by trial and error, hyper-parameters of model complexity that are only loosely related to interpretability. We show that it is instead possible to take a meta-learning approach: an ML model of non-trivial Proxies of Human Interpretability (PHIs) can be learned from human feedback, and this model can then be incorporated within an ML training process to directly optimize for interpretability. We show this for evolutionary symbolic regression. We first design and distribute a survey aimed at finding a link between features of mathematical formulas and two established PHIs, simulatability and decomposability. Next, we use the resulting dataset to learn an ML model of interpretability. Lastly, we query this model to estimate the interpretability of evolving solutions within bi-objective genetic programming. We perform experiments on five synthetic and eight real-world symbolic regression problems, comparing to the traditional use of solution size minimization. The results show that the use of our model leads to formulas that are, for the same level of accuracy-interpretability trade-off, either significantly more accurate or equally accurate. Moreover, the formulas are also arguably more interpretable. Given the very positive results, we believe that our approach represents an important stepping stone for the design of next-generation interpretable (evolutionary) ML algorithms.
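The last two steps of the abstract (querying a model of interpretability, then using it as a second objective) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the tuple encoding of formulas, the three features, the `WEIGHTS` values, and the function names are hypothetical and are not the model or the coefficients learned in the paper. It only shows the general shape of scoring formulas with a linear model of formula features and keeping the candidates that are non-dominated in error and estimated interpretability.

```python
from typing import Union

# Hypothetical encoding: a formula is a nested tuple (operator, child, ...)
# or a leaf (a variable name or a numeric constant).
Formula = Union[tuple, str, float]

ARITHMETIC = {"+", "-", "*", "/"}


def features(f: Formula) -> dict:
    """Count nodes, operators, and non-arithmetic operators in a formula tree."""
    if not isinstance(f, tuple):
        return {"size": 1, "ops": 0, "non_arith": 0}
    op, *children = f
    total = {"size": 1, "ops": 1, "non_arith": 0 if op in ARITHMETIC else 1}
    for child in children:
        for key, value in features(child).items():
            total[key] += value
    return total


# Placeholder weights chosen for illustration only, NOT learned from survey data.
WEIGHTS = {"size": -1.0, "ops": -1.0, "non_arith": -5.0}


def interpretability(f: Formula) -> float:
    """Linear estimate of interpretability: higher means easier to read."""
    feats = features(f)
    return sum(WEIGHTS[k] * feats[k] for k in WEIGHTS)


def pareto_front(candidates: list) -> list:
    """Keep formulas not dominated on (lower error, higher interpretability).

    `candidates` is a list of (formula, error) pairs.
    """
    scored = [(f, err, interpretability(f)) for f, err in candidates]
    front = []
    for f, err, interp in scored:
        dominated = any(
            e2 <= err and i2 >= interp and (e2 < err or i2 > interp)
            for _, e2, i2 in scored
        )
        if not dominated:
            front.append(f)
    return front
```

In an actual bi-objective genetic programming run, a selection scheme would apply this kind of non-dominated filtering to each generation; the sketch only filters a fixed list of candidates.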

Additional Metadata
Keywords	Explainable artificial intelligence, Genetic programming, Interpretable machine learning, Multi-objective, Symbolic regression
Persistent URL	doi.org/10.1007/978-3-030-58115-2_6
Series	Lecture Notes in Computer Science/Lecture Notes in Artificial Intelligence
Conference	16th International Conference, PPSN 2020
Organisation	Centrum Wiskunde & Informatica, Amsterdam (CWI), The Netherlands
Citation	Virgolin, M., De Lorenzo, A., Medvet, E., & Randone, F. (2020). Learning a formula of interpretability to learn interpretable formulas. In International Conference on Parallel Problem Solving from Nature, PPSN 2020 (pp. 79–93). doi:10.1007/978-3-030-58115-2_6
