Exploring the Search Space of Neural Network Combinations obtained with Efficient Model Stitching

Publication date

2024-07-14

Authors

Guijt, Arthur
Thierens, D.ISNI 0000000390770297
Alderliesten, Tanja
Bosman, Peter A.N.

Editors

Advisors

Supervisors

Document Type

Part of book
Open Access logo

License

cc_by

Abstract

Machine learning models can be made more performant and their predictions more consistent by creating an ensemble. Each neural network in an ensemble commonly performs its own feature extraction. These features are often highly similar, leading to potentially many redundant calculations. Unifying these calculations (i.e., reusing some of them) would be desirable to reduce computational cost. However, splicing two trained networks is non-trivial because architectures and feature representations typically differ, leading to a performance breakdown. To overcome this issue, we propose to employ stitching, which introduces new layers at crossover points. Essentially, a new network consisting of the two basis networks is constructed. In this network, new links between the two basis networks are created through the introduction and training of stitches. New networks can then be created by choosing which stitching layers to (not) use, thereby selecting a subnetwork. Akin to a supernetwork, assessing the performance of a selected subnetwork is efficient, as only their evaluation on data is required. We experimentally show that our proposed approach enables finding networks that represent novel trade-offs between performance and computational cost compared to classical ensembles, with some new networks even dominating the original networks.

Keywords

ensembles, neural architecture search, neuroevolution, stitching, Artificial Intelligence, Software, Control and Optimization, Discrete Mathematics and Combinatorics, Logic

Citation

Guijt, A, Thierens, D, Alderliesten, T & Bosman, P A N 2024, Exploring the Search Space of Neural Network Combinations obtained with Efficient Model Stitching. in GECCO 2024 Companion - Proceedings of the 2024 Genetic and Evolutionary Computation Conference Companion. GECCO 2024 Companion - Proceedings of the 2024 Genetic and Evolutionary Computation Conference Companion, Association for Computing Machinery, pp. 1914-1923, 2024 Genetic and Evolutionary Computation Conference Companion, GECCO 2024 Companion, Melbourne, Australia, 14/07/24. https://doi.org/10.1145/3638530.3664131, conference