Combinatorial Tiling for Sparse Neural Networks
Publication date
2020
Authors
Pawłowski,, Filip
Bisseling, R.H.
Uçar , Bora
Yzelman, Albert-Jan N.
Editors
Advisors
Supervisors
Document Type
Part of book
Metadata
Show full item recordCollections
License
Abstract
Sparse deep neural networks (DNNs) emerged as the result of search for networks with less storage and lower computational complexity. The sparse DNN inference is the task of using such trained DNN networks to classify a batch of input data. We propose an efficient, hybrid model- and data-parallel DNN inference using hypergraph models and partitioners. We exploit tiling and weak synchronization to increase cache reuse, hide load imbalance, and hide synchronization costs. Finally, a blocking approach allows application of this new hybrid inference procedure for deep neural networks. We initially experiment using the hybrid tiled inference approach only, using the first five layers of networks from the IEEE HPEC 2019 Graph Challenge, and attain up to 2 x speedup versus a data-parallel baseline.
Keywords
Neural networks, Sparse matrices, Synchronization, Task analysis, Load modeling, Taverne
Citation
Pawłowski, F , Bisseling , R H , Uçar , B & Yzelman , A-J N 2020 , Combinatorial Tiling for Sparse Neural Networks . in Proceedings 2020 IEEE High Performance Extreme Computing Conference (HPEC) . IEEE Computer Society Publications , pp. 1-7 . https://doi.org/10.1109/HPEC43674.2020.9286154