Paper
8 May 2023 Optimizing Winograd convolution on GPUs via multithreaded communication
Yunqing An, Bin Li, Jingde Bu, Ya Gao
Author Affiliations +
Proceedings Volume 12635, Second International Conference on Algorithms, Microchips, and Network Applications (AMNA 2023); 126350W (2023) https://doi.org/10.1117/12.2679935
Event: International Conference on Algorithms, Microchips, and Network Applications 2023, 2023, Zhengzhou, China
Abstract
In advanced High-Performance Computing (HPC), convolution operations take a big proportion in convolutional neural networks, and convolutional neural networks very common in image and video based deep learning applications, because of which, this paper takes improving the performance of convolution operation as the research direction. Convolution can be performance in many ways, such as using mathematical definition to calculate, conversing to Fast Fourier Transform (FFT), conversing to batch matrix multiplication (im2col) or using Winograd algorithm. For small filter, Winograd has unique advantages. AMD based ROCm environment, the implementation of Winograd and an optimization method of Winograd based on multi-thread communication algorithm are introduced in this paper. For the Winograd convolution in ROCm 2.9.0, the speed of the algorithm was increased by more than 150% after optimization in this paper. Under some certain computing power situations, the performance of the optimization algorithm approaches or even exceeds cuDNN and MIOpen.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yunqing An, Bin Li, Jingde Bu, and Ya Gao "Optimizing Winograd convolution on GPUs via multithreaded communication", Proc. SPIE 12635, Second International Conference on Algorithms, Microchips, and Network Applications (AMNA 2023), 126350W (8 May 2023); https://doi.org/10.1117/12.2679935
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Convolution

Mathematical optimization

Tunable filters

Data processing

Evolutionary algorithms

Matrix multiplication

Convolutional neural networks

Back to Top