28 March 2022 Optimization of the routing of capsule network based on multiscale information and self-attention mechanism
Yunhao Shang, Ning Xu, Zhenzhou Jin, Xiao Yao
Abstract

The capsule network (CapsNet) is well known as an evolution of the classical convolutional neural network. It is good at recognizing postures, orientations, and textures, and has therefore achieved promising results in areas such as image classification. However, its high computational complexity hinders the construction of larger models. Reducing the complexity of CapsNet routing through the channel attention mechanism has achieved good results, but storage and precision can still be optimized further. We therefore propose an improved routing that combines multiscale routing with a self-attention mechanism, building on traditional channel attention routing. First, we incorporate multiscale information into the routing process, enriching the representation capability of the capsules. Second, self-attention routing captures the global information of the capsules, which leads to fewer misclassification errors. Finally, the proposed self-attention routing is further improved by a soft-threshold strategy, which resists the interference of background noise on complex datasets. Experimental results on MNIST, affNIST, and CIFAR-10 show that, on average, the proposed methods achieve better performance with fewer training parameters than traditional methods.
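The abstract does not give the routing equations, so the following is only a minimal sketch of the two generic building blocks it names, not the authors' method. It assumes the standard soft-thresholding operator and plain scaled dot-product self-attention with identity query/key/value projections. The function names `soft_threshold` and `self_attention` and the threshold `tau` are hypothetical and chosen for illustration.

```python
import math


def soft_threshold(values, tau):
    """Standard soft-thresholding: shrink each value toward zero by tau.

    Values with magnitude <= tau become 0, which is how small
    (noise-like) responses are suppressed. Assumes tau >= 0.
    """
    result = []
    for v in values:
        if v > tau:
            result.append(v - tau)
        elif v < -tau:
            result.append(v + tau)
        else:
            result.append(0.0)
    return result


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(capsules):
    """Scaled dot-product self-attention over a list of capsule vectors.

    Assumption: queries, keys, and values are the capsules themselves
    (identity projections); a trained model would learn projections.
    """
    dim = len(capsules[0])
    outputs = []
    for query in capsules:
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in capsules
        ]
        weights = softmax(scores)
        # Each output is a weighted mix of all capsules (global context).
        mixed = [
            sum(w * cap[j] for w, cap in zip(weights, capsules))
            for j in range(dim)
        ]
        outputs.append(mixed)
    return outputs
```

Under these assumptions, each output capsule is a convex combination of all input capsules, which is one way to read "capturing global information", and the threshold zeroes out weak activations before they influence later stages.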

© 2022 SPIE and IS&T 1017-9909/2022/$28.00
Yunhao Shang, Ning Xu, Zhenzhou Jin, and Xiao Yao "Optimization of the routing of capsule network based on multiscale information and self-attention mechanism," Journal of Electronic Imaging 31(2), 023015 (28 March 2022). https://doi.org/10.1117/1.JEI.31.2.023015
Received: 18 June 2021; Accepted: 4 March 2022; Published: 28 March 2022
KEYWORDS
Convolution
Expectation maximization algorithms
Data modeling
Feature extraction
Neural networks
Performance modeling
Image classification