In practical applications, to improve the real-time performance of end-to-end stereo matching networks, the existing methods build cost volume at low resolution. However, with detailed information missing in low-resolution features, it is difficult to get accurate disparity estimation results in weak texture regions. Besides, smooth L1 loss supervision also results in a loss of accuracy in disparity discontinuity areas. To solve these problems, we propose an efficient stereo-matching network based on multiple attention mechanisms and edge optimization, which can achieve high accuracy in a short time. The multi-scale attention module is applied to enhance the feature expression in detail regions. For weak texture areas, we construct a concatenation cost volume and a multi-level patch matching volume, which can be combined to improve the network’s attention to weak texture regions. In terms of edge optimization, we perform bimodal Laplace modeling of the sampled edge points’ disparity distribution and optimize the edge region of the initial disparity map using likelihood loss to obtain sharp edges. The experimental results show that, on the SceneFlow and KITTI datasets, the proposed network improves by 32% and 27% in accuracy compared with BGNet+.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.