KEYWORDS: 3D modeling, Education and training, Image segmentation, Network architectures, 3D mask effects, RGB color model, Binary data, 3D image reconstruction, 3D image processing, Neural networks
In recent years, various methods have been proposed for reconstructing the 3D shape of an object from a single view image. While methods that reconstruct the object as a single model show promising results, they often lack part-level details. On the other hand, part-level reconstruction methods provide recognition of parts but struggle to represent detailed shapes due to the use of a single primitive. To address this issue, this paper proposes a Compositionally Generalizable 3D Structure Prediction Network using Multiple Types of Primitives (CompNet-MTP). CompNet-MTP first estimates the parameters of each type of primitive for every part and then selects the appropriate primitive type to construct the 3D shape of the object. In the experiments, we used cylinders in addition to cuboids, which are commonly used as primitive shapes. Experimental results confirm the effectiveness of the proposed network in handling multiple types of primitives.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.