Poster + Paper
7 June 2024 Synthesizing textured 3D meshes from pose invariant 2D image representations for optimal face recognition
J. Brennan Peace, Benjamin S. Riggan
Author Affiliations +
Conference Poster
Abstract
For facial recognition, textured three-dimensional (3D) meshes offer critical depth information that enhance identification across multiple perspectives. However, 3D facial recognition is often hindered by data limitations, collection environments, and domain shifts. Therefore, we propose a method to synthesize textured 3D facial meshes using existing two-dimensional (2D) face images. Our method demonstrates improved pose invariance by synthesizing faces and leveraging combinations of synthetic and real 3D facial data to improve facial recognition performance. Figure 1 provides an example of synthesized textured meshes from the Facescape1 dataset, which includes 3D faces, textures, and corresponding 2D images. Through synthesizing 3D geometry and occluded/nonoccluded textures, this method leverages pose invariant features from textured 3D meshes using 2D imagery for complex facial recognition tasks. We implement a 2D-to-3D domain adaptation scheme that enables Adaface2—a leading 2D recognition framework—to discriminate 3D facial features learned from Pointnet++.3 This strengthens off-pose identification, highlighting the importance of data synthesis in expanding capabilities. Our proposed method improves pose invariance by leveraging denoising diffusion probabilistic models (DDPMs)4 conditioned on 2D representations to construct 3D textured meshes. This approach presents a robust alternative to existing methodologies,5, 6 emphasizing the advantages of 2D networks to infer 3D features for enhanced recognition. Leveraging DDPMs and domain adaptation broadens the landscape for image recognition systems, signifying diverse data structures, even synthetic, for improving recognition performance under challenging conditions. The results demonstrate enhanced modeling to bridge the gap between 3D meshes and images.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
J. Brennan Peace and Benjamin S. Riggan "Synthesizing textured 3D meshes from pose invariant 2D image representations for optimal face recognition", Proc. SPIE 13035, Synthetic Data for Artificial Intelligence and Machine Learning: Tools, Techniques, and Applications II, 130351F (7 June 2024); https://doi.org/10.1117/12.3013989
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Point clouds

3D image processing

Volume rendering

3D modeling

Facial recognition systems

Denoising

Image processing

RELATED CONTENT


Back to Top