12 November 2024 BDD-GAN: a brocade image generation method via dual discriminator and feature fusion
Jiachuan Sheng, Jin Zhou, Shuwen Luo, Zhenyang Sun
Author Affiliations +
Abstract

Text-to-image (T2I) generation techniques have shown promising results using deep learning models. The recent T2I methods mainly use complex and stacked structures to maintain text-image consistency or focus on the generation of global text information while overlooking the finer details. In addition, brocade images differ from real-world images in various aspects, such as their fine texture and complex pattern elements (totem elements). To solve the problems, we first build a text-image dataset that specifically focuses on Chinese brocade. Moreover, by developing an extra pattern discriminator and the multi-scale feature fusion module, we proposed the brocade dual-discriminator generative adversarial network (BDD-GAN). BDD-GAN addresses the challenges associated with text-image consistency, as well as capturing the intricate textures and unique patterns in Chinese brocade-generated images. To fully verify the performance and effectiveness of the proposed method, we provided ablation studies and comparisons with previous works. Experimental results indicate that our method is capable of synthesizing Chinese brocade images that are higher in quality and totem details than those produced by previous algorithms.

© 2024 SPIE and IS&T
Jiachuan Sheng, Jin Zhou, Shuwen Luo, and Zhenyang Sun "BDD-GAN: a brocade image generation method via dual discriminator and feature fusion," Journal of Electronic Imaging 33(6), 063014 (12 November 2024). https://doi.org/10.1117/1.JEI.33.6.063014
Received: 11 June 2024; Accepted: 21 October 2024; Published: 12 November 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
Back to Top