The rapid evolution of deep generative models (DGMs) has highlighted their great potential in medical imaging research. Recently, it has been claimed that a diffusion generative model: denoising diffusion probabilistic model (DDPM), performs better at image synthesis than the previously popular DGMs: generative adversarial networks (GANs). However, this claim is based on evaluations employing measures intended for natural images, and thus, does not resolve questions about their relevance to medical imaging tasks. To partially address this problem, we performed a series of assessments to evaluate the ability of a DDPM to reproduce diagnostically relevant spatial context. Our findings show that in all our studies, although context was generally well replicated in DDPM-generated ensembles, it was never perfectly reproduced in the entire ensemble.
|