Presentation + Paper
2 April 2024 Enhancing colorectal cancer diagnosis through generative models and vision-based tactile sensing: a Sim2Real study
Siddhartha Kapuria, Naruhiko Ikoma, Sandeep Chinchali, Farshid Alambeigi
Author Affiliations +
Abstract
Towards the goal of developing an informed, intuitive, and generalized artificial intelligence model for the early-stage diagnosis of Colorectal Cancer (CRC), in this work, we present a generative model-based technique to improve the training and generalization performance of machine learning classification algorithms. Through this approach, we address the challenge of acquiring sizable and well-balanced datasets within the clinical domain. Our methodology involves training generative models on already available medical data, learning the latent representations, and finally generating new synthetic samples to be used for downstream tasks. We train dedicated UNet2D-based Denoising Diffusion Probabilistic Models (DDPMs) using our custom dataset, which consists of textural images captured by our novel Vision-based Tactile Sensor (VS-TS), called Hysense. These UNet2D DDPMs are employed to generate synthetic images for each potential class. To thoroughly study the effectiveness of using synthetic images during training, we compared the performance of multiple classification models, ranging from simple to state-of-the-art approaches, with our evaluation focusing solely on real images. Specifically for our dataset, we also extend the use of dedicated UNet2D DDPMs to generate synthetic images of not just possible classes, but also other features that may be present in the image, such as whole or partial contact of sensor with polyp phantoms. Through our experimental analyses, we demonstrated that the utilization of generative models to enrich existing datasets with synthetic images leads to improved classification performance and a reduction in model biases.
Conference Presentation
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Siddhartha Kapuria, Naruhiko Ikoma, Sandeep Chinchali, and Farshid Alambeigi "Enhancing colorectal cancer diagnosis through generative models and vision-based tactile sensing: a Sim2Real study", Proc. SPIE 12931, Medical Imaging 2024: Imaging Informatics for Healthcare, Research, and Applications, 129310N (2 April 2024); https://doi.org/10.1117/12.3006550
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data modeling

Education and training

Polyps

Visual process modeling

Performance modeling

Sensors

Image classification

Back to Top