antic segmentation of aerial images is a critical task in various domains such as urban planning, and monitoring deforestation or critical infrastructure. However, the annotation process required for training accurate segmentation models is often time-consuming and labor-intensive. This paper presents a novel approach to address this challenge by leveraging the power of clustering techniques applied to the embeddings obtained from a SimCLRv2 model pretrained on the ImageNet dataset. By using this clustering approach, fewer training samples are needed, and the annotation only needs to be done for each cluster instead of each pixel in the image, significantly reducing the annotation time. Our proposed method uses SimCLRv2 to obtain rich feature representations (embeddings) from a dataset of unlabeled aerial images. These embeddings are then subjected to clustering, enabling the grouping of semantically similar image regions. In addition to directly using these clusters as class labels, we can treat these clusters as pseudo-classes, allowing us to construct a pseudo-label dataset for fine-tuning a segmentation network. Through experiments conducted on two benchmark aerial image datasets (Potsdam and Vaihingen), we demonstrate the effectiveness of our approach in achieving segmentation results in line with similar works on few-shot segmentation while significantly reducing the annotation effort required, thereby highlighting its practical applicability. Overall, the combination of SimCLRv2 embeddings and clustering techniques presents a promising avenue for achieving accurate image segmentation while minimizing the annotation burden, making it highly relevant for remote sensing applications and aerial imagery analysis.
|