Referring Image Segmentation for Remote Sensing Data
Abstract
In this paper, we present a new task: referring image segmentation for remote sensing data, which targets segmenting out specific objects referred to by natural language. Due to the absence of a dataset for this task, we construct a dataset based on the SkyScapes dataset. Our dataset is designed with linguistically structured expressions that focus on object categories, attributes, and spatial relationships, enabling the generation of binary masks from semantic segmentation maps. To benchmark this task, we evaluate and compare the performance of three different convolutional neural network (CNN)-based methods and a Transformer-based method. Experimental results provide valuable insights into the adaptability of these methods to remote sensing data, highlighting the potential of our dataset as a resource for the remote sensing community to further explore vision-language tasks.
inproceedings YMH+24a
IGARSS 2024
IEEE International Geoscience and Remote Sensing Symposium. Athens, Greece, Jul 07-12, 2024.
Authors
Z. Yuan • L. Mou • Y. Hua • X. Zhu
BibTeXKey: YMH+24a