Xuran Pan
Building extraction from high-resolution aerial imagery using a generative adversarial network with spatial and channel attention mechanisms.
Pan, Xuran; Yang, Fan; Gao, Lianru; Chen, Zhengchao; Zhang, Bing; Fan, Hairui; Ren, Jinchang
Authors
Fan Yang
Lianru Gao
Zhengchao Chen
Bing Zhang
Hairui Fan
Professor Jinchang Ren j.ren@rgu.ac.uk
Professor of Computing Science
Abstract
Segmentation of high-resolution remote sensing images is an important challenge with wide practical applications. The increasing spatial resolution provides fine details for image segmentation but also incurs segmentation ambiguities. In this paper, we propose a generative adversarial network with spatial and channel attention mechanisms (GAN-SCA) for the robust segmentation of buildings in remote sensing images. The segmentation network (generator) of the proposed framework is composed of the well-known semantic segmentation architecture (U-Net) and the spatial and channel attention mechanisms (SCA). The adoption of SCA enables the segmentation network to selectively enhance more useful features in specific positions and channels and enables improved results closer to the ground truth. The discriminator is an adversarial network with channel attention mechanisms that can properly discriminate the outputs of the generator and the ground truth maps. The segmentation network and adversarial network are trained in an alternating fashion on the Inria aerial image labeling dataset and Massachusetts buildings dataset. Experimental results show that the proposed GAN-SCA achieves a higher score (the overall accuracy and intersection over the union of Inria aerial image labeling dataset are 96.61% and 77.75%, respectively, and the F1-measure of the Massachusetts buildings dataset is 96.36%) and outperforms several state-of-the-art approaches.
Citation
PAN, X., YANG, F., GAO, L., CHEN, Z., ZHANG, B., FAN, H. and REN, J. 2019. Building extraction from high-resolution aerial imagery using a generative adversarial network with spatial and channel attention mechanisms. Remote sensing [online], 11(8), article 917. Available from: https://doi.org/10.3390/rs11080917
Journal Article Type | Article |
---|---|
Acceptance Date | Apr 12, 2019 |
Online Publication Date | Apr 15, 2019 |
Publication Date | Apr 30, 2019 |
Deposit Date | May 2, 2022 |
Publicly Available Date | May 2, 2022 |
Journal | Remote Sensing |
Electronic ISSN | 2072-4292 |
Publisher | MDPI |
Peer Reviewed | Peer Reviewed |
Volume | 11 |
Issue | 8 |
Article Number | 917 |
DOI | https://doi.org/10.3390/rs11080917 |
Keywords | High-resolution aerial images; Deep learning; Generative adversarial network; Semantic segmentation; Inria aerial image labeling dataset; Massachusetts buildings dataset |
Public URL | https://rgu-repository.worktribe.com/output/1085608 |
Files
PAN 2019 Building extraction from high (VOR)
(7.2 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
Copyright Statement
© 2019 by the authors. Licensee MDPI, Basel, Switzerland.
You might also like
Two-click based fast small object annotation in remote sensing images.
(2024)
Journal Article
Prompting-to-distill semantic knowledge for few-shot learning.
(2024)
Journal Article
Detection-driven exposure-correction network for nighttime drone-view object detection.
(2024)
Journal Article
Feature aggregation and region-aware learning for detection of splicing forgery.
(2024)
Journal Article