Haoxue Zhang
Frequency-domain guided swin transformer and global-local feature integration for remote sensing images semantic segmentation.
Zhang, Haoxue; Xie, Gang; Li, Linjuan; Xie, Xinlin; Ren, Jinchang
Authors
Gang Xie
Linjuan Li
Xinlin Xie
Professor Jinchang Ren j.ren@rgu.ac.uk
Professor of Computing Science
Abstract
Convolutional Neural Networks (CNNs), transformers, and the hybrid methods have been significant application in remote sensing. However, existing methods are limited in effectively modeling frequency domain information, which affects their ability to capture detailed information. Therefore, we propose a frequency-domain guided feature coupled mechanism and a global-local feature integration method (FGNet) for semantic segmentation. Specifically, a frequency-domain guided Swin transformer (FGSwin) is designed by introducing dilation group convolution, Fast Fourier Transform (FFT) and learnable weights to enhance the expression capability of frequency-domain and space-domain, local and global features, simultaneously. In addition, a global-local feature integration module (GLFI) is proposed for aggregating features to further enhance the discrimination of each category. Comprehensive experimental results demonstrate that, compared to existing methods, the proposed method achieves superior performance in terms of mean intersection over union (mIoU), reaching 71.46% and 74.04% on the ISPRS Potsdam and Vaihingen, two widely used datasets.
Citation
ZHANG, H., XIE, G., LI, L., XIE, X. and REN, J. [2025]. Frequency-domain guided swin transformer and global-local feature integration for remote sensing images semantic segmentation. IEEE Transactions on geoscience and remote sensing [online], (Early Access). Available from: https://doi.org/10.1109/TGRS.2025.3535724
Journal Article Type | Article |
---|---|
Acceptance Date | Jan 28, 2025 |
Online Publication Date | Jan 28, 2025 |
Deposit Date | Feb 13, 2025 |
Publicly Available Date | Feb 13, 2025 |
Journal | IEEE Transactions on geoscience and remote sensing |
Print ISSN | 0196-2892 |
Electronic ISSN | 1558-0644 |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Peer Reviewed | Peer Reviewed |
DOI | https://doi.org/10.1109/TGRS.2025.3535724 |
Keywords | Remote sensing semantic segmentation; Global-local features; Frequency-domain guided Swin transformer; Feature integration |
Public URL | https://rgu-repository.worktribe.com/output/2675737 |
Files
ZHANG 2025 Frequency-domain guided swin (AAM)
(1.7 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
Copyright Statement
© 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
You might also like
Two-click based fast small object annotation in remote sensing images.
(2024)
Journal Article
Prompting-to-distill semantic knowledge for few-shot learning.
(2024)
Journal Article
Detection-driven exposure-correction network for nighttime drone-view object detection.
(2024)
Journal Article
Feature aggregation and region-aware learning for detection of splicing forgery.
(2024)
Journal Article
Downloadable Citations
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search