Yefan Xie
Binary quantization vision transformer for effective segmentation of red tide in multi-spectral remote sensing imagery.
Xie, Yefan; Hou, Xuan; Ren, Jinchang; Zhang, Xinchao; Ma, Chengcheng; Zheng, Jiangbin
Authors
Xuan Hou
Professor Jinchang Ren j.ren@rgu.ac.uk
Professor of Computing Science
Xinchao Zhang
Chengcheng Ma
Jiangbin Zheng
Abstract
As a global marine disaster, red tides pose serious threats to marine ecology and the blue economy, making their monitoring crucial for preventing harmful algal blooms and protecting the marine environment. In this study, satellite remote sensing was utilized to provide timely, large-scale, and continuous observation capabilities, overcoming the high cost and spatial and temporal limitations of in-situ monitoring. However, existing remote sensing-based methods often exhibit coarse segmentation granularity and suffer from high computational complexity. To overcome these challenges, we propose a novel bi-modal multispectral dynamic offset binary quantization visual transformer (DoBi-SWiP-ViT) that utilizes the ViT for global feature aggregation and parameter quantization for efficient segmentation. With the Bi-modal Swin-ViT with Unified Perceptual Parsing architecture, our model integrates data from multiple spectral bands to achieve fine-grained segmentation of large-scale remote sensing images. Additionally, we introduce a dynamic magnitude offset binary quantization ViT block to reduce the parameter redundancy and improve the computational efficiency. In addition, we validated the performance of our model through extensive comparative experiments on high-resolution imagery datasets of sea surface red tides collected from different satellite platforms. The results show that our proposed DoBi-SWiP-ViT has significantly improved the mean accuracy (mAcc) of the segmentation results. For the two test areas acquired from different satellite platforms, the improvements are 8.78% and 10.18%, respectively. This has demonstrated the superior performance of our model in detecting the red tides from high-resolution visible images, highlighting its effectiveness in capturing complex patterns and subtle features in multi-spectral imagery.
Citation
XIE, Y., HOU, X., REN, J., ZHANG, X., MA, C. and ZHENG, J. 2025. Binary quantization vision transformer for effective segmentation of red tide in multi-spectral remote sensing imagery. IEEE Transactions on geoscience and remote sensing [online], 63, article number 4202814. Available from: https://doi.org/10.1109/TGRS.2025.3540784
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 11, 2025 |
Online Publication Date | Feb 11, 2025 |
Publication Date | Dec 31, 2025 |
Deposit Date | Feb 14, 2025 |
Publicly Available Date | Feb 14, 2025 |
Journal | IEEE Transactions on geoscience and remote sensing |
Print ISSN | 0196-2892 |
Electronic ISSN | 1558-0644 |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Peer Reviewed | Peer Reviewed |
Volume | 63 |
Article Number | 4202814 |
DOI | https://doi.org/10.1109/TGRS.2025.3540784 |
Keywords | Red tide; Segmentation; Binary quantization; Vision transformer; Remote sensing; Multi-spectral imagery |
Public URL | https://rgu-repository.worktribe.com/output/2702501 |
Files
XIE 2025 Binary quantization vision (AAM)
(5.6 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
Copyright Statement
© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
You might also like
Two-click based fast small object annotation in remote sensing images.
(2024)
Journal Article
Prompting-to-distill semantic knowledge for few-shot learning.
(2024)
Journal Article
Detection-driven exposure-correction network for nighttime drone-view object detection.
(2024)
Journal Article
Feature aggregation and region-aware learning for detection of splicing forgery.
(2024)
Journal Article
Downloadable Citations
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search