Skip to main content

Research Repository

Advanced Search

Binary quantization vision transformer for effective segmentation of red tide in multi-spectral remote sensing imagery.

Xie, Yefan; Hou, Xuan; Ren, Jinchang; Zhang, Xinchao; Ma, Chengcheng; Zheng, Jiangbin

Authors

Yefan Xie

Xuan Hou

Xinchao Zhang

Chengcheng Ma

Jiangbin Zheng



Abstract

As a global marine disaster, red tides pose serious threats to marine ecology and the blue economy, making their monitoring crucial for preventing harmful algal blooms and protecting the marine environment. In this study, satellite remote sensing was utilized to provide timely, large-scale, and continuous observation capabilities, overcoming the high cost and spatial and temporal limitations of in-situ monitoring. However, existing remote sensing-based methods often exhibit coarse segmentation granularity and suffer from high computational complexity. To overcome these challenges, we propose a novel bi-modal multispectral dynamic offset binary quantization visual transformer (DoBi-SWiP-ViT) that utilizes the ViT for global feature aggregation and parameter quantization for efficient segmentation. With the Bi-modal Swin-ViT with Unified Perceptual Parsing architecture, our model integrates data from multiple spectral bands to achieve fine-grained segmentation of large-scale remote sensing images. Additionally, we introduce a dynamic magnitude offset binary quantization ViT block to reduce the parameter redundancy and improve the computational efficiency. In addition, we validated the performance of our model through extensive comparative experiments on high-resolution imagery datasets of sea surface red tides collected from different satellite platforms. The results show that our proposed DoBi-SWiP-ViT has significantly improved the mean accuracy (mAcc) of the segmentation results. For the two test areas acquired from different satellite platforms, the improvements are 8.78% and 10.18%, respectively. This has demonstrated the superior performance of our model in detecting the red tides from high-resolution visible images, highlighting its effectiveness in capturing complex patterns and subtle features in multi-spectral imagery.

Citation

XIE, Y., HOU, X., REN, J., ZHANG, X., MA, C. and ZHENG, J. 2025. Binary quantization vision transformer for effective segmentation of red tide in multi-spectral remote sensing imagery. IEEE Transactions on geoscience and remote sensing [online], 63, article number 4202814. Available from: https://doi.org/10.1109/TGRS.2025.3540784

Journal Article Type Article
Acceptance Date Feb 11, 2025
Online Publication Date Feb 11, 2025
Publication Date Dec 31, 2025
Deposit Date Feb 14, 2025
Publicly Available Date Feb 14, 2025
Journal IEEE Transactions on geoscience and remote sensing
Print ISSN 0196-2892
Electronic ISSN 1558-0644
Publisher Institute of Electrical and Electronics Engineers (IEEE)
Peer Reviewed Peer Reviewed
Volume 63
Article Number 4202814
DOI https://doi.org/10.1109/TGRS.2025.3540784
Keywords Red tide; Segmentation; Binary quantization; Vision transformer; Remote sensing; Multi-spectral imagery
Public URL https://rgu-repository.worktribe.com/output/2702501

Files

XIE 2025 Binary quantization vision (AAM) (5.6 Mb)
PDF

Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/

Copyright Statement
© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.




You might also like



Downloadable Citations