HSA-CNN: A Hybrid Spectral-Attention Multi-Agent Framework for Explainable Cloud Detection in Multispectral Remote Sensing Imagery

K.P. Swain; S.K. Mohapatra; Soumya Ranjan Nayak; Ashish Singh

doi:10.38094/jastt605741

2025
Special Issue: Remote Sensing based Intelligent Visual Analytics for Real-time Environmental and Earth Monitoring Systems

Published 2025-12-31

K.P. Swain
Department of ETC, Trident Academy of Technology, Bhubaneswar, Odisha

S.K. Mohapatra
Department of Computer Science and Engineering (AI&ML), Dayananda Sagar University, School of Engineering, Bengaluru, Karnataka, India

Soumya Ranjan Nayak
School of Computer Engineering, KIIT Deemed To Be University, Bhubaneswar, Odisha

Ashish Singh
School of Computer Engineering, KIIT Deemed To Be University, Bhubaneswar, Odisha

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

How to Cite

[1]

K. Swain, S. Mohapatra, S. R. Nayak, and A. Singh, “HSA-CNN: A Hybrid Spectral-Attention Multi-Agent Framework for Explainable Cloud Detection in Multispectral Remote Sensing Imagery”, JASTT, vol. 6, no. 05, pp. 201–221, Dec. 2025, doi: 10.38094/jastt605741.

Abstract

Cloud detection is an essential preprocessing step in remote sensing applications, because clouds and their shadows severely hamper the accuracy of surface observations and subsequent analysis. Reliable identification of thin clouds and cloud shadows remains an open problem even for state-of-the-art deep learning-based cloud detection methods, owing to spectral ambiguity, spatial variability, and the models' lack of uncertainty awareness. This work presents HSA-CNN, a hybrid spectral-attention multi-agent deep learning framework for accurate and explainable pixel-wise cloud identification in multispectral satellite imagery. The proposed architecture is a U-Net-based encoder-decoder complemented by SpectralDirectionalKernel (SDK) blocks for multi-scale feature extraction. It integrates a set of specialized agents: a transformer-based spectral attention agent, a MobileNet-based spatial context agent, a bidirectional LSTM-based temporal sequence agent, and a Bayesian uncertainty agent. A meta-agent orchestration mechanism performs confidence-aware, per-pixel expert selection and ensemble fusion, enabling robust predictions and reliable uncertainty estimation. The experimental results show that HSA-CNN can accurately classify four categories: clear sky, thick cloud, thin cloud, and cloud shadow. Moreover, it significantly improves thin cloud discrimination and prediction stability. The framework also provides interpretable outputs via attention maps, agent-weight visualizations, and pixel-level uncertainty maps, which improve transparency and operational trust. The proposed method is a powerful and interpretable tool for remote sensing that can be employed in atmospheric correction, environmental monitoring, and climate analysis.
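
The abstract does not state the fusion rule used by the meta-agent, so the following is only a minimal sketch of what confidence-aware, per-pixel ensemble fusion could look like, not the authors' implementation. It assumes each agent outputs per-pixel class probabilities over the four classes, scores each agent's confidence as one minus its normalized entropy, and turns those scores into per-pixel agent weights with a softmax. The function name `fuse_agents`, the `temperature` parameter, and the entropy-based confidence are all hypothetical choices made for illustration.

```python
import numpy as np

# Class order is an assumption; the abstract only names the four categories.
CLASSES = ("clear sky", "thick cloud", "thin cloud", "cloud shadow")


def fuse_agents(agent_probs, temperature=0.1):
    """Hypothetical confidence-weighted fusion of per-agent predictions.

    agent_probs: array of shape (A, H, W, C) holding, for each of A agents,
        per-pixel class probabilities that sum to 1 over the last axis.
    Returns:
        fused       (H, W, C) fused class probabilities
        weights     (A, H, W) per-pixel agent weights (sum to 1 over agents)
        uncertainty (H, W)    normalized entropy of the fused prediction
    """
    agent_probs = np.asarray(agent_probs, dtype=np.float64)
    eps = 1e-12
    n_classes = agent_probs.shape[-1]

    # Normalized entropy per agent and pixel: 0 = certain, 1 = uniform.
    entropy = -(agent_probs * np.log(agent_probs + eps)).sum(axis=-1)
    entropy /= np.log(n_classes)
    confidence = 1.0 - entropy

    # Softmax over the agent axis; a low temperature approaches hard
    # per-pixel expert selection, a high one approaches plain averaging.
    logits = confidence / temperature
    logits = logits - logits.max(axis=0, keepdims=True)
    weights = np.exp(logits)
    weights /= weights.sum(axis=0, keepdims=True)

    # Weighted average of the agents' probabilities at every pixel.
    fused = (weights[..., None] * agent_probs).sum(axis=0)

    # Pixel-level uncertainty map from the fused distribution.
    uncertainty = -(fused * np.log(fused + eps)).sum(axis=-1) / np.log(n_classes)
    return fused, weights, uncertainty


# Toy example: two agents, one pixel. Agent 0 is confident, agent 1 is not.
demo = np.array([
    [[[0.97, 0.01, 0.01, 0.01]]],
    [[[0.25, 0.25, 0.25, 0.25]]],
])
fused, weights, uncertainty = fuse_agents(demo)
label = CLASSES[int(fused[0, 0].argmax())]
```

In this toy case the confident agent receives almost all of the weight at the pixel, and the `weights` and `uncertainty` arrays are the kind of quantities that agent-weight visualizations and pixel-level uncertainty maps could be drawn from.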

Keywords

Cloud Detection, Multispectral Remote Sensing, Deep Learning, Multi-Agent Systems, Spectral Attention, Explainable Artificial Intelligence
