Skip to main navigation menu Skip to main content Skip to site footer

HSA-CNN: A Hybrid Spectral-Attention Multi-Agent Framework for Explainable Cloud Detection in Multispectral Remote Sensing Imagery

Abstract

Cloud detection is an essential preprocessing step in a remote sensing application. However, the presence of clouds and their shadows severely hamper the accuracy of the surface observations and the subsequent analysis. Reliable identification of thin clouds and cloud shadows is still a problem to which even state-of-the-art deep learning-based cloud detection methods have not provided a solution, because of spectral ambiguity, spatial variability, and the lack of awareness of uncertainty of the models. This work presents HSA-CNN, a hybrid spectral, attention, multi-agent deep learning framework that accurately and explainably identifies pixel-wise clouds in multispectral satellite imagery. The proposed architecture is a U-Net-based encoder-decoder architecture that is complemented by SpectralDirectionalKernel (SDK) blocks for multi-scale feature extraction, and integrates a set of specialized agents, including a transformer-based spectral attention agent, a MobileNet-based spatial context agent, a bidirectional LSTM-based temporal sequence agent, and a Bayesian uncertainty agent. This meta-agent orchestration mechanism performs confidence-aware, per-pixel expert selection and ensemble fusion, enabling robust predictions and reliable uncertainty estimation. The experimental results show that HSA and CNN can accurately classify four cloud categories: clear sky, thick cloud, thin cloud, and cloud shadow. Moreover, it significantly improves thin cloud discrimination and prediction stability. Furthermore, the framework can provide interpretable outputs via attention maps, agent-weight visualizations, and pixel-level uncertainty maps, which improve transparency and operational trust. The proposed method is a powerful and interpretable tool in remote sensing that can be employed for atmospheric correction, environmental monitoring, and climate analysis.

Keywords

Cloud Detection, Multispectral Remote Sensing, Deep Learning, Multi-Agent Systems, Spectral Attention, Explainable Artificial Intelligence

PDF

References

  1. Z. Zhu and C. E. Woodcock, “Object-based cloud and cloud shadow detection in Landsat imagery,” Remote Sens. Environ., vol. 118, pp. 83–94, 2012, doi: 10.1016/j.rse.2011.10.028.
  2. N. Ma, L. Sun, C. Zhou, and Y. He, “Cloud detection for multi-satellite imagery based on spectral library and convolutional neural network,” Remote Sens., vol. 13, no. 16, p. 3319, 2021, doi: 10.3390/rs13163319.
  3. J. Yang, J. Guo, H. Yue, Z. Liu, H. Hu, and K. Li, “CDnet: CNN-based cloud detection for remote sensing imagery,” IEEE Trans. Geosci. Remote Sens., vol. 57, no. 8, pp. 6195–6211, Aug. 2019, doi: 10.1109/TGRS.2019.2904868.
  4. Z. Li, H. Shen, Q. Cheng, Y. Liu, S. You, and Z. He, “Deep learning based cloud detection for medium and high resolution remote sensing images of different sensors,” ISPRS J. Photogramm. Remote Sens., vol. 150, pp. 197–212, 2019, doi: 10.1016/j.isprsjprs.2019.02.017.
  5. Z. Wang, L. Zhao, J. Meng, Y. Han, X. Li, R. Jiang, J. Chen, and H. Li, “Deep learning-based cloud detection for optical remote sensing images: A survey,” Remote Sens., vol. 16, p. 4583, 2024, doi: 10.3390/rs16234583.
  6. R. Irish, J. Barker, S. Goward, and T. Arvidson, “Characterization of the Landsat-7 ETM+ automated cloud-cover assessment (ACCA) algorithm,” Photogramm. Eng. Remote Sens., vol. 72, pp. 1179–1188, 2006, doi: 10.14358/PERS.72.10.1179.
  7. L. Murino, U. Amato, M. F. Carfora, A. Antoniadis, B. Huang, W. Menzel, and C. Serio, “Cloud detection of MODIS multispectral images,” J. Atmos. Ocean. Technol., vol. 31, 2014, doi: 10.1175/JTECH-D-13-00088.1.
  8. B. Waske and J. Benediktsson, “Fusion of support vector machines for classification of multisensor data,” IEEE Trans. Geosci. Remote Sens., vol. 45, pp. 3858–3866, 2007, doi: 10.1109/TGRS.2007.898446.
  9. G. Camps-Valls, D. Tuia, L. Bruzzone, and J. A. Benediktsson, “Advances in hyperspectral image classification: Earth monitoring with statistical learning methods,” IEEE Signal Process. Mag., vol. 31, no. 1, pp. 45–54, 2014, doi: 10.1109/MSP.2013.2279179.
  10. G. Camps-Valls, “Machine learning in remote sensing data processing,” in Proc. IEEE Signal Process. Soc. Workshop Mach. Learn. Signal Process. (MLSP), 2009, doi: 10.1109/MLSP.2009.5306233.
  11. R. Singh, M. Biswas, and M. Pal, “Cloud detection using Sentinel-2 imageries: A comparison of XGBoost, RF, SVM, and CNN algorithms,” Geocarto Int., vol. 38, no. 1, pp. 1–32, 2023, doi: 10.1080/10106049.2022.2146211.
  12. J. Yang, J. Guo, H. Yue, Z. Liu, H. Hu, and K. Li, “CDnet: CNN-based cloud detection for remote sensing imagery,” IEEE Trans. Geosci. Remote Sens., vol. 57, no. 8, pp. 6195–6211, 2019, doi: 10.1109/TGRS.2019.2904868.
  13. S. Ghaffarian, J. Valente, M. van der Voort, and B. Tekinerdogan, “Effect of attention mechanism in deep learning-based remote sensing image processing: A systematic literature review,” Remote Sens., vol. 13, p. 2965, 2021, doi: 10.3390/rs13152965.
  14. Y. Wu, B. Li, J. Li, Y. Liang, N. Zhang, and A. Sun, “Enhancing nighttime cloud detection for moderate resolution imagers using a transformer-based deep learning network,” Remote Sens. Environ., vol. 332, p. 115067, 2026, doi: 10.1016/j.rse.2025.115067.
  15. K. H. Tran, X. Zhang, H. K. Zhang, Y. Shen, Y. Ye, Y. Liu, S. Gao, and S. An, “A transformer-based model for detecting land surface phenology from the irregular harmonized Landsat and Sentinel-2 time series across the United States,” Remote Sens. Environ., vol. 320, p. 114656, 2025, doi: 10.1016/j.rse.2025.114656.
  16. A. A. Aleissaee, A. Kumar, R. M. Anwer, S. Khan, H. Cholakkal, G.-S. Xia, and F. S. Khan, “Transformers in remote sensing: A survey,” Remote Sens., vol. 15, p. 1860, 2023, doi: 10.3390/rs15071860.
  17. A. Kendall and Y. Gal, “What uncertainties do we need in Bayesian deep learning for computer vision?” in Proc. 31st Int. Conf. Neural Inf. Process. Syst. (NIPS), 2017, pp. 5580–5590.
  18. Y. Gal and Z. Ghahramani, “Dropout as a Bayesian approximation: Representing model uncertainty in deep learning,” in Proc. 33rd Int. Conf. Mach. Learn. (ICML), vol. 48, pp. 1050–1059, 2016.
  19. U. Sinha and K. P. Swain, “Global Cloud Pattern Database for Earth Observation,” Kaggle, [Online]. Available: https://www.kaggle.com/datasets/ujjwalsinha01/global-cloud-pattern-database-for-earthobservation

Downloads

Download data is not yet available.

Similar Articles

41-50 of 102

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)