DIResUNet: Architecture for multiclass semantic segmentation of high resolution remote sensing imagery data

dc.contributor.authorPriyanka
dc.contributor.authorSravya, N.
dc.contributor.authorLal, S.
dc.contributor.authorNalini, J.
dc.contributor.authorChintala, C.S.
dc.contributor.authorDell’Acqua, F.
dc.date.accessioned2026-02-04T12:27:43Z
dc.date.issued2022
dc.description.abstractScene understanding is an important task in information extraction from high-resolution aerial images, an operation which is often involved in remote sensing applications. Recently, semantic segmentation using deep learning has become an important method to achieve state-of-the-art performance in pixel-level classification of objects. This latter is still a challenging task due to large pixel variance within classes possibly coupled with small pixel variance between classes. This paper proposes an artificial-intelligence (AI)-based approach to this problem, by designing the DIResUNet deep learning model. The model is built by integrating the inception module, a modified residual block, and a dense global spatial pyramid pooling (DGSPP) module, in combination with the well-known U-Net scheme. The modified residual blocks and the inception module extract multi-level features, whereas DGSPP extracts contextual intelligence. In this way, both local and global information about the scene are extracted in parallel using dedicated processing structures, resulting in a more effective overall approach. The performance of the proposed DIResUNet model is evaluated on the Landcover and WHDLD high resolution remote sensing (HRRS) datasets. We compared DIResUNet performance with recent benchmark models such as U-Net, UNet++, Attention UNet, FPN, UNet+SPP, and DGRNet to prove the effectiveness of our proposed model. Results show that the proposed DIResUNet model outperforms benchmark models on two HRRS datasets. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
dc.identifier.citationApplied Intelligence, 2022, 52, 13, pp. 15462-15482
dc.identifier.issn0924669X
dc.identifier.urihttps://doi.org/10.1007/s10489-022-03310-z
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/22400
dc.publisherSpringer
dc.subjectAntennas
dc.subjectBenchmarking
dc.subjectDeep learning
dc.subjectLearning systems
dc.subjectPixels
dc.subjectSemantic Segmentation
dc.subjectSemantics
dc.subjectBenchmark models
dc.subjectHigh resolution remote sensing
dc.subjectHigh resolution remote sensing imagery
dc.subjectPerformance
dc.subjectRemote-sensing
dc.subjectResidual block and inception module
dc.subjectSemantic segmentation
dc.subjectSpatial pyramid pooling
dc.subjectSpatial pyramids
dc.subjectRemote sensing
dc.titleDIResUNet: Architecture for multiclass semantic segmentation of high resolution remote sensing imagery data

Files

Collections