研究目的
To propose a novel dense pyramid network (DPN) for semantic segmentation of high-resolution aerial imagery, addressing the challenges of large intra-class variance and small inter-class differences.
研究成果
The proposed DPN architecture, incorporating group convolutions, channel shuffle operation, densely connected convolutions, and a pyramid pooling module, demonstrated superior performance in semantic segmentation of high-resolution aerial images compared to state-of-the-art methods. The median frequency balanced focal loss effectively addressed the class imbalance problem.
研究不足
The paper does not explicitly mention limitations, but potential areas for optimization could include computational efficiency and the handling of very high-resolution images.
1:Experimental Design and Method Selection:
The DPN architecture includes group convolutions and channel shuffle operation for multi-sensor data feature preservation, densely connected convolutions for high-level semantic feature extraction, and pyramid pooling operation for multi-sensor and multi-resolution feature fusion.
2:Sample Selection and Data Sources:
The ISPRS Vaihingen and Potsdam 2D semantic labeling datasets were used for evaluation.
3:List of Experimental Equipment and Materials:
Not explicitly mentioned.
4:Experimental Procedures and Operational Workflow:
The network was trained using an Adam optimizer with a median frequency balanced focal loss function. Data augmentation was applied to mitigate overfitting.
5:Data Analysis Methods:
The performance was evaluated based on the proportion of correctly labeled pixels and the per-class F1 score.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容