研究目的
To address the inherent limitations of standard CNNs in modeling geometric transformations for object detection in optical remote sensing images by introducing deformable convolution into Faster R-CNN and improving detection performance through data augmentation and feature aggregation techniques.
研究成果
The deformable Faster R-CNN with Transfer Connection Block (TCB) and Random Covering data augmentation significantly improves object detection performance in optical remote sensing images, especially for small and partially occluded objects. Future work will explore the balance between the TCB module and computational efficiency.
研究不足
The study focuses on optical remote sensing images and may not generalize to other types of imagery. The effectiveness of Random Covering and deformable convolution in extremely occluded or cluttered scenes requires further investigation.
1:Experimental Design and Method Selection:
The study integrates deformable convolution into Faster R-CNN to model geometric transformations and employs top-down and skip connections for feature aggregation. A data augmentation technique named Random Covering is proposed to enhance model robustness to occlusion.
2:Sample Selection and Data Sources:
Experiments are conducted on the NWPU VHR-10, SORSI, and HRRS datasets, which include annotated optical remote sensing images with various object categories.
3:List of Experimental Equipment and Materials:
The experiments are performed using Caffe on Intel i7-6700K CPU and NVIDIA GTX1080 GPU, with ResNet-50 as the backbone network.
4:Experimental Procedures and Operational Workflow:
The deformable Faster R-CNN is trained with alternating training strategy, using pre-training model ResNet-50 for initialization. Random Covering is applied during training to simulate occlusion.
5:Data Analysis Methods:
Performance is evaluated using Precision-Recall Curve (PRC) and Average Precision (AP), with Non-Maximum Suppression (NMS) for redundancy reduction.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容