研究目的
Investigating the detection and reconstruction of buildings from a single aerial image using deep learning and convolutional neural networks.
研究成果
The proposed method utilizes the power of CNNs to extract the inherent and latent features from a single image and interpret them as 3D information for building reconstruction. The results over test datasets showed the reasonable performance of the proposed method in predicting height values with the average RMSE of 3.43 m and NMAD of 1.13 m. The precise boundaries of individual buildings are extracted with the accuracy of 95.8% and 88.4% for the Potsdam and Zeebrugge data, respectively. The result of 3D reconstruction was visually very promising, which was also numerically confirmed by the RMSE values of about 1.2 m and 0.8 m for the Potsdam data as well as 3.9 m and 2.4 m for the Zeebrugge data for the horizontal and vertical accuracies, respectively.
研究不足
The quality of the final 3D reconstruction highly depends on the quality and accuracy of the predicted linear elements as well as nDSMs. The most important challenges are trees decreasing the accuracy of the predicted eave lines, errors in the predicted ridge lines leading to tilted roofs being modeled as flat roofs, classification errors between the eave and ridge lines, and errors in the predicted nDSM affecting the median values of the eave lines.