研究目的
To address the suboptimal performance of traditional deep models in hyperspectral image classification due to limited training samples, large intraclass variance, and low interclass variance by developing a CNN with multiscale convolution and diversified metric to extract discriminative features.
研究成果
The proposed DPP-DML-MS-CNN method effectively combines multiscale feature extraction with diversified deep metrics, significantly improving hyperspectral image classification performance. It outperforms traditional deep models and achieves comparable or better results than state-of-the-art methods across multiple datasets. Future work includes applying the method to other tasks like target detection and exploring different MS-CNN architectures.
研究不足
The training process is time-consuming, especially without pretraining. The method's performance may be affected by the initialization of weights, and excessively large diversity weights (λ) or neighbor sizes can lead to decreased accuracy. The approach is evaluated only on specific hyperspectral datasets and may not generalize to all types of remote sensing data.
1:Experimental Design and Method Selection:
The methodology involves designing three types of MS-CNNs for 1-D spectral, 2-D spectral-spatial, and 3-D spectral-spatial classification, incorporating multiscale filter banks inspired by the inception module in GoogLeNet. DPP-based diversity-promoting priors are imposed on deep metrics to encourage parameter factors to repulse from each other, reducing redundancy. A joint learning approach combines MS-CNNs and diversified deep metrics.
2:Sample Selection and Data Sources:
Four real-world hyperspectral image datasets are used: Pavia University, Indian Pines, Salinas Scene, and Kennedy Space Center (KSC). Training and testing samples are selected randomly, with specific numbers per class as detailed in Tables I-IV of the paper.
3:List of Experimental Equipment and Materials:
The experiments are conducted using a common machine with a 3.4-GHz Intel Core i7 processor and 64-GB memory. The Caffe framework is employed for implementation.
4:4-GHz Intel Core i7 processor and 64-GB memory. The Caffe framework is employed for implementation.
Experimental Procedures and Operational Workflow:
4. Experimental Procedures and Operational Workflow: The process includes transforming training samples into appropriate input forms for MS-CNNs, extracting multiscale features, applying metric transformation with DPP priors, and classifying using a Softmax classifier. Pretraining and fine-tuning are performed with stochastic gradient descent and backpropagation.
5:Data Analysis Methods:
Classification performance is evaluated using average accuracy (AA), overall accuracy (OA), and Kappa coefficient. Statistical significance is assessed using McNemar's test. The diversity of learned parameters is measured using the inverse cosine correlation.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容