研究目的
Investigating the capacity of deep-learning on LIDAR-based maps, considering the range and reflectance data, applied to pedestrian classification; evaluation of the classification performance of early and late fusion strategies, based on learning and deterministic fusion techniques.
研究成果
The fusion strategies attained the best results in comparison with the individual CNNs. The combination of camera and LIDAR data increases the classification performance, with the LIDAR’s reflectance maps showing very promising results.
研究不足
The study focuses on binary classification (pedestrian vs. non-pedestrian) and does not address the detection problem, which involves unknowns in position and size/scale. The dataset is unbalanced, with more non-pedestrian examples than pedestrian ones.
1:Experimental Design and Method Selection:
The study uses Convolutional Neural Networks (CNN) as classifiers in distinct situations, including single sensor data input and combined data from both sensors in the CNN input layer. Early and late multi-modal sensor fusion approaches are compared.
2:Sample Selection and Data Sources:
A 'binary classification' dataset is created from the KITTI Vision Benchmark Suite, consisting of pedestrian and non-pedestrian categories.
3:List of Experimental Equipment and Materials:
Data from a monocular camera and a 3D LIDAR sensor are used.
4:Experimental Procedures and Operational Workflow:
LIDAR point clouds are used to generate high-resolution depth and reflectance maps through a bilateral filter implementation.
5:Data Analysis Methods:
Performance measures such as F-score, recall, precision, and ROC curves (area under the curve-AUC) are used for evaluation.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容