研究目的
To improve object segmentation and parsing by integrating multi-level features inspired by primate visual cortex and proposing enhanced Conditional Boltzmann Machine models.
研究成果
The proposed models achieve competitive results in object segmentation and parsing by leveraging multi-level features and structured models, with improvements in accuracy and generalization.
研究不足
High computational cost due to post-processing; curve correction does not improve accuracy significantly; small parts like tails are challenging to segment.
1:Experimental Design and Method Selection:
The study uses CNNs for low-level feature extraction and SRF for high-level feature extraction, with a multi-channel CBM to model relationships between features and labels.
2:Sample Selection and Data Sources:
Datasets include PASCAL VOC 2012, PennFudan Pedestrian Parsing, PPSS, Horse-Cow parsing, and PASCAL Quadrupeds.
3:List of Experimental Equipment and Materials:
Intel i7 CPU, NVIDIA Titan X GPU, Torch framework.
4:Experimental Procedures and Operational Workflow:
Features are extracted, input to CBM for training and inference, followed by superpixel and curve-based optimization.
5:Data Analysis Methods:
Performance evaluated using IoU and aPA metrics.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容-
NVIDIA Titan X GPU
Titan X
NVIDIA
Accelerated computing for deep learning tasks
-
Intel i7 CPU
i7
Intel
General-purpose computing for experiments
-
Torch
Deep learning framework for model training and inference
-
VGGNet
VGG16
Convolutional neural network for feature extraction
-
ResNet
ResNet101
Convolutional neural network for feature extraction
-
MCG
Multiscale combinatorial grouping for superpixel generation
-
gPb
Edge detection for curve correction
-
FastEdge
Edge detection using structured forests
-
登录查看剩余6件设备及参数对照表
查看全部