研究目的
To present a novel multi-view convolutional neural network (CNN) model for 3D facial expression recognition (FER) that incorporates multi-view and facial prior information of the observed 3D face into the learning process.
研究成果
The proposed multi-view CNN model for 3D FER shows promising results compared with the state-of-the-art studies. The incorporation of face prior knowledge helps the network to learn the expression better by focusing on important regions. Future work includes studying the importance of each view for the recognition and extending the method to 4D data.
研究不足
The performance depends on the accuracy of the 3D facial landmark detection and region clustering. The method requires an accurate establishment of the dense correspondence among face models.