摘要
针对于自然场景下人脸检测存在的姿态复杂、遮挡和光照等问题,提出一种基于4级级联全卷积神经网络的人脸检测算法。构建4级级联网络,采用级联分级训练代替端到端训练,以避免只共享1个网络权值的局限,进而获得有区分性功能的深度网络,提高检测精度;每级深度网络结构均采用全卷积结构,可以接受任意尺寸图像的输入,提高检测效率;另外在训练过程采用自举法Bootstrap进行网络模型的优化训练,提高训练样本利用率;利用最终训练好的深度卷积网络模型实现人脸检测。人脸检测实验结果标明,本算法在自然场景下,对多姿态、遮挡、单图多种人脸类型等均具有良好的鲁棒性,同时在现有平台上每张图片的检测速度达到96ms,在国际权威的人脸检测公开测试集FDDB上的"真正率"达到82.98%。
Aiming at the problem of various facial gestures,occlusions and illumination in human face detection in natural scene,a face detection algorithm was proposed based on full convolution neural network of four-level cascading.Firstly,a four-level cascade network was constructed.A cascade training was used instead of endto-end training to avoid the limitation of sharing only one network weight.A deep network with differentiated functions can be obtained,to improve the detection accuracy.Secondly,each level of depth network architecture using full convolution network structure can accept any size of image input to improve the detection performance and efficiency.During the training process,the bootstrap method was used to optimize the training of the network model,to improve the training sample utilization rate.Finally,the trained deep convolution network model was used to achieve face detection.The experimental results show that the face detection algorithm has good robustness in the natural scene,such as multi-gesture,occlusion,single figure multiple faces.On the existing platform,the speed of each picture detection reached 96 ms.The accuracy on the international authoritative face detection data set(FDDB)benchmark is 82.98%.
引文
[1]LECUN Y,BENGIO Y,HINTON G.Deep Learning[J].Nature,2015,521:436-444.
[2]HJELMAS E,LOW B K.Face Detection:a Survey[J].Computer Vision and Image Understanding,2001,83(3):236-274.
[3]YANG M H,KRIEGMAN D,AHUJA N.Detecting Faces in Images:a Survey[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(1):34-58.
[4]TURK M,PENTLAND A.Eigenfaces for Recognition[J].Journal of Cognitive Neuroscience,1991,3(1):71-86.
[5]OSUNA E,FREUND R,GIROSIT F.Training Support Vector Machines:an Application to Face Detection[C]//Proceedings of IEEE Computer Society Conference on Computer Version and Pattern Recognition.New York:IEEE,1997:130-136.
[6]ZHANG S,CAI Y,XIE M.Face Detection Based on Local Region Sparse Coding[J].Journal of Software,2013,24(11):2747-2757.
[7]VIOLA P,JONES M J.Robust Real-time Face Detection[J].International Journal of Computer Vision,2004,57(2):137-154.
[8]VIOLA P,JONES M J.Rapid Object Detection Using a Boosted Cascade of Simple Features[C]//Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.New York:IEEE,2001:501-511.
[9]OSADCHY M,CUN Y L,MILLER M L.Synergistic Face Detection and Pose Estimation with Energy-based Models[J].The Journal of Machine Learning Research,2007,8(16):1197-1215.
[10]KALINOVSKII I,SPITSYN V.Compact Convolutional Neural Network Cascade for Face Detection[J].Neural Computation,2016,18(7):1527-1544.
[11]FARFADE S S,SABERIAN M J,LI L J.Multi-view Face Detection Using Deep Convolutional Neural Networks[C]//Proceedings of the 5th ACM on International Conference on Multimedia Retrieval.New York:ACM,2015:643-650.
[12]JIANG H,LEARNED-MILLER E.Face Detection with the Faster R-CNN[J].IEEE Computational Intelligence Magazine,2016,5(4):13-21
[13]LONG J,SHELHAMER E,DARRELL T.Fully Convolutional Networks for Semantic Segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.New York:IEEE,2015:3431-3440.
[14]SHRIVASTAVA A,GUPTA A,GIRSHICK R.Training Region-based Object Detectors with Online Hard Example Mining[C]//Proceedings of IEEE Internatial Conference on Computer Vision and Pattern Recognition.New York:IEEE,2016:761-769.
[15]LECUN Y,BOTTOU L,ORR G B,et al.Efficient Backprop[M].Berlin:Springer,2012:9-48.
[16]XU L,CHOY C S,LI Y W.Deep Sparse Rectifier Neural Networks for Speech Denoising[C]//IEEE International Workshop on Acoustic Signal Enhancement.New York:IEEE,2016,5(4):13-21
[17]CHEN D,REN S,WEI Y,et al.Joint Cascade Face Detection and Alignment[C]//European Conference on Computer Vision(ECCV 2014).Zhrich:Springer International Publishing,2014:109-122.
[18]LI H,LIN Z,SHEN X,et al.A Convolutional Neural Network Cascade for Face Detection[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.New York:IEEE,2015:5325-5334.
[19]YAN J,LEI Z,WEN L.The Fastest Deformable Part Model for Object Detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.New York:IEEE,2014:2497-2504.
[20]MATHIAS M,BENENSON R,PEDERSOLI M.Face Detection without Bells and Whistles[C]//European Conference on Computer Vision.Zhrich:Springer,2014:720-735.