自然场景下基于四级级联全卷积神经网络的人脸检测算法

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

自然场景下基于四级级联全卷积神经网络的人脸检测算法

详细信息查看全文 | 推荐本文 |

英文篇名：Face Detection Based on Full Convolution Neural Network of Four-level Cascading in Natural Scene
作者：石学超 ; 周亚同 ; 韩卫雪
英文作者：SHI Xuechao;ZHOU Yatong;HAN Weixue;School of Electronic and Information Engineering,Hebei University of Technology;
关键词：人脸检测 ; 4级级联网络 ; 全卷积网络 ; 自举训练 ; 深度学习
英文关键词：face detection;;four-level cascade network;;full convolution network;;bootstrap training;;deep learning
中文刊名：TDXB
英文刊名：Journal of the China Railway Society
机构：河北工业大学电子信息工程学院;
出版日期：2019-01-15
出版单位：铁道学报
年：2019
期：v.41;No.255
基金：中国博士后科学基金(2014M561053);; 河北省自然科学基金(F2013202254);; 2015年度教育部人文社会科学研究项目(15YJA630108)
语种：中文;
页：TDXB201901013
页数：7
CN：01
ISSN：11-2104/U
分类号：86-92

摘要

针对于自然场景下人脸检测存在的姿态复杂、遮挡和光照等问题,提出一种基于4级级联全卷积神经网络的人脸检测算法。构建4级级联网络,采用级联分级训练代替端到端训练,以避免只共享1个网络权值的局限,进而获得有区分性功能的深度网络,提高检测精度;每级深度网络结构均采用全卷积结构,可以接受任意尺寸图像的输入,提高检测效率;另外在训练过程采用自举法Bootstrap进行网络模型的优化训练,提高训练样本利用率;利用最终训练好的深度卷积网络模型实现人脸检测。人脸检测实验结果标明,本算法在自然场景下,对多姿态、遮挡、单图多种人脸类型等均具有良好的鲁棒性,同时在现有平台上每张图片的检测速度达到96ms,在国际权威的人脸检测公开测试集FDDB上的"真正率"达到82.98%。
Aiming at the problem of various facial gestures,occlusions and illumination in human face detection in natural scene,a face detection algorithm was proposed based on full convolution neural network of four-level cascading.Firstly,a four-level cascade network was constructed.A cascade training was used instead of endto-end training to avoid the limitation of sharing only one network weight.A deep network with differentiated functions can be obtained,to improve the detection accuracy.Secondly,each level of depth network architecture using full convolution network structure can accept any size of image input to improve the detection performance and efficiency.During the training process,the bootstrap method was used to optimize the training of the network model,to improve the training sample utilization rate.Finally,the trained deep convolution network model was used to achieve face detection.The experimental results show that the face detection algorithm has good robustness in the natural scene,such as multi-gesture,occlusion,single figure multiple faces.On the existing platform,the speed of each picture detection reached 96 ms.The accuracy on the international authoritative face detection data set(FDDB)benchmark is 82.98%.

引文

[1]LECUN Y,BENGIO Y,HINTON G.Deep Learning[J].Nature,2015,521:436-444.
    [2]HJELMAS E,LOW B K.Face Detection:a Survey[J].Computer Vision and Image Understanding,2001,83(3):236-274.
    [3]YANG M H,KRIEGMAN D,AHUJA N.Detecting Faces in Images:a Survey[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(1):34-58.
    [4]TURK M,PENTLAND A.Eigenfaces for Recognition[J].Journal of Cognitive Neuroscience,1991,3(1):71-86.
    [5]OSUNA E,FREUND R,GIROSIT F.Training Support Vector Machines:an Application to Face Detection[C]//Proceedings of IEEE Computer Society Conference on Computer Version and Pattern Recognition.New York:IEEE,1997:130-136.
    [6]ZHANG S,CAI Y,XIE M.Face Detection Based on Local Region Sparse Coding[J].Journal of Software,2013,24(11):2747-2757.
    [7]VIOLA P,JONES M J.Robust Real-time Face Detection[J].International Journal of Computer Vision,2004,57(2):137-154.
    [8]VIOLA P,JONES M J.Rapid Object Detection Using a Boosted Cascade of Simple Features[C]//Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.New York:IEEE,2001:501-511.
    [9]OSADCHY M,CUN Y L,MILLER M L.Synergistic Face Detection and Pose Estimation with Energy-based Models[J].The Journal of Machine Learning Research,2007,8(16):1197-1215.
    [10]KALINOVSKII I,SPITSYN V.Compact Convolutional Neural Network Cascade for Face Detection[J].Neural Computation,2016,18(7):1527-1544.
    [11]FARFADE S S,SABERIAN M J,LI L J.Multi-view Face Detection Using Deep Convolutional Neural Networks[C]//Proceedings of the 5th ACM on International Conference on Multimedia Retrieval.New York:ACM,2015:643-650.
    [12]JIANG H,LEARNED-MILLER E.Face Detection with the Faster R-CNN[J].IEEE Computational Intelligence Magazine,2016,5(4):13-21
    [13]LONG J,SHELHAMER E,DARRELL T.Fully Convolutional Networks for Semantic Segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.New York:IEEE,2015:3431-3440.
    [14]SHRIVASTAVA A,GUPTA A,GIRSHICK R.Training Region-based Object Detectors with Online Hard Example Mining[C]//Proceedings of IEEE Internatial Conference on Computer Vision and Pattern Recognition.New York:IEEE,2016:761-769.
    [15]LECUN Y,BOTTOU L,ORR G B,et al.Efficient Backprop[M].Berlin:Springer,2012:9-48.
    [16]XU L,CHOY C S,LI Y W.Deep Sparse Rectifier Neural Networks for Speech Denoising[C]//IEEE International Workshop on Acoustic Signal Enhancement.New York:IEEE,2016,5(4):13-21
    [17]CHEN D,REN S,WEI Y,et al.Joint Cascade Face Detection and Alignment[C]//European Conference on Computer Vision(ECCV 2014).Zhrich:Springer International Publishing,2014:109-122.
    [18]LI H,LIN Z,SHEN X,et al.A Convolutional Neural Network Cascade for Face Detection[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.New York:IEEE,2015:5325-5334.
    [19]YAN J,LEI Z,WEN L.The Fastest Deformable Part Model for Object Detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.New York:IEEE,2014:2497-2504.
    [20]MATHIAS M,BENENSON R,PEDERSOLI M.Face Detection without Bells and Whistles[C]//European Conference on Computer Vision.Zhrich:Springer,2014:720-735.

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700