智能视频监控系统中若干检测与跟踪算法的研究

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

智能视频监控系统中若干检测与跟踪算法的研究

详细信息本馆镜像全文| 推荐本文 | | 获取CNKI官网全文

英文题名：Research of Several Detection and Tracking Algorithms in Intelligent Video Surveillance System
作者：谢迪
论文级别：博士
学科专业名称：计算机科技与技术
中文关键词：视频监控 ; 火焰检测 ; 人头检测 ; 人体上半身检测与分割 ; 目标跟踪 ; 人工神经网络 ; 傅立叶变换 ; 混合高斯模型 ; 有向梯度直方图 ; 贝叶斯后验概率 ; 似然函数 ; 推土机距离 ; SURF特征点
英文关键词：Video surveillance ; fire detection ; head detection ; human upper body
英文关键词：detection and segmentation ; object tracking ; ANN ; Fourier transformation ; GMM ; HOG Bayesian posterior ; likelihood ; EMD ; SURF
学位年度：2012
导师：董金祥 ; 童若锋
学科代码：0812
学位授予单位：浙江大学
论文提交日期：2012-04-01
答辩委员会主席：汪国昭

摘要

随着电子信息技术、计算机软硬件技术的不断发展,视频监控系统已经在城市化进程中体现出日益重要的价值,其应用早已渗透到政治、军事、文化、金融、科技等各个领域。智能算法作为视频监控系统的重要组成部分,已经在安防领域中发挥了巨大的作用。同时,智能算法也是视频监控系统相比其它形式的监控系统具有更高的性价比,更适合现代化发展的关键因素。因而,具有重要的研究意义。
     本文围绕着智能视频监控系统中的若干应用,重点研究了火焰检测、人头检测及人体上半身检测与分割、目标跟踪等三个方面的问题。论文工作包括：
     1.火焰检测方面,提出了一种新的基于人工神经网络的视频火焰检测方法。该方法在分析火焰的运动和颜色特征以外,还研究并利用了火焰的闪烁频率、几何形状等时空域特征,并用所获得的各类特征作为人工神经网络的输入,输出一个经过综合判断的结果。提出了针对快速傅立叶变换的GPU加速算法。所提方法能够区分闪烁的车灯与真实的火焰,并获得了较高的检测准确率。
     2.人头检测及人体上半身检测与分割方面,分别提出了有向梯度直方图与二维形状直方图这两个特征。在此基础上,针对人头检测,进一步提出了基于贝叶斯决策论的运动与外观似然的滤波方法；针对人体上半身检测,进一步提出了结合能量函数最优化与背景剔除技术的前景对象分割方法。在计算有向梯度直方图特征上,提出一种基于CUDA的GPU加速算法。所提的人头检测方法有效地降低了检测的误检率,而人体上半身检测与分割方法则能够准确提取位于前景部分的人体区域。
     3.目标跟踪方面,提出了一种基于推土机距离与SURF特征点的目标跟踪算法。提出了把解基于SURF特征点的跟踪问题规约到解推土机距离的线性规划问题上的思想。另外,提出了分两阶段由粗到精的跟踪方法和基于贝叶斯概率理论的多个目标对象发生遮挡时的处理方法。所提跟踪方法在实际视频监控系统中能够有效地对多个目标进行长时间跟踪,并具有较高的鲁棒性与可靠性。
     本文所提出的三个视频监控系统智能算法经实验证明是可行的,并且其中的若干思想已经和实际的应用进行了结合,与现有的监控系统进行了集成,取得了一定的实用效果。
With development of informantion and computer hardware-software technologies, video surveillance system has revealed its importance in the urbanization process. The range of its applications includes various fields such as politics, military, culture, finance and technology. Specially, the application of computer vision algorithms in surveillance system is a crucial premise of its intellectualization. It is also the main reason of the superiority of video-based surveillance system compared with other surveillance system, which has shown higher cost performance and more appropriateness for modern development. In conclusion, the research to computer vision in surveillance system has very important significance.
     In this paper, we choose several most representative intelligent applications in video surveillance field and carry on research about the algorithms to solve with them. It includes three aspects:fire detection, head detection and human upper body detection and segmentation, objects tracking. In details:
     1. In the field of fire detection, we propose a novel video fire detection method based on artificial neural network. Except for analyzing fire's motion and color features, the proposed method researches and utilizes tempral and special features such as fire's flickering frequency and geometry. All these extracted features are fed into an artificial neural network and the network outputs an integrated result. Moreover, we propose GPU based fast Fourier transformation algorithm. The proposed method can distinguish between flickering vehicle light and real fire, which results higher detection rate.
     2. In the field of head detection and human upper body detection and segmentation, we propose histogram of oriented gradient and shape2D histogram features respectively. On that basis, for head detection, we further propose filter method of motion and appearance likelihoods based on Bayesian theory. For human upper body detection and segmentation, we further propose foreground segmentation method combined background subtraction and energy function optimization. Moreover, we design a GPU acceleration algorithm based on CUDA in computing HOG feature. The proposed head detection method reduce false positives effectively while proposed human upper body detection and segmentationi method can achieve extraction of upper body regions correctly.
     3. In the field of objects tracking, we propose an objects tracking method based earth mover's distance (EMD) and SURF feature points. We introduce the idea of reducing the problem of tracking objects with SURF feature points to the linear programming problem which solves EMD. Otherwise, we propose two phases tracking strategy, which means coarse-to-fine idea and the solution of multi-objects occlusion based on Bayesian framework. The proposed tracking method can locate multiple objects for a long time and achieve robustness and reliability.
     Experiments have proven that the three proposed methods are available and have excellent performance. We have integrated them into real surveillance systems and achieved applicable results.

引文

[1]GHealey, D.Slater, T.Lin. A system for real-time fire detection[C]//IEEE Computer Vision and Pattern Recognition Conference,1993:605-606
    [2]Y.F.Simon. A rule-based machine vision system for fire detection in aircraft dry days and engine compartments [J]. Knowledge-Based Systems,1996(9):531-540
    [3]W.Phillips, M.Shah, N.Lobo. Flame recognition in video[C]//Proceedings of the Fifth IEEE Workshop on Applications of Computer Vision,2000:224-229
    [4]http://www.videosmokedetection.com/vsd8.htm
    [5]C.B.Liu, N.Ahuja. Vision based fire detection [C]//IEEE International Conference on Pattern Recognition,2004:134-137.
    [6]T.Schultze. Audio-video fire-detection of open fires [J]. Fire Safety Journal, 2006(41):311-314
    [7]W.Krull. Design and test methods for a video-based cargo fire verification system for commercial aircraft [J]. Fire Safety Journal,2006(41):290-300
    [8]B.U.Toreyin, Y.Dedeoglu, U.Gudukbay. Computer vision based method for real-time fire and flame detection [J]. Pattern Recognition Letters,2006, 27(1):49-58
    [9]B.U.Toreyin, Y.Dedeoglu, U.Gudukbay. Real-time fire and flame detection in video [C]//International Conference on Acoustics Speech, and Signal Processing, 2005:669-672
    [10]B.U.Toreyin, Y.Dedeoglu, A.E.Cetin. Flame detection in video using hidden Markov models [C]//International Conference on Image Processing, 2005:1230-1233
    [11]B.U.Toreyin, A.E.Cetin. Online detection of fire in video [C]//IEEE Conference on Computer Vision and Pattern Recognition,2007:1-5
    [12]B.C.Ko, K.H.Cheong, J.Y.NAM. Fire detection based on vision sensor and support vector machines [J]. Fire Safety Journal,2009,44(3):322-329
    [13]T.Celik, H.Demirel. Fire detection in video sequences using a generic color model [J]. Fire Safety Journal,2009,44(2):147-158
    [14]C.C.Ho, T.H.Kuo. Real-time video-based fire smoke detection system. [C]// IEEE International Conference on Advanced Intelligent Mechatronics,2009: 1845-1850
    [15]O.Gunay, K.Tasdemir, B.U.Toreyin. Video based wild fire detection at night [J]. Fire Safety Journal,2009,44(6):860-868.
    [16]B.U.Toreyin, A.E.Cetin. Wildfire detection using LMS based active learning. [C]//International Conference on Acoustics Speech, and Signal Processing,2009: 1461-1464
    [17]T.Cleary, W.Grosshandler. Survey of fire detection technologies and system evaluation/certification methodologies and their suitability for aircraft cargo compartments. U.S. Dept. of Commerce, Technology Administration, National Institute of Standards and Technology.
    [18]S.Noda, K.Ueda. Fire detection in tunnels using an image processing method[C]//Vehicle Navigation and Information Systems Conference, 1994:57-62
    [19]T.Ono, H.Ishii, K.Kawamura, H.Miura, E.Momma, T.Fujisawa, J.Hozumi. Application of neural network to analyses of CCD colour TV-camera image for the detection of car fires in expressway tunnels [J]. Fire Safety Journal, 2006(41):279-284
    [20]J.C.Owrutsky, D.A.Steinhurst, C.P.Minor, S.L.Rose-Pehrsson, F.W.Williams, D.T. Gottuk. Long wavelength video detection of fire in ship compartments [J]. Fire Safety Journal,2006(41):315-320
    [21]D.T.Gottuk, J.A.Lynch, S.L.Rose-Pehrsson, J.C.Owrutsky, F.W.Williams [J]. Video image fire detection for shipboard use [J]. Fire Safety Journal, 2006(41):321-326
    [22]金华彪.基于数字图像处理的火灾探测技术[J].消防科学与技术,2002,5(3)：46-47
    [23]吴龙标,宋卫国,卢结成.图像火灾监控中一个新颖的火灾判据[J].火灾科学,1997,6(2)：60-66.
    [24]卢结成,吴龙标,宋卫国.一种火灾图像探测系统的研究[J].仪器仪表学报2001,22(4)：437-440
    [25]D.J.Lee, P.Zhan, A.Thomas, R.Schoenberger. Shape-based human intrusion detection[C]//SPIE International Symposium on Defense and Security, Visual Information Processing XIII,2004(5438):81-91
    [26]J.Zhou, J.Hoang. Real time robust human detection and tracking system[C]// IEEE Computer Society Conference on Computer Vision and Pattern Recognition,2005(3):149
    [27]D.Toth, T.Aach. Detection and recognition of moving objects using statistical motion detection and fourier descriptors[C]//International Conference on Image Analysis and Processing,2003:430-435
    [28]1 C.R.Wren, A.Azarbayejani, T.Darrell, A.P.Pentland. Pfinder:real-time tracking of the human body [J]. PAMI,1997(97):780-785
    [29]H.Eng, J.Wang, A.Kam, W.Yau. A bayesian framework for robust human detection and occlusion handling using a human shape model[C]//International Conference on Pattern Recognition,2004(2):257-260
    [30]H.zein, S.Lakshmanan, P.Watta. A motion and shape based pedestrian detection algorithm[C]//IEEE Intelligent Vehicles Symposium,2003:500-504
    [31]A.Mcivor. Background subtraction techniques
    [32]M.Piccardi. Background subtraction techniques:a review[C]//International Conference on Systems, Man and Cybernetics,2004(4)
    [33]N.Dalai, B.Triggs, Histograms of oriented gradients for human detection[C]// CVPR,2005:886-893
    [34]P.Viola, M J.Jones, D.Snow. Detecting pedestrians using patterns of motion and appearance[C]//ICCV,2003(2):734-741
    [35]C.Hou, H.Ai, S.Lao. Multi-view Pedestrian Detection Based on Vector Boosting[C]//ACCV,2007:18-22
    [36]S.Maji, A.Berg, J.Malik. Classification using intersection kernel support vector machines is efficient[C]//CVPR,2008:1-8
    [371 http://www.vision.ee.ethz.ch/-calvin/calvin_upperbody_detector/
    [38]B.Wu, R.Nevatia. Detection and tracking of multiple, partially occluded humans by bayesian combination of edgelet based part detectors[C]//IJCV,2007, 75(2):247-266
    [39]J.Xing, H.Ai, S.Lao. Multiple Human Tracking Based on Multi-View Upper-Body Detection and Discriminative Learning[C]//ICPR,2010:1698-1701
    [40]C.Stauffer, W.E.L.Grimson. Adaptive background mixture models for real-time tracking[C]//CVPR,1999(2):246-252
    [41]K.Kim, T.H.Chalidabhongse, D.Harwood, L.Davis. Real-time foreground-background segmentation using codebook model [J]. Real-Time Imaging,2005, 11(3):172-185
    [42]Y.Z.Liu. Nonparametric background generation [J]. Journal of Visual Comm. and Image Rep.,2007,18(3):253-263
    [41]W.GAO, H.Ai, S.Lao. Adaptive Contour Features in Oriented Granular Space for Human Detection and Segmentation[C]//CVPR,2009:1786-1793
    [44]Z.Lin, L.Davis, A pose-invariant descriptor for human detection and segmentation[C]//ECCV,2008:423-436
    [45]T.Zhao, R.Nevatia. Bayesian human segmentation in crowded situations[C]// CVPR,2003(2):459-466
    [46]T.Zhao, R.Nevatia. Stochastic human segmentation from a static camera[C]// Workshop on Motion and Video Computing,2002:9-14
    [47]I.Haritaoglu, D.Harwood, L.S.David. W4:Real-Time Surveillance of People and Their Activities [J]. PAMI,2000,22(8):809-830
    [48]T.B.Moselund, A.Hilton, V.Kruger. A survey of advances in vision-based human motion capture and analysis [J]. Computer Vision and Image Understanding, 2006(104):90-126
    [49]K.Y.Song, J.Kittler, M.Petrou. Defect detection in random color textures [J]. Israeal Verj. Cap.,1996,14(9):667-683
    [50]G.Paschos. Perceptually uniform color spaces for color texture analysis:an empirical evaluation [J]. IEEE Transactions on Image Process.2001(10):932-937
    [51]J.Canny. A computational approach to edge detection [J]. IEEE Trans. Pattern Analysis and Machine Intelligence,1986,8(6):679-698
    [52]K.Bowyer, C.Kranenburg, S.Dougherty. Edge detector evaluation using empirical roc curve [J]. Comput. Vision Image Understand,2001 (10):77-103
    [53]B.Horn, B.Schunk. Determining optical flow [J]. Artificial Intelligence, 1981(17):185-203
    [54]B.D.Lucas, T.Kanade. An iterative image registration technique with an application to stereo vision[C]//International Joint Conference on Artificial Intelligence,1981
    [55]J.Barron, D.Fleet, S.Beauchemin. Performance of optical flow techniques [J]. Int. J. Comput. Vision,1994(12):43-77
    [56]M.Black, P.Anandan. The robust estimation of multiple motions:Parametric and piecewise-smooth flow fields [J]. Compute. Vision Image Understand,1996, 63(1):75-104
    [57]R.Szeliski, J.Coughlan. Spline-based image registration [J]. Int. J. Comput. Vision,1997,16(1-3):185-203
    [58]R.Haralick, B.Shanmugam, I.Dinstein. Textural features for image classification [J]. IEEE Trans. Syst. Man Cybern.,1973,33(3),610-622
    [59]K.Laws. Textured image segmentation. PhD thesis, Electrical Engineering, University of Southern California,1980
    [60]S.Mallat. A theory for multi-resolution signal decomposition:The wavelet representation [J]. IEEE Trans. Patt. Analy. Mach. Intell, PAMI,1989, 11(7):674-693
    [61]H.Greenspan, S.Belongie, R.Goodman, P.Perona, S.Rakshit, C.Anderson. Over-complete steerable pyramid filters and rotation invariance[C]//CVPR, 1994:222-228
    [62]D.Comaniciu, V.Ramesh, P.Meer. Real-time tracking of non-rigid objects using mean shift[C]//CVPR,2000(2):142-149
    [63]J.P.Lewis. Fast Normalized Cross-Correlation [J]. Vision Interface,1995
    [64]S.Birchfield. Elliptical head tracking using intensity gradients and color histograms[C]//CVPR,1998:232-237
    [65]H.Schweitzer, J.W.Bell, F.Wu. Very fast template matching[C]//European Conference on Computer Vision, ECCV,2002:358-372
    [66]P.Fieguth, D.Terzopoulos. Color-based tracking of heads and other mobile objects at video frame rates[C]//CVPR,1997:21-27
    [67]D.Comaniciu, V.Ramesh, P.Meer. Kernel-based object tracking [J]. PAMI, 2003(25):564-575
    [68]D.Comaniciu, P.Meer. Mean shift:A robust approach toward feature space analysis [J]. PAMI,2002,24(5):603-619
    [69]A.Jepson, D.Fleet, T.Elmaraghi. Robust online appearance models for visual tracking[C]//CVPR,2003,25(10):1296-1311
    [70]H.Tao, H.Sawhney, R.Kumar. Object tracking with bayesian estimation of dynamic layer representation [J]. PAMI,2002,24(1):75-89
    [71]M.Isard, J.MacCormick. Bramble:A bayesian multiple-blob tracker[C]//ICCV, 2001:34-41
    [72]A.Yilmaz, X.Li, M.Shan. Target tracking in airborne forward looking imagery [J]. J. Image Vision Comput.,2003,21(7):623-635
    [73]Y.Cai, N.Freitas, J.Little. Robust visual tracking for multiple targets[C]//ECCV, 2006(3954):107-118
    [74]M.Isard, A.Blake. Icondensation:Unifying low-level and high-level tracking in a stochastic framework[C]//ECCV,1998(1407)
    [75]Z.Khan, T.Balch, F.Dellaert. An MCMC-based particle filter for tracking multiple interacting targets[C]//ECCV,2004(3024):279-290
    [76]K.Smith, D.GPerez, J.Odobez. Using particles to track varying numbers of interacting people[C]//CVPR,2005
    [77]T.Zhao, R.Nevatia. Tracking multiple humans in crowded environment[C]// CVPR,2004
    [78]VPhilomin, R.Duraiswami, L.Davis. Quasi-Random Saompling for Condensation[C]//ECCV,2000(1843):134-149
    [79]Y.Li, H.Ai, T.Yamashita, S.Lao, M.Kawade. Tracking in low frame rate video:A cascade particle filter with discriminative observers of different lifespans[C]// CVPR,2007
    [80]J.Kwon, K.M.Lee. Tracking of Abrupt Motion Using Wang-Landau Monte Carlo Estimation[C]//ECCV,2008:387-400
    [81]J.Kwon, K.M.Lee. Visual tracking decomposition[C]//IEEE Conference on Computer Vision and Pattern Recognition, CVPR,2010:1269-1276
    [82]E.Maggio, E.Piccardo, C.Regazzoni, A.Cavallaro. Particle PHD Filtering for Multi-Target Visual Tracking[C]//IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP,2007:1101-1104
    [83]L.Cerman, J.Matas, V.Hlavac. Spuntnik tracker:Looking for a companion improves robustness of the tracker[C]//Proceeding Scandinavian Conference on Image Analysis,2009
    [84]H.Grabner, J.Matas, L.V.Gool, P.Cattin. Tracking the Invisible:Learning Where the Object Might be[C]//IEEE Conference on Computer Vision and Pattern Recognition, CVPR,2010:1285-1292
    [85]V.Lepetit, P.Lagger, P.Fua. Randomized trees for real-time keypoint recognition[J]//CVPR,2005
    [86]D.Ramanan, D.Forsyth, A.Zisserman. Strike a pose:Tracking people by finding stylized poses[C]//CVPR,2005
    [87]M.Andriluka, S.Roth, B.Schiele. People-tracking-by-detection and people-detection-by-tracking[C]//CVPR,2008
    [88]M.Ozuysal, V.Lepetit, F.Fleuret, P.Fua. Feature harvesting for tracking-by-detection[C]//ECCV,2006
    [89]C.Rosenberg, M.Hebert, H.Schneiderman. Semi-supervised self-training of object detection models[C]//WACV,2005
    [90]P. Roth, M. Donoser, and H. Bischof. On-line learning of unknown hand held objects via tracking[C]//ICVS,2006
    [91]S.Avidan. Ensemble tracking [J]. PAMI,2007,29(2):261-271
    [92]R.Collins, Y.Liu, M.Leordeanu. Online selection of discriminative tracking features [J]. PAMI,2005,27(10):1631-1643
    [93]L.Ellis, N.Dowson, J.Matas, R.Bowden. Linear predictors for fast simultaneous modeling and tracking[C]//ICCV,2007:1-8
    [94]H.Grabner, H.Bischof. On-line boosting and vision[C]//CVPR,2006
    [95]I.Matthews, T.Ishikawa, S.Baker. The template update problem [J]. PAMI,2004, 26(6):810-815
    [96]S.Avidan. Support vector tracking [J]. PAMI,2004,26(8):1064-1072
    [97]J.Gall, N.Razavi, L.V.Gool. Online adaption of class-specific codebooks for instance tracking[C]//British Machine Vision Conference, BMVC,2010
    [98]A.Perera, C.Srinivas, A.Hoogs, G.Brooksby, H.Wensheng. Multi-Object Tracking Through Simultaneous Long Occlusions[C]//CVPR,2006:666-673
    [99]C.Huang, B.Wu, R.Nevatia. Robust Object Tracking by Hierarchical Association of Detection Responses[C]//ECCV,2008:788-801
    [100]C.Beleznai, B.Fruhstuck, H.Bischof. Multiple Object Tracking Using Local PCA[C]//ICIP,2006
    [101]B.Benfold, I.Reid. Stable multi-target tracking in real-time surveillance video[C]//CVPR,2011:3457-3464
    [102]Z.Kalal, J.Matas, K.Mikolajczyk. Online learning of robust object detectors during unstable tracking[C]//Computer Vision Workshops, ICCV Workshops, 2009:1417-1424
    [103]Z.Kalal, J.Matas, K.Mikolajczyk. P-N learning:Bootstrapping binary classifiers by structural constraints[C]//CVPR,2010:49-54
    [104]C.Vondrick, D.Ramanan. Video Annotation and Tracking with Active Learning [J]. Neural Information Processing Systems, NIPS,2011
    [105]J.Berclaz, F.Fleuret, E.Turetken, P.Fua. Multiple Object Tracking Using K-Shortest Paths Optimization [J]. PAMI,2011,33(9):1806-1819
    [106]B.Settles. Active learning literature survey. Computer Sciences Technical Report 1648, University of Wisconsin-Madison,2009
    [107]R.T.Collins, A.J.Lipton, T.Kanade. A system for video surveillance and monitoring [R]. Robotics Institute,1999.
    [108]C.M.Bishop. Pattern recognition and machine learning [M]. New York: Springer,2006:232-241.
    [109]F.Leymarie, M.D.Levine. Tracking deformable objects in the plane using an active contour model [J]. PAMI,2002(15):617-634
    [110]Q.Zhao, S.Brennan, H.Tao. Differential EMD Tracking[C]//ICCV,2007:1-8
    [111]GB.Dantzig, M.N.Thapa. Linear Programming. Springer-Verlag New York, LLC,1997.
    [112]Y.Ruber. Perceptual metrics for image database navigation,1999.
    [113]H.Bay, A.Ess, T.Tuytelaars, L.V.Gool. SURF:Speeded Up Robust Features[C]//Computer Vision and Image Understanding (CVIU),2008, 110(3):346-359
    [114]J.Shi, C.Tomasi. Good Features to Track[C]//CVPR,1994:593-600

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700