用户名: 密码: 验证码:
可伸缩视频编码传输速率控制技术研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
近年来,随着网络传输技术和视频编码技术的飞速发展,基于网络的视频服务得到了广泛的应用。现有的视频应用环境具有终端设备多样、网络形式异构和用户需求复杂等特点,给网络视频应用带来了诸多的困难和挑战。随着用户需求的发展,人们越来越不满足于现有的视频传输技术,而寻求更加便捷和有效的视频通信方法。视频传输过程会极大影响终端用户的视频重建质量。在实际应用环境中,传输信道的带宽是受限且时变的,速率约束是影响视频传输的主要因素之一。视频传输速率控制是解决视频码流在时变信道传输时可能造成接收端视频主客观质量下降的有效方法。研究视频的传输速率控制技术具有重要的理论意义和实用价值。
     本文在深入分析近年来流行的视频传输技术实时传输协议RTP (Realtime Transport Protocol)和基于HTTP的动态自适应流媒体DASH (Dynamic Adaptive Streaming over HTTP)的技术特点和应用场景的基础上,结合最新的可伸缩视频编码SVC (Scalable Video Coding)国际标准,研究相应的传输速率控制技术,以解决不同应用环境下视频的可靠传输问题。本文的主要工作和创新点如下:
     1.给出分组交换网络基于数据包速率的传输特性;
     本文对基于分组交换网络的视频传输过程进行分析,针对视频传输过程的特性进行合理简化,对影响网络丢包率的原因进行理论分析和实验仿真。结果表明,在网络丢包率较低的情况下,若突发数据长度与队列缓冲区长度近似,则网络丢包率与码流数据包速率之间具有直接的联系,相同数据包速率的码流对网络拥塞造成的影响基本相同。该结果指出低丢包率环境下视频传输更为有效的性能衡量标准,为传输速率控制方法提供理论依据和比较标准。
     2.提出针对不同应用环境的速率控制方法;
     基于已有的针对分组交换网络传输特性的研究结果,针对视频在高拥塞有线网络上传输的情况,提出了拥塞感知的传输速率控制方法。该方法基于中等粒度质量可伸缩MGS (Medimum Grain Scalability),在不加剧传输网络拥塞状态从而不引起数据包丢失的情况下,有效提高接收端视频重建质量。实验表明,该方法可以获得平均0.4dB的性能提升。
     针对视频传输时从有线网络到无线网络的转换过程,提出了自适应的数据包封装方法。该方法通过综合考虑无线网络比特错误与数据包长度的关系,以及视频码流不同部分重要性的差异,有效利用无线信道的传输带宽,从而提高接收端视频重建质量。实验表明,该方法可以获得平均0.72dB的性能提升。
     针对网络严重拥塞出现数据包丢失的情况,综合分析已有前向纠错编码的优缺点,提出了全新的帧内前向纠错保护方法IP-FEC (Internal Packet FEC)。提出的IP-FEC可以在不增大网络丢包率的情况下,提供一定的错误保护能力。同时,提出的IP-FEC与传统的FEC方法可以简单级联,从而保证不与现有系统相冲突。实验表明,提出的IP-FEC方法相比于无FEC情况,在3%丢包率情况下可以获得平均4.7dB的性能提升,在5%丢包率情况下可以获得平均4.9dB的性能提升;相比于增加10%校验数据包的FEC方法,在3%丢包率情况下可以获得平均1.1dB的性能提升,在5%丢包率情况下可以获得平均1.7dB的性能提升。
     3.提出DASH上的基于的SVC快速码流切换方法
     本文基于最新的DASH视频传输方法,分析了现有网络条件下视频通信的困难,针对网络缓冲区较小和缓冲更新算法不匹配的问题,提出了自适应预测更新算法。通过估计用户选择节目内容和用户信道带宽变化的转移概率,给出了近似最优基于SVC的缓冲更新算法。实验表明,提出的自适应更新算法在用户以较大概率切换节目的场景下可以提升平均约12.6%的缓存命中率,同时在其它场景下提供不弱于现有算法的性能。
Along with the fast development of network and video coding techniques, the network-based video service has been widely applied. There are mainly three characteristics in the current video application environment:heterogeneous networks, various receiving terminals and complexity of users' demands. These features lead to difficulties as well as challenges for network-based video service. With the increasement of user requirements, people are less satisfied with the current video transmision methods and seek for a more convenient and effective video communication solution. The video transmission process will greatly affect the quality of the reconstructed video. In practical environment, the transmission channel is bandwidth limited and time varying and the rate constraint is an important factor affecting the video transmission. Video transmission rate control is an effective way to solve the problem of potential degrading of the video quality, both objectively and subjectively, at receiver side due to the imperfect transmission channel. Therefore, the video transmission rate control has theoretical significance and important practical usage.
     The dissertation studies the transmission rate control under the basis of scalable video coding (SVC). The main contents and novelties are as follows:
     1) This dissertation investigates the transmission characteristics with respect to the packet rate of the packet switching network.
     This dissertation analyzes and models the transmission process based on the packet-switched network, simplifies the model under the characteristics of video transmission and simulates the video trasnmission process. The results show that when the packet loss rate is low and the length of burst is as same as the length of buffer, the packet loss rate is directly related to the packet rate of the stream. The streams with the same packet rate will bring the same network congestion. The result gives a better performance metrics for video transmission under low packet rate and provides theory basis and standard for the performance comparison method.
     2) This dissertation proposes the transmission rate control methods for different scenarios.
     Based on the research results of the packet-switched network, this dissertation proposes the congestion aware transmission rate control method for video transmission under the high congestion wire network. The method is based on the medimum grain scalability of SVC, and provides better reconstruction video quality while not aggravating the network congestion to bring packet loss. The results show that the method can get an average0.4dB gain.
     This dissertation proposes the adaptive packet encapsulation method for the video transmission from wire to wireless. The method considers the relationship among the wireless bit error rate, packet length and the importance of the different parts of the video stream. The method can utilize the bandwidth of the wireless channel and gives a better reconstruction video quality. The results show that the method can get an average0.72dB gain.
     This dissertation focuses on the situation of serious congestion network with packet loss. This dissertation analyzes the previous FEC methods and proposes the internal packet FEC(IP-FEC). The IP-FEC provides a certain degree of error protection capability without aggravating the network congestion. The IP-FEC can be easily combined with the previous FEC methods. The results show that the IP-FEC can get an average4.7dB gain compared with no-FEC under3%packet loss rate and an average4.9dB gain under5%packet loss rate,1.1dB and1.6dB respectively compared with traditional FEC with10%parity packets. The reconstruction video quality is better than the previous FEC method with at least10%parity.
     3) This dissertation proposes the novel fast stream switching method.
     This dissertation analyzes the problems of video communication under current network conditions based on the latest DASH method. This dissertation focuses on the the problems of small buffer and buffer replacement algorithm mismatch, proposes the adaptive predict update algorithm. The proposed algorithm gives the approximate optimal replacement results from estimating the transition probabilities of the program content and channel bandwidth. The results show that the proposed algorithm can achieve about12.6%hit ratio gain under the high switching scenario and comparable performance under other scenes.
引文
中国国家质监总局中国家标准化管理委员会。2006。GB/T 20090.2-2006。信息技术先进音视频编码第2部分:视频[S]。中国标准出版社。
    Abdelhamid Nafaa, Tarik Taleb, and Liam Murphy, "Forward Error Correction Strategies for Media Streaming over Wireless Networks", IEEE Communication Magazine, vol.46, no.1, pp. 72-79, January 2008.
    Brennan M. and Syn M., "Television Viewing Behavior During Commercial Breaks",2001 Australian & New Zealand Marketing Academy Conference (Anzmac 2001), Dec 2001.
    Cheng-Han Lin, Ce-Kuen Shieh, Naveen K. Chilamkurti, Chih-Heng Ke, and Wen-Shyang Hwang, "A RED-FEC Mechanism for Video Transmission over WLANs", IEEE Transactions on Broadcasting, vol.54, no.3, pp.517-524, September 2008.
    Cisco,2010. "Cisco 12000 series internet router architecture:Packet switching". http://www.cisco.com/en/US/products/hw/routers/ps167/products_tech_note09186a00801e1dc 1.shtml
    Cisco,2012. Cisco Visual Networking Index:Forecast and Methodology,2011-2016. http://www.cisco.com/en/US/solutions/collateral/ns341/ns525/ns537/ns705/ns827/white_paper c11-481360.html
    G. Bjontegaard, "Calculation of average PSNR differences between RD curves," ITU-T Doc. VCEG-M33, Mar.2001.
    H. Bahn, K. Koh, S. H. Noh, and S. Lyul Min, "Efficient Replacement of Nonuniform Objects in Web Caches", IEEE Computer Magazine, Vol.35, No.6, P.65-73, June 2002.
    Hua Yang, and Kenneth Rose, "Advances in Recursive Per-Pixel End-to-End Distortion Estimation for Robust Video Coding in H.264/AVC", IEEE Transactions on Circuits and Systems for Video Technology, Vol.17, No.7, July 2007.
    I. Amonou, N. Cammas, S. Kervadec, and S. Pateux, "Optimized Rate-Distortion Extraction With Quality Layers in the Scalable Extension of H.264/AVC", IEEE Transactions on Circuits and System for Video Technology, vol.17, no.9, Sep.2007.
    I. S. Reed and G. Solomon. "Polynomial codes over certain finite fields", Journal of the Society for Industrial and Applied Mathematics,8(2):300-304, June 1960
    IETF,2003. RFC 3550:RTP:A transport protocol for real-time application, July 2003.
    IETF,2004. L-A. Larzon, M. Degermark, et al. "The lightweight User Datagram Protocol (UDP-Lite)", RFC3828.
    IETF,2011. IMIX Genome:Specification of variable packet sizes for additional testing[S].
    ISO/IEC JTC 1.2004. ISO/IEC 14492-2 (MPEG-4 Visual). Coding of audio-visual objects—Part 2:Visual [S].3rd ed.
    ISO/IEC.1993. Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s—part 2:video[S].
    ISO/IEC.2012. ISO/IEC 23009-1:2012. Information technology--Dynamic adaptive streaming over HTTP (DASH) -- Part 1:Media presentation description and segment formats[S].
    ITU-T,2009. T.802:Information technology-JPEG 2000 image coding system:Motion JPEG 2000.2005-01. Retrieved 2009-11-01.
    ITU-T, ISO/IEC.1994. Generic coding of moving pictures and associated audio information—part 2:video[S].
    ITU-T, ISO/IEC.2005. Advanced video coding for generic audiovisual services[S].
    ITU-T, ISO/IEC.2007. ITU-T Rec. H.264 and ISO/IEC 14496-10. Advanced video coding for generic audio-visual services[S].8th ed.
    ITU-T.1993. ITU-T Rec. H.261.Video coding for audiovisual services at p×64kbit/s [S].2nd ed.
    ITU-T.2000. ITU-T Rec. H.263. Video coding for low bit rate communication [S].3rd ed.
    J. Korhonen, and Ye Wang, "Effect of Packet Size on Loss Rate and Delay in Wireless Links," Wireless Communications and Networking Conference, IEEE,2005.
    J. Lacan, V. Roca, J. Peltotalo, and S. Peltotalo. Reed-Solomon Forward Error Cor-rection (FEC) Schemes. Internet RFCs, RFC 5510, April 2009.
    K. Park, and W. Wang, "AFEC:An Adaptive Forward Error Correction Protocol for End-to-End Transport of Real-Time Traffic", International Conference on Computer Communications and Networks, pp.196-205, October 1998.
    K. Takahata, N. Uchida, and Y. Shibata, "Packet Error and Frame Rate Controls for Real Time Video Stream over Wireless LANs", International Conference on Distributed Computing Systems Workshops, pp.594-599,2003.
    L.B. Lim, L. Guan, A. Grigg, I.W.Philips, et al, "RED and WRED Performance Analysis Based on Superposition of N MMBP Arrival Process", Internation Conference on Advanced Information Networking and Applications,2010.
    Liu H, Li HQ, Wang YK.2005. Showcase of scalability information SEI message [M]. JVT Doc: JVT-Q067.
    M. M. Hannuksela, H. Zhu, H. Li, and M. Gabbouj, "Congestion Aware Transmission Rate Control using Medium Grain Scalability of Scalable Video Coding", International Conference on Image Processing (ICIP),2010.
    M. Van Der Schaar, S. Krishnamachari, S. Choi, and X. Xu, "Adaptive Cross-Layer Protection Strategies for Robust Scalable Video Transmission Over 802.11 WLANs", IEEE Journal on Selected Areas in Communications, Vol.21, No.10, Dec.2003.
    Meng Liu, Yi Guo, Houqiang Li, Chang Wen Chen, "Low-complexity rate control based on p-domain model for Scalable Video Coding", International Conference on Image Processing, ICIP,2010.
    N. Hohn, D. Veitch, K. Papagiannaki, and C. Diot, "Bridging Router Performance and Queuing Theory", Proceedings of the joint international conference on Measurement and modeling of computer systems, vol.32, no.1, pp.355-366, June 2004.
    Nick McKeown, "The iSLIP Scheduling Algorithm for Input-Queued Switches", IEEE/ACM Transactions on Networking, vol.7, no.2, pp.188-201, April 1999.
    Ohm JR.1994. Three-dimensional subband coding with motion compensation [J]. IEEE Trans. Image Processing.3(5):559-571.
    Roman Chertov, and Sonia Fahmy, "Forwarding Devices:From Measurements to Simulations", ACM Transactions on Modeling and Computer Simulation, vol.21, no.2, February 2011.
    S. Podlipnig, L. Boszormenyi, "A survey of Web cache replacement strategies", ACM Computering Surveys, Vol.35, P.331-373,2003.
    S. Wenger, M. M. Hannuksela, T. Stockhammer, M. Westerlund, and D. Singer, "RTP payload format for H.264 video," IETF RFC 3984, Feb.2005.
    S. Wenger, Y.-K. Wang, T. Schierl, and A. Eleftheriadis, "RTP payload format for SVC video," draft, Internet Engineering Task Force (IETF), September 2009.
    Sally Floyd, and Van Jacobson, "Random Early Detection Gateways for Congestion Avoidance", IEEE/ACM Transactions on Networking, vol.1, no.4, pp.397-413, August 1993.
    Schwarz H, Hinz T, Marpe D, et al.2005. Constrained inter-layer prediction for single-loop decoding in spatial scalability [C]. IEEE Proc. ICIP.2:870-873.
    Schwarz H, Marpe D, Wiegand T.2006. Analysis of hierarchical B pictures and MCTF. IEEE Proc. ICME.2006:1929-1932.
    T. Wiegand, J. Ohm, G. Sullivan, and A. Luthra. Special Issue on Scalable Video Coding Standardization and Beyond. IEEE Transactions on Circuits and Systems for Video Technology, vol.17, no.9, Sep.2007.
    Weiping Li, "Overview of fine granularity scalability in MPEG-4 video standard", IEEE Transactions on Circuits and Systems for Video Technology, Vol.11, No.3, Mar 2001.
    Wu Yuan, Shouxun Lin, Yongdong Zhang, Wen Yuan, Haiyong Luo, "Optimum bit allocation and rate control for H.264/AVC", IEEE Transactions on Circuits and Systems for Video Technology, Vol.16, No.6, June 2006.
    Y. Sanchez, T. Schierl, C. Hellge, et al. "Efficient HTTP-based streaming using Scalable Video Coding", Signal Processing:Image Communication, Vol.27, No.4, April 2012.
    Y. Sanchez, T. Schierl, C. Hellge, et al. "iDASH:Improved Dynamic Adaptive Streaming over HTTP using Scalable Video Coding", Proceedings of the second annual ACM conference on Multimedia systems,2011.
    Y. Shan, "Cross-layer techniques for adaptive video streaming over wireless networks," EURASIP Journal on Applied Signal Processing, vol.2005:2, pp.220-228.
    Yi Guo, Houqiang Li, Ye-kui Wang.2005. SVC/AVC loss simulator, JVT-Q069[M].
    Yi Guo, Ying Chen, Ye-kui Wang, Houqiang Li, M.M. Hannuksela, "Error Resilient Coding and Error Concealment in Scalable Video Coding", IEEE Transactions on Circuits and Systems for Video Technology, Vol.19, No.6, June 2009.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700