用户名: 密码: 验证码:
低速率多带激励线性预测语音编码技术的研究
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
近年来,随着无线通信产业的发展,用户需求快速增长,导致可供使用的频谱资源越来越少,提高频谱利用率成为解决频谱资源短缺的重要手段。语音通信是无线通信的基本业务应用,低速率、高质量一直是其追求的目标。多带激励(Multi-Band Excitation, MBE)模型是语音编码算法中低速高质的一种典型代表。该模型采用了参数编码,相对于波形编码降低了编码速率;同时,此模型将频段进行了更细小的划分,提高了清/浊音判决的精确度,从而提高了语音的质量。
     本文工作包括语音通信中多模多带线性预测语音编码算法的研究以及矢量量化方法的改进,主要创新性成果如下:
     1.提出了一种多模多带激励线性预测(Multimode MBE with Linear Predibtive Coding, MMBE-LPC)语音编解码器模型。这种编解码器与现有的MBE编解码器相比在两个方面有所改进。一、解决了频谱幅度变维量化的难题。结合线性预测的方法,将维数变化的频谱幅度转换为维数固定的线性预测系数;并将线性预.测系数用线谱频率(Line Spectral Frequency, LSF)参数表示并进一步量化,保证了量化的精确度。二、提出了一种子带划分方法和清/浊音判决阈值。先确定每帧中子带的个数,再对各子带进行清/浊音判决,进而对判决结果进行模式划分。由于在不同的清/浊音判决模式下,LSF参数的统计特性不同,因此采用不同的码书对LSF参数矢量进行量化,提高了量化的质量。另外,该模型采用了一种与能量相关的、自适应的清/浊音判决阈值,比MBE的阈值设计更加简化。仿真结果显示,时域合成语音的清音区和浊音区划分清晰,且与原始语音一致,语谱图也拟合得很好。
     2.提出了一种量化LSF矢量参数的滑动平均多级分裂矢量量化方法(Moving Average Multi-Stage Split Vector Quantization, MA-MS-SVQ)。采用该方法生成码书,更充分地利用了线谱频率参数帧内和帧间的相关性,减小了码书的存储空间,降低了码书的搜索复杂度。这种矢量量化器将线谱频率参数去除平均值后进行一阶滑动平均预测,将残差进行三级矢量量化。在第二级量化时,将高维线谱频率参数矢量分裂成两个低维的部分,分别用不同的码书进行量化。仿真结果表明,在低速率编码下,合成语音的平均谱失真达到0.91dB,2dB~4dB的谱泄露为0.13%,无4dB以上谱泄露。码书的存储空间和搜索复杂度均降低了31%以上。
     3.提出了一种贪婪树初始码书间距最大化算法(Most Dispersed Greedy TreeGrowing Algorithm, MD-GTGA),用来设计Linde-Buzo-Gray (LBG)算法初始码书,解决了LBG算法容易陷入局部最优的问题。MD-GTGA首先采用贪婪树生长算法(Greedy Tree Growing Algorithm, GTGA)生成基础码书,然后再采用码书间距最大化算法(Most Dispersed Codewords in Initialization, MDCI)从基础码书中生成初始码书。在仿真中对随机法、分裂法、GTGA和MDCI算法进行了比较,结果表明,采用GTGA生成LBG算法初始码书,合成语音的平均谱失真最小。与GTGA和MDCI算法相比,MD-GTGA降低了量化的平均失真度与平均谱失真。
     4.提出了一种改进的成对最近邻(Improved Pariwise Nearest Neighbors, IPNN)算法,用来生成LBG算法初始码书。该算法首先采用随机法或者分裂法选定预备码书,然后采用PNN的合并方法,以最相邻原则将训练矢量依次合并到预备码书的码字中。仿真结果显示,与PNN (Pariwise Nearest Neighbors, PNN)算法相比,IPNN算法生成LBG初始码书的训练时间短;与随机法相比,用分裂法生成的预备码书性能更稳定。仿真测试结果表明,用该算法生成LBG初始码书,合成语音的平均谱失真在1dB左右,2dB-4dB的谱泄露小于2%,无4dB及以上的谱泄露。
With the rapid development of wireless communication, subscribers'demand increases fast. This leads to less and less available spectrum resources. An important way to solve above problem is to improve the spectrum efficiency. Speech communication, with objectives of low bit-rate and high quality, is a fundamental service in wireless communication. Multi-Band Excitation (MBE) algorithm is one of the typical representatives of low-rate and high-quality speech coding. Since it is a parameter coding method, the rate is lower than that of waveform coding. While the fine sub-band segmentation results in accurately Unvoiced/Voiced (U/V) decision, which follows high-quality synthetic speech.
     The work of this dissertation consists of investigations on Multimode MBE with Linear Prediction and improvements of vector quantization. The main contributions are as follows.
     1. Multimode MBE with Linear Prediction vocoder is proposed. Compared with available MBE coding, this vocoder has improvements at two points. Firstly, it sovles the problem of variable dimension vector quantization of spectral amplitudes.The variable-dimension spectral amplitudes are converted to fix-dimension Linear Predictive coefficients, which are denoted and quantized by Line Spectral Frequency (LSF) parameters, so that the precision of quantization is guaranteed. Secondly, a novel sub-bands segmentation method and a U/V decision threshold are designed. The number of sub-bands in one frame is fixed and U/V decisions are made in each sub-band, followed by the categration of modes. Due to the different statistic distributions in different modes, the LSF vectors are quantized by different codebooks to improve the quality of quantization. Moreover, an adaptive threshold related to energy, which is much simpler than MBE, is exploited for U/V decision. The simulation results indicate that the unvoiced and voiced parts of synthetic speech are distinguishable and consistent with the original speech, and the spectrum of synthetic speech well fits the original one.
     2. A Moving Average Multi-Stage Split Vector Quantizer (MA-MS-SVQ) is developed to quantize LSF parameters. Inter and intra frame correlations of LSF coefficients are fully utilized, and the storage and searching complexity of codebooks are reduced. LSF coefficients, the average of which is removed, are predicted by a first order moving average predictor. Then the residual LSF coefficients are quantized by a three-stage vector quantizer. In the second stage, each high-dimensional LSF coefficients vector is split into two low-dimensional parts, which are quantized respectively. Simulation results demonstrate that, under the precondition of low bit rate speech coding, the average spectral distortion of synthetic speech is 0.91 dB, the percentage of outliers between 2dB and 4dB is 0.13%and there is no outliers larger than 4 dB. Moreover, both storage and searching complexity of codebooks are reduced by more than 31%.
     3. A Most Dispersed and Greedy Tree Growing Algorithm (MD-GTGA) is proposed to generate Linde-Buzo-Gray (LBG) algorithm initial codebook. MD-GTGA overcomes the disadvantage of LBG algorithm that it is likely to be trapped into local optimizers. In this algorithm, a fundamental codebook was generated by Greedy Tree Growing Algorithm (GTGA), from which an initial codebook was obtained by Most Dispersed Codewords in Initialization (MDCI) algorithm. Random, Split, GTGA and MDCI algorithms are compared by simulation. The results show that the Average Spectral Distortion (ASD) of synthetic speech by GTGA is the smallest. Compared with GTGA and MDCI algorithm, both the average distortion and ASD are reduced by MD-GTGA.
     4. An Improved Pairwise Nearest Neighbors (IPNN) algorithm is constructed to generate the LBG initial codebook. In this algorithm, the preparative codebook is chosen randomly or by Split algorithm, and then the training vectors are merged one by one, by PNN algorithm, into a nearest code in preparative codebook. Simulation results indicate the decrease of the run time of the proposed algorithm, compared with PNN algorithm. It is also shown that, the final codebook obtained from Split preparative codebook is more stable than that from random method. Test is performed on IPNN algorithm, which shows that the average spectral distortion of syhthetic speech is around 1 dB, the percentage of outliers between 2dB and 4dB is less than 2%, and there is no outlier larger than 4dB.
引文
[1]郭梯云,杨家玮,李建东,数字移动通信(修订本),北京:人民邮电出版社,2001.
    [2]D.W.Griffin, J.S.Lim, "Multiband Excitation Vocoder," IEEE Transactions on acoustics, speech, and signal processing,1988,36(8):1223-1235.
    [3]李昌立,吴善培,数字语音——语音编码实用教程,北京:人民邮电出版社,2004.
    [4]赵晓群,数字语音编码,北京:机械工业出版社,2007.
    [5]王炳锡,语音编码,西安:西安电子科技大学出版社,2002.
    [6]赵力,语音信号处理,北京:机械工业出版社,2009.
    [7]鲍长春,“数字语音编码原理,西安:西安电子科技大学出版社,2007.
    [8]王洪,唐凯,低速率语音编码,北京:国防工业出版社,2006.
    [9]吴家安,现代语音编码技术,北京:科学出版社,2008.
    [10]Suat Yeldener, Marion R. Baraniecki, " A mixed harmonic excitation linear predictive speech coding for low bit rate applications," In Conference Record of the Thirty-Second Asilomar Conference on Signals, Systems& Computers, Pacific Grove, USA,Vol.l, pp.348-351,November 1998.
    [11]Paul C. Meuse, "A 2400 bps Multi-Band Excitation Vocoder," In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'90), pp.9-12, Albuquerque, NM, USA,1990.
    [12]Stachurski, J., McCree, A., Viswanathan, V, Heikkinen, A., Ramo, A., Himanen, S., Blocher, P., "Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s," IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP'03,), pp.II-153-156,2003.
    [13]Rahikka, D.J., Collura, J.S.,Fuja, T.E., Fazel, T., "US Federal Standard MELP vocoder tactical performance enhancement via MAP error correction," IEEE Military Communications Conference Proceedings (MILCOM), vol.2, pp.1458-1462,1999.
    [14]McCree, A., De Martin, J.C., "A 1.7 kb/s MELP coder with improved analysis and quantization," IEEE International Conference on Acoustics, Speech and Signal Processing, vol.2, pp.593-596,1998.
    [15]McCree, A., Kwan Truong,George, E.B., Barnwell, T.P., Viswanathan, V., "A 2.4 kbit/s MELP coder candidate for the new U.S. Federal Standard," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol.1, pp.200-203, 1996
    [16]McAulay R J.Quatieri T F, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Transactions on Acoustics, Speech and Signal Processing 34(4),pp.744-754,1986.
    [17]George, E.B., Smith, M.J.T., "Generalized overlap-add sinusoidal modeling applied to quasi-harmonic tone synthesis," IEEE Workshop on Applications of Signal Processing to Audio and Acoustics,pp.165-168,1993.
    [18]Macon, M.W., Clements M.A., "Sinusoidal modeling and modification of unvoiced speech," IEEE Transactions on Speech and Audio Processing,5(6),pp.557-560,1997.
    [19]George, E.B., Smith, M.J.T., "Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model," IEEE Transactions on Speech and Audio Processing,5(5),pp.389-460,1997.
    [20]Chong-White, N.R. Burnett, I.S., "Improved signal analysis and time-synchronous reconstruction in waveform interpolation coding," IEEE Workshop on Speech Coding, pp.56,2000.
    [21]Li, H. Lockhart, GB., " Non-linear interpolation in prototype waveform interpolation (PWI) encoders," IEE Colloquium on Techniques for Speech Processing and their Application, pp.3/1,1994.
    [22]Chong-White, N.R. Burnett, I.S., "Accurate, critically sampled characteristic waveform surface construction for waveform interpolation decomposition," IEEE Electronics Letters,36(14), pp.1245,2000
    [23]Kleijn, W.B., "Speech coding below 4 kb/s using waveform interpolation," Global Telecommunications Conference (GLOBECOM)'Countdown to the New Millennium. Featuring a Mini-Theme on:Personal Communications Services', pp.1879,1991.
    [24]John C. Hardwick, Jae S. Lim, "The Application of the IMBE Speech Coder to Mobile Communication," In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'91), pp.249-252, Toronto, Ont., Canada,1991.
    [25]Juan Carlos De Martin, Allen Gersho, "Mixed-Domain Coding of Speech at 3 Kb/s," In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'96), pp.216-219, Atlanta, GA, USA,1996.
    [26]H.Yang, S.-N. Koh, P.Sivaprakasapillai, "Enhancement of multiband excitation (MBE) by pitch-cycle waveform coding," IEEE electronics letters, Vol.30, No.20, pp.1645-1646, September 1994.
    [27]Dusheng Wang, Lizhong Li, Jiankang Zhang, "An adaptive variable low bit rate multi-band excitation speech coder," In 2nd IEEE Conference on Industrial Electronics and Applications,2007 (ICIEA2007), pp.2244-2247, Harbin,2007
    [28]Cheung-Fat Chan, Eric W.M. Yu, "Frequency-domain postfilter for multiband excitation coding of speech at low bit rates," In IEEE 3rd International Conference on Signal Processing proceedings (ICSP1996), Vol.1, pp.682-685, October 1996.
    [29]Dahan Han, Huijuan Cui, Kun Tang. "Multimode Quantization of LP Parameters in MBE-LPC-FE Coder+," IEEE 2002 International Conference on Communications, Circuits and Systems and West Sino Expositions, vol.1, pp.19-22, June 2002.
    [30]Huijuan Cui, Hao Jiang, "A robust 800 bps MBE coder with VQ and MLP", In Internatioanl Conference on Communication Technology (ICCT'98), pp.S36-09-1-S36-09-4, October 1998.
    [31]S. Yeldener, A.M. Kondoz, B.G. Evans, "Multiband linear predictive speech coding at very low bit rates," Vision, Image and Signal Processing, IEE Proceedings, Vol.141 Issue:5, pp.289-296, October 1994.
    [32]Haiyun Yang, Soo Ngee KOH, P. Sivaprakasapillai, Shuji Doshita, "A 5.4 kbps Speech Coder Based on Multi-Band Excitation and Linear Predictive Coding," In IEEE Region 10's Ninth Annual International Conference (TENCON'94) Theme:Frontiers of Computer Technology Proceeding of 1994, pp.417-421, Sinapore City, Sinapore,1994.
    [33]Huijuan Cui, Kun Tang, Haisheng Zheng, "Research on MBE Algorithm at Bit Rate 800bps-2.4kbps Vocoder," In International Conference on Communication Technology (ICCT'98), pp.S36-09-1-S36-09-4, Beijing, China, October 1998.
    [34]Tian Wang, Kun Tang, Chongxi Feng, "A High Quality MBE-LPC-FE Speech Coder at 2.4kbps and 1.2 kbps," IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'96), pp.208-211, Atlanta, GA, USA,1996.
    [35]Amitava Das, Ajit V. Rao, Allen Gersho, "Enhanced Multiband Excitation Codig of Speech at 2.4 Kb/s with Discrete All-Pole Spectral Modeling," In Global Telecommnications Conference (GLOBECOM'94), pp.863-866, San Francisco, CA, USA,1994.
    [36]Keith A. Teague, "An enhanced mutiband excitation speech coder at 2400b/s," In IEE Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, Vol.1, pp.214-217, November 2002.
    [37]J.S Mao, Z. Chen, S.C. Chan, T.S. Ng, K.L. Ho, "A 2000bps LPC vocoder based on multiband Excitation," In IEEE international Symposium on Circuits and Syetems, Hong Kong, Vol.4, pp.2653-2656,June 1997
    [38]Xinyi Zhang, Taihui Jiang, Ming Zhong, "A 2400bps improved MBELP vocoder," In IEEE 5th International Conference on Signal Processing proceedings(ICSP2000), Vol.2, pp.702-705,August 2000.
    [39]K.F.Lam, C.F. Chan, "MBE synthesis of speech coded in LPC format," In International Conference on Acoustics, Speech, and Signal Processing (ICASSP-95), Detroit, Vol. l,pp.668-671,May 1995
    [40]Simao F.Campos Neto, Franklin L. Corcoran, " Performance assessment of 4.8 kbit/s AMBE coding under aeronautical environmental conditions," IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 96), Vol.1, pp.499-502, May 1996
    [41]Cheung-Fat Chan, Wai-Kwong Hui, "Wideband enhancement of narrowband coded speech using MBE re-synthesis," In 3rd International Conference on Signal Processing, Vol.1667-670, October 1996
    [42]Xu Pei-xia, Chen Zhou, Liu Wen-Fei, "Wavelet analysis based multiband excited vocoder," In. IEEE Region 10 Conference o Computer, Communication, Control and Power Engineering (TENCON'93), Vol.3, pp.349-352, October,1993
    [43]Mark W. Chamberlain, "A 600bps MELP vocoder for use on HF channels," In IEEE Military Communications Conference (MILCOM 2001) Communications for Network-Centric Operations:Creating the Information Force. Vol.1, pp.447-453,2001
    [44]Y.P. Hong, J.S. Yang, "Voice suprted multimedia service on data communication," IEEE Region 10's Ninth Annual International Conference.Theme:Frontiers of Computer Technology (TENCON'94), Vol.1, pp.431-435,August 1994.
    [45]Zhang Jinjie, Cao Zhigang, "Investigation of IMBE Parameters'sensitivity to noise," In Global Telecommnications Conference (GLOBECOM'98), pp.3734-3739, Sydney, NSW, Australia,1998.
    [46]P. Bhattacharya, M.Singhal, Sangeetha, "An analysis of the weaknesses of the MBE coding scheme," In IEEE International Conference on Personal Wireless Communication(ICPWC'99), pp.419-422, February 1999.
    [47]Haiyun Yang, Soo Ngee KOH, Vimal Kishore DUBEY, "Performance of pitch Synchornous Multi-Band (PSMB) Speech Coder with Error-correction Coding," In Global Telecommunications Conference (GLOBECOM'95), pp.1590-1594, Singapore city, Singapore,1995.
    [48]C.-F. Chan, E.W.M. Yu, "Improving pitch estimation for efficient multiband excitation coding of speech," IEEE electronics letters, Vol.32, No.10, pp.870-872, May 1996.
    [49]Jianlin Hu, Sheng Xu, Jian Chen, "A modified pitch detection algorithm," IEEE Communication Letters, Vol.5,Issue.2, pp.64-66, February 2001.
    [50]Haiyun Yang, Soo-Ngee Koh, Pratab Sivaprakasapilai, "Pitch Synchronous Multi-Band (PSMB) Speech Coding," In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'95), pp.516-519, Detroit, MI, USA,1995.
    [51]B.M.G Cheetham, X.Q.Sun, PA, "Barrett. Phase Modelling for Low Bit-Rate Speech Coding," In IEE Seminar Speech Coding for Algorithms for Radio Channels, pp. 5/1-5/5, London UK,2000.
    [52]B.M.G Cheetham, H.B. Choi, X.Q.Sun, C.C.Goodyear, F.Plante, W.T.K.Wong, "All-Pass Excitation Phase Modelling for Low Bit-Rate Speech Coding," In IEEE International Symposium on Circuits and systems, pp.2633-2636, Hong Kong,China, 1997.
    [53]Masayuki Nishiguchi, Jun Matsumoto, Ryoji Wakatsuki, Shinobu Ono, "Vector quantization MBE with simplied V/UV division at 3.0 kbps," In IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP-93), Vol.2, pp.151-154, Minneapolis MN, April 1993.
    [54]Amitava Das, Ajit V.Rao, Allen Gersho, "Variable-dimention vector quantization of speech spectra for low-rate vocoders," Data Compression Conference (DCC'94), Proceedings, pp.420-429, March 1994
    [55]Amitava Das, Allen Gersho, "Variable Dimension Spectral Coding of Speech at 2400 bps and Below with Phonetic Classification," Proc. IEEE Conf. Acoust., Speech, Signal Processing, Detroit, vol.1, pp.492-495, May 1995.
    [56]Amitava Das, Ajit V. Rao, Allen Gersho, "Variable Dimension Vector Quantization", Proc. IEEE Signal Processing letters, vol.3, No.7, pp.200-202,July 1996.
    [57]Chunyan Li, Eyal Shlomot, Vladimir Cuperman, "Quantization of variable dimension spectral vectors," IEEE Conf. Record of the Thirty-second Asilomar conference on Signals, Systems& Computers, Vol.1, pp.352-356, Nov.1998
    [58]Chunyan Li, Peter Lupini, Eyal Shlomot, Vladimir Cuperman, "Coding of variable dimension speech spectral vectors using weighted nonsquare transform vector quantization," IEEE Trans. on speech and audio processing, Vol.9, No.6, pp.622-630, September 2001
    [59]Wai C. CHU, "A novel approach to variable dimension vector quantization of Harmonic magnitudes," Proceedings of the 3rd International Symposium on Image and Signal Processing and Analysis (ISPA 2003), Vol.1, pp.537-542, Sept.2003.
    [60]Imola K.Fodor, "A survey of dimension reduction techniques,"// http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.122.201&rep=rep1&type=pdf
    [61]Miguel A. Carreira-Perpinan, "A review of Dimension Reduction Techniques,"// http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.84.104&rep=rep1&type=pdf.
    [62]Kondoz A.M.. Digital Speech Coding for Low Bit Rate Communication Systems. Second Edition. Chichester:John Wiley& Sons, Ltd.,2004.
    [63]Chu Wai C.. Speech coding algorithms:Foundation and evolution of standardized coders[M]. Chichester:John Wiley& Sons, Ltd.,2003.
    [64]赵志杰,陈贺新,桑爱军.三维矩阵可变分割彩色图像压缩编码[J1.吉林大学学报(工学版),2009,39(01):194-197.
    [65]Han Da-han, Cui Hui-juan, Tang Kun. Multimode Quantization of LP Parameters in MBE-LPC-FE Coder[C]//IEEE International Conference:IEEE Press,2002.1, pp:19-22.
    [66]李靓,鲍长春,白燕宁.一种高效、低存储的线谱频率参数矢量量化器[J].北京工业大学学报,2005,31(2):130-135.
    [67]李晔,彭坦,许明,计哲,崔慧娟,唐昆.带有帧间级间预测的线谱频率参数多级矢量量化技术[J].清华大学学报(自然科学版),2009,49(7):965-967.
    [68]Pasi Franti, "Genetic algorithm with deterministic crossover for vector quantization," Pattern Recognition Letters 21(2000), pp.61-68,2000.
    [69]W.B.Mikhael, V. Krishnan, "Iterative algorithm for design of vector quantization codebooks in nonorthogonal domains," In IEEE electronics letters, Vol.38, No.12, pp.607-608, June 2002.
    [70]Allen Gersho, Shihua Wang, "Vector Quantization Techniques in Speech Coding," http://code.ucsd.edu/-zeger/publications/conferences/GeWaZe92-BookChapter/GeWaZ e92-BookChapter.pdf.
    [71]Paliwal K. K., Atal B.S.. Efficient vector quantization of LPC parameters at 24 bits/frame. IEEE Transactions on Speech and Audio Processing,1993,1(1):3-14.
    [72]Dongbo Liu, Guorong Liu, Miaohua Yu, Yingxu Wang, "An Image Retrieval Method Using Modified LBG Algorithm", In International Conference on Computer Science and Software Engineering, pp.528-531,2008.
    [73]Jim Z.C. Lai, Yi-Ching Liaw, Julie Liu, "A fast VQ codebook generation algorithm using codeword displacement," ScienceDirect Pattern Recognition 41(2008), pp.315-319,2008.
    [74]Timo Kaukoranta, Pasi Franti, Olli Nevalainen, "A fast exact GLA based on code vecor activity detection," IEEE Transactions on Image Processing, Vol.9, No.8, pp.1337-1342, August 2000.
    [75]Y Linda, A Buzo, R M Gray. An algorithm for vector quantizer design[J]. IEEE Trans on Communication,1980,28(1):84-95
    [76]王茂芝,徐文皙.LBG算法对初始码书敏感的实验性能分析[J].物探化探计算技术,2004,26(4):374-378
    [77]J. Feng, K-T Lo, "Feature-based initialization for image vector quantization codebook generation," IEEE electronics letters, Vol.36, No.24, pp.2005-2006, November 2000.
    [78]C.Chehwang, "Initial codebook design by modified splitting method," In IEEE electronics letters, Vol.38, No.12, pp.606-607, June 2002.
    [79]Huilin Xiong, M.N.S.Swamy, M.O. Ahmad, "Competitive Splitting for Codebook Initialization," IEEE signal processing letters, Vol.11, No.5, pp.474-477, May 2004
    [80]Li Bingbing, Chang Yilin, Cheng Jun, Hu Zheng, "An improved initial codebook generating method-orthogonal Incerment splitting algorithm," Proceedings of IEEE International Conference on Information Engineering, pp.613-616, July 1995.
    [81]Stephen So, Paliwal Kuldip K, "Efficient product code vector quantization using the switched split vector quantiser," (2007)[2010]//http://www.elsevier.com/locate/dsp
    [82]Roweis S T, "Factorial models and refiltering for speech separation and denoising," In:Proceedings of the 8th European Conference on Speech Communication and Technology, Geneva, Switzerland:ISCA,2003.1009-1012
    [83]许晓斌,丁丰,林碧琴,袁保宗.“基于K-means算法的语境相关矢量量化”,自 动化学报,2000,26(3):369-372.
    [84]李碧,林土胜,廖亮.初始码字间距最大化的矢量量化码书设计算法[J].小型微型计算机系统,2009,30(4):780-783
    [85]Katsavounidis Ioannis, Kuo C-C Jay, Zhang Zhen. A new initialization technique for generalized Lloyd iterarion," IEEE Signal Processing Letters, Vol.1, No.10, pp.144.146,Oct.1994.
    [86]李弼程,文超,平西建.两个优于分裂法的初始码书设计方法[J].中国图象图形学报,2000,5(A)(1):48-51
    [87]Riskin Eve A, Gray Robert M, "A greedy tree growing algorithm for the design of variable rate vector quantizers," IEEE Trans. On Signal Processing,1991,39(11):
    [88]KarayiannisN B, P in2I Pai,"Fuzzy vector quantization algorithmand their app lication in image compression," IEEE Transactions on Image Processing,1995,24 (9): 1193-1201.
    [89]Zhang Jihong, Li Xia, Xie Wiexin, "A stochastic competitive learning vector quantization algorithm for image coding," Acta Electronica Sinica,2000,28 (10): 23-26.
    [90]Yong Zhengzheng, Luo Ping, Wu Q H, et al, "A new vector quantization image coding algorithm based on evo lutionary simulated annealing,"Acta Electronica Sinica, 2001,29 (5):653-656.
    [91]Li Xia, Luo Xue2hui, Zhang Jihong, "Codebook design for image vecto r quantization with ant colony optim ization," Acta Electronica Sinica,2004,32 (7): 1082-1085.
    [92]Jungeun Parkj, Sangwon Kang, "Block Constrained Trellis Coded Vector Quantization of LSF Parameters for Wideband Speech Codecs," ETRI Journal,2008, 30 (5):738-740
    [93]Saikat Chatterjee, T.V.Sreenivas, "Optimum switched split vector quantization of LSF parameters," Signal Processing journal, Elsevier,2008,88:1528-1538
    [94]Timo Kaukoranta, Pasi Franti, Olli Nevalainen, "Vector quantization by lazy pairwise nearest neithbor method," Optiacl Engineering, Vol.38, No.11, pp.1862-1868, November 1999.
    [95]Equtitz William H. a new vector quantization clustering algorithm[J]. IEEE Transactions on Acoustics, speech and Signal Processing,1989,37(10):1568-1575.
    [96]Pasi Franti, Timo Kaukoranta, "Fast Implementation of the Optimal PNN Method," Proceedings of the 1998 IEEE International Conference on Image Processing (ICIP-98), Chicago, Illinois, USA, IEEE Computer Society,1998: 104-108.
    [97]Pasi Franti, T Kaukoranta, D.-F. Shen, et al, "Fast and memory efficient implementation of the exact PNN[J]. IEEE Transactions on Image Processing,2000, 9 (5):773-777.
    [98]Pasi Franti, Olli Virmajoki, Ville Hautamaki, "Fast PNN-based Clustering Using K-nearest Neighbor Graph,"//Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), Melbourne, Florida, USA, IEEE Computer Society,2003: 525-528.
    [99]Olli Nevalainen, Pasi Franti, Timo Kaukoranta, "Practical mehthods for speeding-up the pairwise nearest neighbor method," Optiacl Engineering, Vol.40, No.11, pp.2495-2504, November 2001.
    [100]Pasi Franti, Timo Kaukoranta, "Fast Imlementation of the Optimal PNN method," In International Conference on Image Processing (ICIP 98), Vol.3, pp.104-108, October 1998.
    [101]吴婷婷,曾毓敏,李平.利用PNN算法改进初始码书的GLA算法[J].数据采集与处理,2007,22(2):242-244
    [102]王向阳,左苛可.一种新的混合域矢量量化图像压缩编码算法[J].小型微型计算机系统,2009,30(6):1142-1147
    [103]刘丽娟,邹雪城.快速编码算法的VLSI体系结构设计[J].华中科技大学学报(自然科学版),2006,34(1):46-49
    [104]王磊,戚飞虎.矢量量化的渐进构造模糊聚类算法[J].上海交通大学学报,1999,33(5):523-525
    [105]张一,成礼智.结合改进的PCC算法与LBG算法的高效码书设计[J].系统工程与电子技术,2006,28(3):470-473
    [106]张正阳,段勇,吴成柯.一种快速高效的VQ初始码书设计方法[J].西安电子科技大学学报,1998,25(5):549-554.
    [107]Eluned S Parris, Michael J Carey, "Language independent gender identification," In IEEE international conference on Acouctics Speech, and Signal Processing (ICASSP-96), Atlanta, Vol.2, pp.685-688, May 1996.
    [108]Huaiyu Dai,Zhigang Gao, "A Joint Speech Coding-Enhancement Algorithm for MBE Vocoder," In International Conference on Communication Technology (ICCT'98), pp.S36-05-1-S36-05-4, Beijing, China,1998.
    [109]Kyu J. Han, Samuel Kim, Shrikanth S. Narayanan, " Strategied to improve the robustness of agglomerative hierarchical clustering under data source variation for speaker diarization," IEEE Trans. on Audio, Speech, and Language processing, Vol.16, No.8, pp.1590-1600, November 2008
    [110]Zahir Raza, Fady Alajaji, Tamas Linger, "Design of Sample Adaptive Product Quantizers for Noisy Channels," IEEE Transactions on Communications, Vol.53, No.4, pp.576-580, April 2005.
    [111]A.Zahedi, K. Pahlavan, "Voice and data integration on TCP/IP wireless networks," In 8th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC'97), Vol.2, pp.678-682, September 1997.
    [112]Jorg-Martin Muller, Bertram Wachter, "A Codec Candidate for the GSM Half Rate Speech Channel, In IEEE International Conference on Acoustics," Speech and Signal Processing (ICASSP'94), pp.I-257-I-260, Adelaide, SA, Australia,1994.
    [113]R.C. Lamare, A.Alcaim, "Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec," IEE Proc.-vis Image Signal Process., Vol.152, No.l, pp.74-86, February 2005
    [114]Rainer Bauer, Joachim Hagenauer, "Symbol-by-symbol MAP decoding of variable length codes," http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.35.119&rep=rep1&type=pdf.
    [115]Kiyoshi Mizui, Syuuichi Wakabayashi, Masatoshi Satoh, Yasuhiro Minami, Masao Nakagawa, "Variable Bit Rate PARCOR-VQ Hybird Vocoder," In Global Telecommunications Conference, (GLOBECOM'89) and Exhibition Communications Technology for the 1990s and Beyond, pp.1885-1889, Dallas, TX, USA,1989.
    [116]Engin Erzin, Arun Kunar, Allen gersho, "Natural Quality Variable-Rate Spectral Speech Coding Below 3.0 Kbps," In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'97), pp.1579-1582, Munich, Germeny,1997.
    [117]Eric W.M.Yu, Cheung-Fat Chan, "Variable bit rate MBELP speech coding via V/UV distribution dependent spectral quantization," IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP97), Vol.2, pp.1607-1610, April 1997.
    [118]Ahsun H. Murad, Thomas E. Fuja, "Joint source-channel decoding of variable-length encoded sources," In Information Theory Workshop, pp.94-95, June1998.
    [119]Khalid Sayood, Hasan H.Out, Nejat Demir, "Joint source/channel coding for variable length codes," IEEE Transactions on Communications, Vol.48, No.5, pp.787-794, May 2000.
    [120]Jianjun Liu, Guofang Tu, Can Zhang, Yang Yang, "Joint source and channel decoding for variable length encoded Turbo codes," EURASIP Journal on Advances in Signal Processing,Vol.2008, Article ID 149839,7 pages.
    [121]James G Dunham, Robert M. Gray, "Joint source and noisy channel trellis encoding," IEEE Transactions on Information Theory, Vol.IT-27, No.14, pp.516-519, July 1981.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700