摘要
由于二维傅里叶变换计算量大,会导致在嵌入式应用过程中速度过慢。为此本文实验了一种基于Xilinx Zynq芯片的片内硬件加速实现方式,主要利用片内的可编程逻辑资源来完成变换过程中的大量计算,利用片内的处理器系统完成整个算法实现过程中的数据传输与调度。在获得FPGA提供的并行计算的速度优势同时,又保留了处理器系统软件开发的灵活性。借助于Xilinx提供的一维快速傅里叶变换IP核与Xillybus提供的总线方案,本文的实验通过软硬件结合的方式实现了二维傅里叶变换算法,与OpenCV计算比较,计算速度显著提高。
As the two-dimensional Fourier transform requires a lot of calculation,so it often encounter the problem of slow speed in embedded applications.In the paper,a hardware acceleration method based on the Xilinx Zynq chip is designed,which mainly uses the onchip programmable logic resources to complete the large number of computation in the transformation process and uses the on-chip processor system processing system to complete the entire algorithm to achieve the process of data transmission and scheduling.This method not only obtains the speed advantage of parallel computing provided by FPGA,but also retains the flexible characteristics of processor system software development.In this paper,the two-dimensional Fourier transform algorithm is implemented by combining the hardware and software with the one-dimensional fast Fourier transform IP core provided by Xilinx and the bus scheme provided by Xillybus.Comparing with the OpenCV library function,the calculation speed is obviously improved.
引文
[1]Russell M,Fischaber S.OpenCV based road sign recognition on Zynq[C]//IEEE International Conference on Industrial Informatics.IEEE,2013:596-601.
[2]张全,鲍华,饶长辉,等.GPU平台二维快速傅里叶变换算法实现及应用[J].光电工程,2016(2):69-75.
[3]李硕,王茜蒨.基于FPGA的二维FFT实现[C]//中国电子学会青年学术年会,2012.
[4]方睿,刘加贺,薛志辉,等.卷积神经网络的FPGA并行加速方案设计[J].计算机工程与应用,2015,51(8):32-36.
[5]陈实.大邻域图像处理硬件加速的研究[D].北京:清华大学,2009.
[6]吴良晶,曹云峰,丁萌,等.SoC FPGA的视觉算法加速系统设计[J].单片机与嵌入式系统应用,2016,16(11):58-62.
[7]王晓璐.基于Zynq的LS-SVM算法加速器设计[D].哈尔滨:哈尔滨工业大学,2015.
[8]Kryjak T,Komorkiewicz M,Gorgon M.Real-time hardwaresoftware embedded vision system for ITS smart camera implemented in Zynq SoC[J].Journal of Real-Time Image Processing,2016:1-37.
[9]Crockett L H,Elliot R A,Enderwitz M A,et al.The Zynq Book:Embedded Processing with the Arm Cortex-A9on the Xilinx Zynq-7000 All Programmable Soc[M].Strathclyde Academic Media,2014.
[10]Hunter T M,Denisenko D,Kannan S,et al.FPGA Acceleration of Multifunction Printer Image Processing using OpenCL,2014.