Aiming at the problems of ultra-long points FFT implementation and computational efficiency encountered during the migration of DSP platform algorithms, this article combines TI's TMS320C6678 DSP, we use the FFT decomposition algorithm and the efficient memory access efficiency of the L2 memory segment to design a parallel processing method for ultra-long points FFT calculation by paralleling the EDMA movement of the DSP memory data and the FFT decomposition calculation.The specific implementation process of the method is described by 262144-points FFT calculation, and the calculation result of DSP is compared with the result of MATLAB, and the calculation error of the method is 10-4.Finally, on the DSP platform, when the number of FFT points is the same, the calculation time of directly calling the FFT library function is compared with the calculation time using this method, which shows that it has better calculation performance in the ultra-long points FFT calculation.
Key words
real-time signal processing /
ultra-long points FFT calculation /
DSP /
TMS320C6678 /
EDMA /
parallel computing
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
References
[1] 李启虎.进入21世纪的声纳技术[J].应用声学, 2002, 21(1):1318.
[2] 侯自强.微处理机和数字技术在声信号处理中的应用[C]//第二届应用声学学术会议论文集, 1984:123131.
[3] 张培钟, 缪晨.雷达信号处理中大数据量FFT的实现[J].微波学报, 2017, 8(33).
[4] Texas Instruments Inc.TMS320C66x DSP CPU and Instruction Set Reference Guide, 2010.
[5] Texas Instruments.TMS320C6678 multicore fixed and floating-point digital signal processor data manual, 2012.
[6] Texas Instruments.Keystone architecture multicore shared memory controller (MSMC) user guide, 2012.
[7] Texas Instruments.Very large FFT for TMS320C6678 processors , 2015:15.
[8] 贺荣.基于多核 DSP 芯片的多载波通信波形实现[J].电子元器件与信息技术, 2020, 4(3):6466.