高性价比DSP中指令Cache的研究与设计
[Abstract]:System throughput is an important measure of processor performance, which is often limited by the capabilities of CPU. Studies have shown that the price difference between the fast memory chip and the next fastest speed level can be as high as 50% or 100%, but the memory can only get an additional 20% speed. Unfortunately, the speed of the CPU cannot be increased accordingly. So in order to develop a kind of DSP processor with high performance-price ratio, the design of Cache has become a key factor to improve the performance of processor system. The research of Cache has become a hot topic in the development of DSP processor. In this paper, the design of "two-stage buffer" architecture is presented for 32-bit fixed-point DSP processor AXP32 (the main frequency of AXP32 can be up to 150 MHz, peripheral frequency up to 40 MHz),). In order to improve the speed of CPU to refer to external memory, an instruction Cache, is inserted between CPU and external memory, which is called "first stage cache". The design capacity of the instruction Cache is 512B, the block size is 16B, and the image rule is the direct image. According to the basic flow of digital IC design, the instruction Cache is divided into two modules: data part and control part. The data part mainly realizes the function of searching and comparing, while the control part realizes the next operation of instruction Cache according to the lookup result. The design of the "second level cache" is mainly based on the external SPI FLASH,. In order to speed up the communication between the DSP kernel and the external FLASH, an asynchronous FIFO. is inserted between the SPI peripheral and the DSP kernel. The asynchronous FIFO uses Graycode count to synchronize the read / write pointer, and gives a detailed circuit design and description for the logic of the read-write address and the judgment logic of the empty full flag. In the design of this subject, we use Verilog programming language to realize the design of this subject at RTL level, and use the NC-Verilog simulation tool of Cadence to simulate the function of instruction Cache and asynchronous FIFO. After the simulation is passed, Synopsys's Design Compiler synthesis tool is used to optimize the code design. The results are as follows: the total area of instruction Cache is 0.351 mm2, the power consumption is 38.85 mW, the maximum clock frequency is 150 MHz; asynchronous FIFO, the total area is 0.011 mm2, the power consumption is 452.95 渭 W, and the read clock frequency is up to 100 MHz..
【学位授予单位】:湘潭大学
【学位级别】:硕士
【学位授予年份】:2017
【分类号】:TP332
【参考文献】
相关期刊论文 前5条
1 程光伟;刘大伟;;基于FPGA的异步FIFO的研究和设计[J];工业仪表与自动化装置;2013年02期
2 王晓燕;杨先文;陈海民;;密码嵌入式处理器中高速缓存的研究与设计[J];计算机工程与设计;2012年08期
3 邬春学;华乐;;异步FIFO控制器的设计[J];微型机与应用;2012年04期
4 杨晓刚;屈凌翔;张树丹;;DSP中指令Cache的低功耗设计[J];计算机工程与应用;2011年32期
5 李红桥;肖建青;张洵颖;龚龙庆;;流水线处理器中Cache模块的设计[J];科学技术与工程;2010年32期
相关硕士学位论文 前10条
1 朱志强;基于28nm工艺的低功耗触发器设计及优化[D];安徽大学;2015年
2 张波;基于SOC异步FIFO的设计与形式验证[D];西安电子科技大学;2015年
3 欧阳邦见;多核X-DSPX共享存储部件的设计与实现[D];国防科学技术大学;2013年
4 袁潇;DSP处理器中数据Cache的设计和验证[D];西安电子科技大学;2013年
5 江滔;一种COFDM无线视频传输系统的设计与实现[D];华中科技大学;2013年
6 付波;基于dsp的两级cache低功耗研究与实现[D];南昌大学;2012年
7 李桦林;多核SoC中高带宽片外数据交换与处理技术研究[D];合肥工业大学;2012年
8 徐文昌;SoC调试跟踪系统的设计与实现[D];西安电子科技大学;2012年
9 李帅;基于Vera语言的GPS/北斗Ⅱ双模基带芯片的验证方法及实现[D];暨南大学;2011年
10 刘彬;异步FIFO的设计与形式化验证[D];国防科学技术大学;2011年
,本文编号:2358411
本文链接:https://www.wllwen.com/kejilunwen/jisuanjikexuelunwen/2358411.html