PDF(1302 KB)
使用多播自适应路由加速缓存一致性片上网络的监听及监听响应过程
胡东伟, 巴晓辉, 刘耿亭, 王力男, 雷岳俊
集成电路与嵌入式系统 ›› 2025, Vol. 25 ›› Issue (8) : 81-90.
PDF(1302 KB)
PDF(1302 KB)
使用多播自适应路由加速缓存一致性片上网络的监听及监听响应过程
Accelerating the snooping & snooping response process of cache-coherent network-on-chip with multicast adaptive routing
In Cache-Coherent Network-on-Chip (NoC) of many-core CPU, the snooping and snooping response Process (SNP Process) incurs long latency. To address this, two techniques: multicast routing and adaptive routing are proposed in this paper. According to the requirements of these two techniques, the NoC packet formats for Snooping Request Channel (SNP REQ Ch) and Snooping Response Channel (SNP RESP Ch) are proposed, and furthermore, the NoC routers of SNP REQ Ch and SNP RESP Ch are VLSI implemented. The implementation results show that the routers for both SNP REQ Ch and SNP RESP Ch are of 85 940.3 μm2 or 103 518.5 μm2, while an 8×8 network occupies 5.57 mm2, which is feasible for large-scale chips. Simulations are employed to compare the latencies of 4 configurations: unicast determined routing, unicast adaptive routing, multicast determined routing, and multicast adaptive routing. The simulation results show that the latency of SNP Process with multicast adaptive routing could be cut by 45% for a single snooping request comparing to that with unicast determined routing, resulting in a much shorter latency than DDR/HBM access, and by 73% for 32 consecutive snooping requests with outstanding technique employed at the Point of Coherency (PoC), which validate the effectiveness of the proposed techniques.
network-on-chip / cache coherency / adaptive routing / multicast routing
| [1] |
Intel Delivers Cutting-Edge Process Technologies to the Data Center with Intel 18A and Advanced Chiplet Packaging[OL]. https://www.intel.cn/content/dam/www/central-libraries/us/en/documents/2024-02/intel-tech-clearwater-wp.pdf.
|
| [2] |
胡东伟, 尚德龙, 张勇, 等. 时钟及面积优化的可配置片上网络路由器[J]. 西安电子科技大学学报, 2022, 49(2):130-139.DOI: 10.19665/j.issn1001-2400.2022.02.015.
|
| [3] |
|
| [4] |
|
| [5] |
张阿敏. 基于片上网络的众核高速缓存一致性研究[D]. 合肥: 合肥工业大学, 2018.
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
AMBA CHI Architecture Specification[S/OL]. https://developer.arm.com/documentation/ihi0050/latest/.
|
| [13] |
|
| [14] |
The RISC-V Instruction Set Manual: Volume I[S/OL]. https://lists.riscv.org/g/tech-unprivileged/attachment/535/0/unpriv-isa-asciidoc.pdf.
|
| [15] |
|
| [16] |
AMBA AXI Protocol Specification[S/OL]. https://developer.arm.com/documentation/ihi0022/latest/.
|
| [17] |
胡东伟, 巴晓辉, 雷岳俊, 等. 空间耦合最短路径自适应路由的路径条数及可缓冲的数据包数目[J]. 中央民族大学学报(自然科学版), 2023, 32(4):69-77.
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
/
| 〈 |
|
〉 |