Microarchitecture support for improving the performance of load target prediction

Chung Ho Chen, Akida Wu

研究成果: Conference article同行評審

6 引文 斯高帕斯(Scopus)

摘要

We present a load target prediction scheme that mitigates the impact of load latency for modern microprocessors. The scheme uses a cache-like buffer to provide the base address, offset, and operand size at the instruction fetching stage of a pipeline so that a load target address can be computed earlier at the decode stage. With the dynamic use of a load stride, the scheme has achieved a prediction rate that is 15% higher than a previously proposed approach. By providing a 128-entry direct-mapped load-prediction buffer, two adders, and two forwarding paths, for a 4-fetch processor the scheme provides an average speedup of 10% to 32% in performance improvement as the data cache latency increases from 2 cycles to 4 cycles. A bit-array design that supports multiple-cast writes and eliminates associative logic commonly used in base register caching is developed for the prediction scheme.

原文English
頁(從 - 到)228-234
頁數7
期刊Proceedings of the Annual International Symposium on Microarchitecture
出版狀態Published - 1997 十二月 1
事件Proceedings of the 1997 30th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-30 - Triangle Park, NC, USA
持續時間: 1997 十二月 11997 十二月 3

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
  • Software

指紋 深入研究「Microarchitecture support for improving the performance of load target prediction」主題。共同形成了獨特的指紋。

引用此