(Formula presented.): a data dependence and stride reference patterns profiling infrastructure

Hairong Yu, Guohui Li, Lih Chyun Shu

研究成果: Article同行評審

1 引文 斯高帕斯(Scopus)


Despite the widespread use of multi-core processors in modern computer systems, developing software tools so as to make best use of available computing resources has never been more urgent. This is because a considerable amount of spurious dependence and cache misses lurking in general-purpose applications restricts seriously the extraction of potential parallelism on the nowadays prevalent multi-core machines. Existing tools are limited in their ability to thoroughly detect data dependence and provide prefetched objects simultaneously. Further, some of the tools are unable to profile large-scale applications. To address this problem, we propose a novel profiler, called (Formula presented.) , that performs both data dependence and stride reference profiling. Data dependence profiling employs a hash-based scheme to detect actual data dependence while filtering out useless dependence via timestamps. Stride reference profiling employs value profiling to profile the stride pattern for each dynamic load and select the profitable loads as prefetched objects for compilers. To demonstrate the effectiveness of (Formula presented.) , we have evaluated it using several SPEC CPU2006, MPI2007 and OMP2012 benchmarks on an Intel i7-4700 machine. Experimental results show that (Formula presented.) produces accurate profiling results, including expected data dependence and prefetched objects, which in turn contributes to more opportunities for extracting parallelism.

頁(從 - 到)770-788
期刊Journal of Supercomputing
出版狀態Published - 2016 2月 1

All Science Journal Classification (ASJC) codes

  • 軟體
  • 理論電腦科學
  • 資訊系統
  • 硬體和架構


深入研究「(Formula presented.): a data dependence and stride reference patterns profiling infrastructure」主題。共同形成了獨特的指紋。