> >CRPC-TR 92190 "Optimizing for Parallelism and Data Locality" McKinley,
> >Kennedy
> >Abstract:
> >
> >Previous research has used program transformation to introduce
> >parallelism and to exploit data locality. Unfortunately, these two
> >objectives have usually been considered independently. This work
> >explores the tradeoffs between effectively utilizing parallelism and
> >memory hierarchy on shared-memory multiprocessors. We present a
> >simple, but surprisingly accurate, memory model to determine cache
> >line reuse from both multiple accesses to the same memory location and
> >from consecutive memory access. The model is used in memory
> >optimizing and loop parallelization algorithms that effectively
> >exploit data locality and parallelism in concert. We demonstrate the
> >efficacy of this approach with very encouraging experimental results.
> >
> >Keywords: optimization, parallelization, data locality, reuse, memory
> >hierarchy, loop transformations