Header menu link for other important links
X
XStream: Cross-core spatial streaming based MLC prefetchers for parallel applications in CMPs
Published in Institute of Electrical and Electronics Engineers Inc.
2014
Pages: 87 - 98
Abstract
Hardware prefetchers are commonly used to hide and tolerate off-chip memory latency. Prefetching techniques in the literature are designed for multiple independent sequential applications running on a multicore system. In contrast to multiple independent applications, a single parallel application running on a multicore system exhibits different behavior. In case of a parallel application, cores share and communicate data and code among themselves, and there is commonality in the demand miss streams across multiple cores. This gives an opportunity to predict the demand miss streams and communicate the predicted streams from one core to another, which we refer as cross-core stream communication. We propose cross-core spatial streaming (XStream), a practical and storage-efficient cross-core prefetching technique. XStream detects and predicts the cross-core spatial streams at the private mid level caches (MLCs) and sends the predicted streams in advance to MLC prefetchers of the predicted cores. We compare the effectiveness of XStream with the ideal cross-core spatial streamer. Experimental results demonstrate that, on an average (geomean), compared to the state-of-the-art spatial memory streaming, storage efficient XStream reduces the execution time by 11.3% (as high as 24%) and 9% (as high as 29.09%) for 4-core and 8-core systems respectively. © 2014 ACM.
About the journal
JournalData powered by TypesetParallel Architectures and Compilation Techniques - Conference Proceedings, PACT
PublisherData powered by TypesetInstitute of Electrical and Electronics Engineers Inc.
ISSN1089795X