KELEFOURAS, Vasileios, KRITIKAKOU, Angeliki and GOUTIS, Costas (2014). A Matrix–Matrix Multiplication methodology for single/multi-core architectures using SIMD. The Journal of Supercomputing, 68 (3), 1418-1440. [Article]
Documents
Kelefouras-Matrix-MatrixMultiplicationMethodologyforSIngleMulti-Core(AM).pdf - Accepted Version
Available under License All rights reserved.
Abstract
In this paper, a new methodology is presented for speeding up Matrix–Matrix
Multiplication using the Single Instruction Multiple Data (SIMD) unit, on one or
more cores with a shared cache. This methodology achieves higher execution speed than
the state-of-the-art ATLAS library (speedups from 1.08 up to 3.5) by decreasing the number
of instructions (load/store and arithmetic) and the number of data cache accesses and
misses in the memory hierarchy. This is achieved by fully exploiting the software characteristics
(e.g. data reuse) and the hardware parameters (e.g. data cache sizes and associativities)
as one problem rather than separately, giving high-quality solutions and a smaller search
space.
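The data reuse mentioned in the abstract can be illustrated with a generic loop-tiling sketch. This is not the authors' methodology: it is a minimal, assumed example in plain Python that shows only how blocking the loops lets each small tile of the operands be reused before moving on. The function names and the `tile` parameter are hypothetical; in the paper's approach the tile sizes would be derived from the cache sizes and associativities, and the inner loop would be mapped to SIMD instructions, neither of which is attempted here.

```python
def matmul_naive(a, b):
    """Reference triple loop: C[i][j] = sum over k of A[i][k] * B[k][j]."""
    n, m, p = len(a), len(b), len(b[0])
    c = [[0.0] * p for _ in range(n)]
    for i in range(n):
        for j in range(p):
            for k in range(m):
                c[i][j] += a[i][k] * b[k][j]
    return c


def matmul_tiled(a, b, tile=2):
    """Tiled (blocked) multiplication of a (n x m) by b (m x p).

    `tile` is a hypothetical parameter chosen by hand for illustration.
    Each (ii, kk, jj) step works on one tile of A, B and C, so the
    elements loaded for that tile are reused before the next tile starts.
    """
    n, m, p = len(a), len(b), len(b[0])
    c = [[0.0] * p for _ in range(n)]
    for ii in range(0, n, tile):
        for kk in range(0, m, tile):
            for jj in range(0, p, tile):
                # Multiply one tile of A by one tile of B, accumulating into C.
                for i in range(ii, min(ii + tile, n)):
                    row_c = c[i]
                    for k in range(kk, min(kk + tile, m)):
                        a_ik = a[i][k]  # reused across the whole inner j loop
                        row_b = b[k]
                        for j in range(jj, min(jj + tile, p)):
                            row_c[j] += a_ik * row_b[j]
    return c
```

Both functions compute the same product; the tiled version only changes the order in which elements are visited. In a compiled language, the innermost loop over `j` walks contiguous rows of B and C, which is the kind of access pattern a SIMD unit can process several elements at a time.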