Image credit: Unsplash

Duality Cache for Data Parallel Acceleration

D. Fujiki, S. Mahlke, R. Das

Jun 1, 2019

Abstract

Duality Cache is an in-cache computation architecture that enables general-purpose data-parallel applications to run on caches. This paper presents a holistic approach to building the Duality Cache system stack: techniques for performing in-cache floating-point arithmetic and transcendental functions, a data-parallel execution model, a compiler that accepts existing CUDA programs, and the flexibility to adapt to various workload characteristics. Exposing the massive parallelism available in the Duality Cache architecture improves the performance of GPU benchmarks by 3.6× and OpenACC benchmarks by 4.0× over a server-class GPU. Re-purposing existing caches provides 72.6× better performance for CPUs at an area cost of only 3.5%. Duality Cache reduces energy by 5.8× over GPUs and 21× over CPUs.

Type

Conference paper

Publication

2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA)

Date

June, 2019