Zipper: Latency-Tolerant Optimizations for High-Performance Buses

Jan 1, 2024·
Shibo Chen
Shibo Chen
,
Hailun Zhang
,
Todd Austin
· 0 min read
Abstract
As heterogeneous designs take over the world of hardware designs, the data bus plays a vital role in interconnecting hosts and accelerators. While past works have emphasized increasing communication bandwidth for data-hungry workloads, this work focuses on optimizing latency for latency-sensitive acceleration applications. We first study the pattern of various accelerator workloads and demonstrate that various optimization opportunities exist to reduce the communication latency overhead. To help developers exploit these opportunities, we introduce Zipper—a protocol optimization layer that reduces communication costs by enabling device and request level parallelism and exploiting data locality for existing bus standards. We applied Zipper to two domains and implemented the end-to-end system on a heterogeneous hardware platform with an integrated FPGA. Our physical experiments show that Zipper provides 8x speedup for one accelerator with 4.3% logic overhead and 1.5x speedup for another with 0.9% logic overhead.
Type
Publication
In Proceedings of The Asia and South Pacific Design Automation Conference 2025
Shibo Chen
Authors
PhD Candidate
My research interests include heterogeneous computation, datacenter architecture and agile hardware design. My advisor is Prof. Todd Austin. I am a student in ADA (The Center for Applications Driving Architectures) and CELAB (Computer Engineering Lab) at University of Michigan.