Research
My research focuses on computer architecture, hardware/software co-design, near-memory processing and quality of service optimization. I am developing novel hardware and software solutions on accelerating large scale emerging applications, such as precision health and generative artificial intelligence (GenAI) workloads.
|
|
PIM Is All You Need: A CXL-Enabled GPU-Free System for Large Language Model Inference
Yufeng Gu*, Alireza Khadem*, Sumanth Umesh, Ning Liang, Xavier Servot, Onur Mutlu, Ravi Iyer, and Reetuparna Das
30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2025
ASPLOS 2025 /
BibTeX /
Paper /
Slides /
Code
|
|
GenDP: A Framework of Dynamic Programming Acceleration for Genome Sequencing Analysis
Yufeng Gu, Arun Subramaniyan, Tim Dunn, Alireza Khadem, Kuan-yu Chen, Somnath Paul, Md Vasimuddin, Sanchit Misra, David Blaauw, Satish Narayanasamy, Reetuparna Das
ACM/IEEE 50th International Symposium on Computer Architecture (ISCA), 2023
ISCA 2023 /
BibTeX /
Paper /
Slides /
Code /
Lightning Talk
Communications of ACM Research Highlights /
Technical Perspective
|
|
GenomicsBench: A Benchmark Suite for Genomics
Arun Subramaniyan, Yufeng Gu, Tim Dunn, Somnath Paul, Md Vasimuddin, Sanchit Misra, Satish Narayanasamy, David Blaauw, Reetuparna Das
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2021
ISPASS 2021 /
BibTeX /
Paper /
Slides /
Code /
Lightning Talk
|
|
Multi-site fMRI Analysis Using Privacy-preserving Federated Learning and Domain Adaptation: ABIDE Results
Xiaoxiao Li, Yufeng Gu, Nicha Dvornek, Lawrence Staib, Pamela Ventola, James S. Duncan
Medical Image Analysis, 2020, IF=11.148
MedIA 2020 /
BibTeX /
Paper /
Code /
|
Awards and Honors
- Distinguished Artifact Honorable Mention in HPCA 2025, selected from 3/29 artifacts. (Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing)
- Communications of ACM Research Highlights, selected among 24 papers from all ACM conferences in 2023. (GenDP: A Framework of Dynamic Programming Acceleration for Genome Sequencing Analysis)
- Rackham Graduate Student Research Grant at University of Michigan. (Pangenome Sequence Alignment Benchmark Suite, $3,000)
- Rackham Conference Travel Grant at University of Michigan, 2023, 2025.
- Student Travel Grant for ISCA 2023, HPCA 2025.
- Summer@EPFL Fellowship (2% applicants awarded), 2019.
- Tang Lixin Fellowship (60/60,000 students awarded), 2017, 2018, 2019.
- Outstanding Student Leaders at Zhejiang University (3% applicants awarded), 2017, 2019.
- First-Class Scholarship for Outstanding Students at Zhejiang University (2% applicants awarded), 2017.
|
Talks
- [03/2025] CXL-enabled PIM system for LLM inference at the MCCSys workshop co-located with ASPLOS 2025.
- [03/2024] Genomics Benchmark Suite and Accelerator Design on the Computer Architecture Seminar about at UCF.
- [02/2024] Genomics Benchmark Suite and Accelerator Design at Cornel University.
- [12/2023] Genomics Benchmark Suite and Accelerator Design on the Peisu Xia Forum at ICT, CAS.
|
Industry Experiences
- Intel Labs, Graduate Technical Intern, June 2022 - Aug. 2022.
- Tenstorrent Inc., Performance Architect Intern, May 2023 - Aug. 2023.
|
|