Computer System Architecture Lab
Computer System Architecture Lab
Home
News
Members
Publications
Research
Gallery
Contact
Light
Dark
Automatic
1
Performance Characterization and Optimization of LLM Inference on Tenstorrent AI Accelerators
Sparse matrix-vector multiplication (SpMV) is a fundamental operation across diverse domains, including scientific computing, machine …
Jangho Lim
,
Dongin Shin
,
Uichan Kim
,
Jinhyeok Choi
,
Kyungmin Kim
,
Snagwon Shin
,
Snagwoo Park
,
Gunjae Koo
,
Taeweon Suh
PDF
Cite
Project
Slides
DOI
Slide Show
Three Birds, One Stone: Fast, Accurate-aware and Cost-Efficient Accelerator for Ternary LLM
Sparse matrix-vector multiplication (SpMV) is a fundamental operation across diverse domains, including scientific computing, machine …
Wonseok Jung
,
Junseok Kang
,
Sangwon Shin
,
Hongjun Um
,
Jangho Lim
,
Gunjae Koo
,
Yongjun Park
,
Sangwoo Park
,
Taeweon Suh
PDF
Cite
Project
Slides
DOI
Slide Show
SumcheckPIM: An Efficient HBM-Based PIM Architecture for Linear Complexity Zero Knowledge Proofs
Sparse matrix-vector multiplication (SpMV) is a fundamental operation across diverse domains, including scientific computing, machine …
Sunchae Kim
,
Taewoon Kang
,
Sangwon Shin
,
Taeweon Suh
,
Yibin Yang
,
Gunjae Koo
PDF
Cite
Project
Slides
DOI
Slide Show
FINEA: An Efficient Neural Network Accelerator Exploiting Factorized Input Features
Modern deep neural network (DNN) models increasingly adopt quantized data formats to alleviate the computational burdens of convolution …
Yujin Kim
,
Chanhun Jeong
,
Yunho Oh
,
Myung Kuk Yoon
,
Gunjae Koo
PDF
Cite
Project
Slides
DOI
Slide Show
HALO: Hybrid Systolic Arrays via Logical Partitioning for Acceleration of Complex-Valued Neural Networks
Complex-Valued Neural Networks (CVNNs) are an emerging class of deep learning models that process data with both real and imaginary …
Ji Yeong Yi
,
Eunbi Jeong
,
Sunghee Yum
,
Jane Rhee
,
Sangun Choi
,
Gunjae Koo
,
Yunho Oh
,
Myung Kuk Yoon
PDF
Cite
Project
Slides
DOI
Slide Show
Poster
SSFFT: Energy-Efficient Selective Scaling for Fast Fourier Transform in Embedded GPUs
Fast Fourier Transform (FFT) is critical in applications such as signal processing, communications, and AI. Embedded GPUs are often …
Dongwon Yang
,
Jaebeom Jeon
,
Minseong Gil
,
Junsu Kim
,
Seondeok Kim
,
Gunjae Koo
,
Myung Kuk Yoon
,
Yunho Oh
PDF
Cite
Project
Slides
DOI
Slide Show
SparsePIM: An Efficient HBM-Based PIM Architecture for Sparse Matrix-Vector Multiplications
Sparse matrix-vector multiplication (SpMV) is a fundamental operation across diverse domains, including scientific computing, machine …
Taewoon Kang
,
Geonwoo Choi
,
Taeweon Suh
,
Gunjae Koo
PDF
Cite
Project
Slides
DOI
Slide Show
Hierarchical Traversal Stack Design Using Shared Memory for GPU Ray Tracing
Ray tracing is widely used to generate photorealistic images by tracing the paths of light rays through a scene and their interactions …
Eunsoo Jung
,
Eunbi Jeong
,
Gunjae Koo
,
Yunho Oh
,
Myung Kuk Yoon
PDF
Cite
Project
Slides
DOI
Slide Show
HyMM: A Hybrid Sparse-Dense Matrix Multiplication Accelerator for GCNs
Graph convolutional networks (GCNs) are emerging neural network models designed to process graph-structured data. Due to massively …
Hunjong Lee
,
Jihun Lee
,
Jaewon Seo
,
Yunho Oh
,
Myung Kuk Yoon
,
Gunjae Koo
PDF
Cite
Project
Slides
DOI
Slide Show
Poster
Coldmap: Extending SSD Lifetime Exploiting Multi-Page Mapping Information
Solid-state drives (SSDs) include flash translation layer (FTL) functions to manage the inherent characteristics of NAND flash memory. …
Jaewon Seo
,
Gunjae Koo
PDF
Cite
Project
Slides
DOI
Slide Show
»
Cite
×