Computer System Architecture Lab
Computer System Architecture Lab
Home
News
Members
Publications
Research
Gallery
Contact
Light
Dark
Automatic
GPU
MOST: Memory Oversubscription-Aware Scheduling for Tensor Migration on GPU Unified Storage
Deep Neural Network (DNN) training demands large memory capacities that exceed the limits of current GPU onboard memory. Expanding GPU …
Junsu Kim
,
Jaebeom Jeon
,
Jaeyong Park
,
Sangun Choi
,
Minseong Gil
,
Seokin Hong
,
Gunjae Koo
,
Myung Kuk Yoon
,
Yunho Oh
PDF
Cite
Project
Project
DOI
SSFFT: Energy-Efficient Selective Scaling for Fast Fourier Transform in Embedded GPUs
Fast Fourier Transform (FFT) is critical in applications such as signal processing, communications, and AI. Embedded GPUs are often …
Dongwon Yang
,
Jaebeom Jeon
,
Minseong Gil
,
Junsu Kim
,
Seondeok Kim
,
Gunjae Koo
,
Myung Kuk Yoon
,
Yunho Oh
PDF
Cite
Project
Slides
DOI
Slide Show
Hierarchical Traversal Stack Design Using Shared Memory for GPU Ray Tracing
Ray tracing is widely used to generate photorealistic images by tracing the paths of light rays through a scene and their interactions …
Eunsoo Jung
,
Eunbi Jeong
,
Gunjae Koo
,
Yunho Oh
,
Myung Kuk Yoon
PDF
Cite
Project
Slides
DOI
Slide Show
Beyond VABlock: Improving Transformer Workloads through Aggressive Prefetching
The memory capacity constraint of GPUs is a major challenge in running large deep learning workloads with their ever increasing memory …
Jane Rhee
,
Ikyoung Choi
,
Gunjae Koo
,
Yunho Oh
,
Myung Kuk Yoon
PDF
Cite
Project
Project
DOI
VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing
The rapid advancement of Artificial Intelligence (AI) necessitates significant enhancements in the energy efficiency of Graphics …
Jaebeom Jeon
,
Minseong Gil
,
Junsu Kim
,
Jaeyong Park
,
Gunjae Koo
,
Myung Kuk Yoon
,
Yunho Oh
PDF
Cite
Project
Slides
DOI
Slide Show
Performance Analysis of GEMV Kernels by GPU and PIM Memory Address Mapping Approaches
Processing-in-Memory(PIM)은 프로세서와 오프칩(off-chip) …
Jiwon Shin
,
Gunjae Koo
PDF
Cite
Project
Slides
Slide Show
Conflict-Aware Compiler for Hierarchical Register File on GPUs
Modern graphics processing units (GPUs) leverage a high degree of thread-level parallelism, necessitating large-sized register files …
Eunbi Jeong
,
Eun Seong Park
,
Gunjae Koo
,
Yunho Oh
,
Myung Kuk Yoon
PDF
Cite
Project
DOI
Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs
This paper proposes a new scheme that improves throughput and reduces queuing delay while running multiple inferences in embedded …
Jaebeom Jeon
,
Gunjae Koo
,
Myung Kuk Yoon
,
Yunho Oh
PDF
Cite
Project
DOI
Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors
The performance of GPU’s external memories is becoming more critical since a modern GPU runs thousands of concurrent threads that …
Jonghyun Jeong
,
Myung Kuk Yoon
,
Yunho Oh
,
Gunjae Koo
PDF
Cite
Project
Slides
DOI
Slide Show
Analyzing GCN Aggregation on GPU
Graph convolutional neural networks (GCNs) are emerging neural networks for graph structures that include large features associated …
Inje Kim
,
Jonghyun Jeong
,
Yunho Oh
,
Myung Kuk Yoon
,
Gunjae Koo
PDF
Cite
Project
Project
DOI
»
Cite
×