Computer System Architecture Lab
GraphSSD: Graph Semantics Aware SSD
Graph analytics play a key role in a number of applications such as social networks, drug discovery, and recommendation systems. Given …
Kiran Kumar Matam, Gunjae Koo, Haipeng Zha, Hung-Wei Tseng, Murali Annavaram
Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs
Modern GPUs suffer from cache contention due to the limited cache size that is shared across tens of concurrently running warps. To …
Yunho Oh, Gunjae Koo, Murali Annavaram, Won Woo Ro
CTA-Aware Prefetching and Scheduling for GPU
Although GPUs are supposed to tolerate the long latency of data fetch operations, we observe that L1 cache misses occur in a bursty …
Gunjae Koo, Hyeran Jeon, Zhenhong Liu, Nam Sung Kim, Murali Annavaram
Summarizer: Trading Communication with Computing near Storage
Modern data center solid state drives (SSDs) integrate multiple general-purpose embedded cores to manage flash translation layer, …
Gunjae Koo, Kiran Kumar Matam, Te I, H. V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng, Steven Swanson, Murali Annavaram
Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
Long latency of memory operation is a prominent performance bottleneck in graphics processing units (GPUs). The small data cache that …
Gunjae Koo, Yunho Oh, Won Woo Ro, Murali Annavaram
Warped-Preexecution: A GPU Pre-Execution Approach for Improving Latency Hiding
This paper presents a pre-execution approach for improving GPU performance, called P-mode (pre-execution mode). GPUs utilize a number …
Keunsoo Kim, Sangpil Lee, Myung Kuk Yoon, Gunjae Koo, Won Woo Ro, Murali Annavaram
Revealing Critical Loads and Hidden Data Locality in GPGPU Applications
In graphics processing units (GPUs), memory access latency is one of the most critical performance hurdles. Several warp schedulers and …
Gunjae Koo, Hyeran Jeon, Murali Annavaram
Warped-Compression: Enabling Power Efficient GPUs through Register Compression
This paper presents Warped-Compression, a warp-level register compression scheme for reducing GPU power consumption. This work is …
Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, Won Woo Ro, Murali Annavaram
Complementary Block-Based Motion Estimation for Frame Rate Up-Conversion
In this paper, we present a complementary motion estimation algorithm for motion-compensated frame rate up-conversion. The proposed …
Gunjae Koo, Kyoung Won Lim, Seung Jong Choi
A Robust PRML Read Channel with Digital Timing Recovery for Multi-Format Optical Disc
In this paper, a PRML read channel that supports multiple optical disc formats, i.e., CD, DVD, and BD, is presented. The read channel …
Gunjae Koo, Woochul Jung, Heesub Lee