Computer System Architecture Lab
Computer System Architecture Lab
Home
News
Members
Publications
Research
Gallery
Contact
Light
Dark
Automatic
TLP
TLP Balancer: Predictive Thread Allocation for Multitenant Inference in Embedded GPUs
This letter introduces a novel software technique to optimize thread allocation for merged and fused kernels in multitenant inference …
Minseong Gil
,
Jaebeom Jeon
,
Junsu Kim
,
Sangun Choi
,
Gunjae Koo
,
Myung Kuk Yoon
,
Yunho Oh
PDF
Cite
Project
DOI
Cite
×