Skip to content

Latest commit

 

History

History
11 lines (8 loc) · 334 Bytes

File metadata and controls

11 lines (8 loc) · 334 Bytes
description
DNN batching inference system to reduce the latency and improve the throughput.

DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Service...

Meta Info

DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Services on GPUs