Skip to content

Files

Latest commit

959ffa7 · Jul 27, 2023

History

History
37 lines (22 loc) · 1.79 KB

File metadata and controls

37 lines (22 loc) · 1.79 KB

GaiaGPU: Sharing GPUs in container clouds

Metadata

Presented in ISPA/IUCC/BDCloud/SocialCom/SustainCom 2018.

Authors: Jing Gu, Shengbo Song, Ying Li, Hanmei Luo

Code

Understanding the paper

TL;DR

This paper presents an approach named GaiaGPU, to share GPU memory and computing resources among containers. It gives a solution for providing GPU sharing in the cloud.

Technical details

The architecture of GaiaGPU

  • The vGPU Library running in the container is used to manage the GPU resources.
    • It intercepts the memory-related APIs and the computing-related APIs in the CUDA Library by the LD_LIBRARY_PATH mechanism. 12 CUDA Driver APIs are intercepted.

The intercepted CUDA Driver APIs

  • Two allocation methods are adopted to improve the utilization.
    1. Elastic resource allocation: temporarily modify the computing resource limit of the container, soft limit. The max utilization of GPU is set as a parameter and the default value is 90%.
    2. Dynamic resource allocation: permanently modify the resource allocation (memory, computing resource) of the container, hard limit.

Limitations

  • The experimental part is rudimentary; only the micro-benchmark has been done.
  • The effect of elastic allocation of computing resources does not seem to be particularly stable.