Skip to content

Latest commit

 

History

History
44 lines (34 loc) · 1.37 KB

README.md

File metadata and controls

44 lines (34 loc) · 1.37 KB

ilab-on-ocp

This repo will serve as the central location for the Containerfiles and yamls needed to deploy Instructlab onto an OpenShift cluster with RHOAI.

Requirements

The following Operators must be installed on the cluster

  • Red Hat - Authorino
  • NVIDIA GPU Operator
  • Node Feature Discovery
  • Red Hat OpenShift AI
  • Red Hat OpenShift Serverless
  • Red Hat OpenShift Service Mesh

NVIDIA GPU Operator

A ClusterPolicy must be deployed. The definition provided when clicking the "Create ClusterPolicy" although generic installs all required components.

Accelerator Profile

An accelerator profile must be defined within the RHOAI dashboard or via CLI to enable GPU acceleration.

apiVersion: v1
items:
- apiVersion: dashboard.opendatahub.io/v1
  kind: AcceleratorProfile
  metadata:
    name: gpu
    namespace: redhat-ods-applications
  spec:
    displayName: gpu
    enabled: true
    identifier: nvidia.com/gpu
    tolerations: []

Signed Certificate

A signed certificate ensures that there not unnecessary issues when performing the training pipeline.

To deploy a signed certificate in cluster follow trusted cluster cert

Object Storage

This solution requires object storage to be in place either through S3 or using Noobaa.

If you are using Noobaa apply the following tuning paramters