Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #943

This workflow is awaiting approval from a maintainer in #6553
Triggered via pull request on September 18, 2024 at 12:25
Status: Action required

hpu-gaudi2.yml

on: pull_request
unit-tests