Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #11441

This workflow is awaiting approval from a maintainer in #6553
Triggered via pull request on January 17, 2025, 10:26
Status: Action required

python.yml

on: pull_request
Matrix: unit-tests
Waiting for pending jobs