I have seen that once the `--with-cuda` flag is provided, the `cuda-ggml` image is built using the context in the docker-compose file.
It would be nice to also support CUDA when deploying with Kubernetes. If there is already support or a way to deploy the Pods consuming GPUs, I couldn't find it in the README.
From a quick look, the following steps would be required:
- Add

  ```yaml
  resources:
    limits:
      nvidia.com/gpu: 1
  ```

  to the container section in the UI deployment.

If you could publish the image, I could create a PR and work on this.
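For illustration, the change could look roughly like the sketch below. The Deployment name, labels, and image reference are placeholders (the actual manifest isn't in the repo yet); it also assumes the NVIDIA device plugin is installed on the cluster so that `nvidia.com/gpu` is a schedulable resource:

```yaml
# Hypothetical sketch of a UI Deployment with a GPU limit.
# Names, labels, and the image reference are placeholders.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: ui
spec:
  replicas: 1
  selector:
    matchLabels:
      app: ui
  template:
    metadata:
      labels:
        app: ui
    spec:
      containers:
        - name: ui
          image: example.org/cuda-ggml:latest  # placeholder for the published CUDA image
          resources:
            limits:
              # Requires the NVIDIA device plugin DaemonSet on the cluster;
              # GPU limits cannot be overcommitted or fractional.
              nvidia.com/gpu: 1
```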
Thank you