-
Notifications
You must be signed in to change notification settings - Fork 265
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to train YOLO in the container? error: AssertionError: Torch not compiled with CUDA enabled #799
Comments
Hi, from the error msg the target device is not specified as expected. I'm looking into whether some device orientation is missed. |
set
|
I got it. ultralytics 8.3.99 in the container, but 8.3.72 in the host. downgrade to 8.3.72 in the container, worked.
|
It works for both AMP & non-AMP after the versions got aligned? |
No, it's not not worked when still:
|
Here is the GPU info: 😄
|
When the training task finished, the new model has created, got some errors:
|
Hi @jiekechoo , the errors you encountered is due that the ultralytics yolov8/v11 repo does not support Intel GPU ( Since
With these changes your training script is workable at my side, please give it a try and check whether it solves your problem. Thanks! |
@ZailiWang Thanks for your reply. How can I use IPEX to train models across multiple GPUs, either on a single host or in a Kubernetes cluster? Are there any official documents I can refer to for this? |
Let's sync it up via mail. |
fixed after this guide. Thanks. |
@ZailiWang Thanks for mentioning my workaround in ultralytics to train YOLO model on Intel GPUs |
I’m trying to create a PR for this issue. |
Describe the issue
Run docker container
Python environment and
I followed this article: https://community.intel.com/t5/GPU-Compute-Software/How-to-train-yolov8-using-intel-arc-770-gpu/m-p/1597471 , but not worked. Is there any guide for training YOLO ?
The text was updated successfully, but these errors were encountered: