Multi-GPU training for object detection #7527
Unanswered
Rajesh-ParaxialTech
asked this question in
Q&A
Replies: 1 comment 2 replies
-
Hi @Rajesh-ParaxialTech, thanks for your interest here.
I didn't fully get your question, what do you want to incorparate? Do you mean how to ensure your pipeline can using DDP?
Hope it helps, thanks. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello
Thanks a lot for the nice GPU training concept and support. Currently I have been training an object detection model using the "Single Node multi-gpu " mode. I have tried to incorporate the features mentioned in the Brats segmentation (in my case I am trying the object detection model training) in my code. But couldnt find a way to incorporate the following:
Please let me know where i can incorporate in the code. My feeling is that this is to be incorporated somewhere in this file ".local/lib/python3.10/site-packages/monai/bundle/reference_resolver.py" of the pipeline used in the object detection tasks. At this point the config_items are evaluated and my object detection model (i.e the retinanet detector training) is exceuted with data, I think.
Without using the statement (mentioned in the figure above) in my code, while running the code, i am getting this error shown in figure below
Also my question is whether this line "model = DistributedDataParallel(model, device_ids=[device]) " mentioned in the figure above (first figure) is required for single Node multi-GPU training mode. Or is it required only for multi-node, multi-GPU training ?
It would be really helpful, If i could be guided in this regard.
Thanking you again
Rajesh
Beta Was this translation helpful? Give feedback.
All reactions