-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Time cost issue while using multiprocessing
in condaenv
#93
Comments
Hi @rhaegar325. This is fairly unusual, I've not ever seen this much of a performance regression with these environments. There is an additional overhead of launching the initial python process due to also launching a container, but once that process is running, all subsequent processes launched by multithreading are in the container, and so shouldn't take any longer to launch than any if they were launched from any other distribution. Could you please upload your script somewhere so I can take a look? |
Hi @dsroberts, thanks for your reply. It would be great if you have time to look into my code, here is the link to my script: (https://github.com/ACCESS-NRI/MED-utils/blob/main/access_med_utils/CMORise.py), more specificly, the function running in subprocess was there (https://github.com/ACCESS-NRI/MED-utils/blob/6f0693fd453f1177ffc3483398e5521fa5fd353a/access_med_utils/CMORise.py#L202), and the multiprocessing function part was there: (https://github.com/ACCESS-NRI/MED-utils/blob/6f0693fd453f1177ffc3483398e5521fa5fd353a/access_med_utils/CMORise.py#L365). hope that will help you jump to the point quicker. after those days test, I found that even the |
Hi @rhaegar325. Its hard to tell without actually running it myself, but my initial suspicion is that this line (https://github.com/ACCESS-NRI/MED-utils/blob/main/access_med_utils/CMORise.py#L418) is involved. You're running
Then do
On the larger |
Really appreciate for your suggestion @dsroberts , I will have a try first. |
Hi, @dsroberts, However, the issue was still there, I tried couple of ways, the script do generate multiple processes but it seems those processes was blocked in some part and I don't know exactly where. |
Hi, @rbeucher and @dsroberts @truth-quark:
I have some problems while using
multiprocessing
on modules inhh5
andxp65
. my code running good on my local mamba env in kj13, but when I use module inhh5
andxp65
, it's running really slow. after I test, it do running parallelly, but in each process, it pretty slow than it runs in my local env. follow are some output put of my test, make it easier to under.This is time cost when it runs in my local mamba env:
I divide the process into two part, first part is load data from
.nc
file, second part is to convert data format. the output shows the max, min and average time cost of first part, second part and the whole process.and following is the same code run on
hh5/public/modules/conda_concept/analysis3-24.01
andxp65/access-med-0.8
.it's way to slow than it should be.
data:image/s3,"s3://crabby-images/dd8d8/dd8d80fdd6993925906ea64a2183ef34e6ff43b1" alt="Screenshot 2024-08-13 at 12 32 32 pm"
And I also make a test to run sequentially on
hh5
module:it works normally, so I think there might be some problem in running
multiprocessing
, do you have any idea about this, really appreciate if you could have a look.The text was updated successfully, but these errors were encountered: