Deploy Bundle Edge Version #164
Comments
Hello @moula! Thanks for contacting us! We are working to fix that.
@moula could you try again, please? You can also try the latest on the candidate channel (
We have fixed the issue and released a new revision of the slurm charms to edge. Thank you for trying the slurm charms! Let us know if you need any help.
@jaimesouza I'm trying again. I'll get back to you as soon as it's done.
@jaimesouza it works, but during the deployment I had to reboot the machines manually, which was not necessary with version 8.5. I will add Nvidia GPUs and monitoring tomorrow to test it in use. Thank you.
Hello @moula! Thanks for stopping by! We have another charm for this. To add CUDA drivers to your deployment, you could deploy it and relate it to slurmd:
juju deploy nvidia-gpu --channel edge
juju relate nvidia-gpu slurmd
@jamesbeedy Thank you very much.
Hey @moula, we will be collaborating with @NucciTheBoss and @dvdgomez from the HPC team at Canonical for the time being to revise and refactor the Slurm charms. Any changes you see in the Canonical/slurm*-operator forks will eventually be PR’d into the omnivector-solutions/slurm*-operator repos.
Hi there @moula! Yes, I have the migration completed and plan on submitting it upstream to the Omnivector folks soon. I am also tackling the versioning issue mentioned in charmed-hpc/slurmdbd-operator#5, which is where I think you received the original install error that Jamie and James fixed.
Hello @NucciTheBoss. Yes, you are all doing a good job. Thank you so much.
@jamesbeedy Thank you.
Hey @moula, it takes a few minutes following a reboot for that message to disappear. We can try to clean up the messaging around rebooting by shortening the polling interval used to check whether the machine needs a reboot. Most likely your first reboot worked 🙂
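For readers wondering what "polling to see if the machine needs a reboot" could look like, here is a minimal sketch. It is not the charms' actual code. It assumes the Ubuntu convention that a pending reboot is signalled by the marker file /var/run/reboot-required; the function names, the 5-second interval, and the 300-second timeout are all hypothetical.

```python
import time
from pathlib import Path

# Assumption: Ubuntu-style marker file. The slurm charms may detect a
# pending reboot in a different way.
REBOOT_MARKER = Path("/var/run/reboot-required")


def reboot_required(marker: Path = REBOOT_MARKER) -> bool:
    """Return True while the reboot marker file exists."""
    return marker.exists()


def wait_until_clear(
    marker: Path = REBOOT_MARKER,
    interval: float = 5.0,    # hypothetical short polling period, in seconds
    timeout: float = 300.0,   # hypothetical upper bound on the wait, in seconds
    sleep=time.sleep,
    clock=time.monotonic,
) -> bool:
    """Poll until the marker disappears.

    Returns True once no reboot is pending, or False if the marker is
    still present when the timeout expires.
    """
    deadline = clock() + timeout
    while reboot_required(marker):
        if clock() >= deadline:
            return False
        sleep(interval)
    return True
```

With a shorter interval, a status message driven by a check like this would clear sooner after the machine comes back up.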
@jamesbeedy keep it up. Thanks.
Hello,
I tried to deploy the edge version of the bundle in my data center. Everything installs except slurmrestd. Thanks.