-
Notifications
You must be signed in to change notification settings - Fork 1k
Issues: huggingface/accelerate
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
I try to train our model with stylegan-2, find a bug, how I can fix it
#3404
opened Feb 20, 2025 by
lingtengqiu
Transformers test_cpu_offload tests fail with KeyError: 'xpu:0'
#3402
opened Feb 20, 2025 by
dvrogozh
Something WRONG when I saving the trained model with deepspeed stage 3 optimization config
#3399
opened Feb 16, 2025 by
ZYM66
2 of 4 tasks
strange behavior of
split_between_processes
when length of inputs
is smaller than num_processes
#3393
opened Feb 11, 2025 by
hmk114
Forwarding args to the Accelerator in the Trainer class
#3392
opened Feb 10, 2025 by
santiag0m
2 of 4 tasks
IndexError: pop from an empty deque
after 20 seconds of training
#3386
opened Feb 7, 2025 by
JohnConnor123
Compatibility between Accelerate and ThreadPoolExecutor
#3384
opened Feb 6, 2025 by
BiEchi
2 of 4 tasks
Initialize model with empty weight causes OOM with offloading to disk
#3374
opened Feb 1, 2025 by
Aiden-Frost
2 of 4 tasks
loading the prodigy optimizer does not move custom parameters to the accelerator
#3372
opened Jan 29, 2025 by
bghira
4 tasks
Training hangs indefinitely on first forward pass when using TPU v3-8 in Kaggle
#3370
opened Jan 27, 2025 by
WpythonW
2 of 4 tasks
Gradient accumulation with deepSpeed has issue if not set during configuration
#3369
opened Jan 27, 2025 by
khalil-Hennara
2 of 4 tasks
DataLoaderShard wrongly yields None instead of StopIteration when its dataloader returns StopIteration immediately
#3367
opened Jan 25, 2025 by
Aleko2286
2 of 4 tasks
"@verify_operation" lead to pretrain of multi-nodes hang
#3364
opened Jan 24, 2025 by
sankexin
2 of 4 tasks
Google Colab TPU
notebook_launcher
doesn't work
#3358
opened Jan 21, 2025 by
matinmoezzi
2 of 4 tasks
[Feature Request] include a DeepSpeed multi-node config slurm example
contributions-welcome
deepspeed
DS related issues/PRs
#3338
opened Jan 13, 2025 by
sayakpaul
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.