-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Pull requests: NVIDIA/NeMo
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Use Mcore ModelParallelConfig in strategy parallelism property
Run CICD
skip-docs
#11232
opened Nov 8, 2024 by
hemildesai
Loading…
Fix head_size in NeMo to HF checkpoint converters for width pruned model support
Run CICD
skip-docs
#11230
opened Nov 8, 2024 by
eagle705
Loading…
2 of 8 tasks
Update README.md for collection page
documentation
Improvements or additions to documentation
#11223
opened Nov 8, 2024 by
yaoyu-33
Loading…
1 of 8 tasks
Hyena wrapper: Weight decay override function
#11203
opened Nov 7, 2024 by
guyjacob
Loading…
8 tasks
Handle _io_unflatten_object when _thread_local.output_dir is not available
#11199
opened Nov 7, 2024 by
hemildesai
Loading…
fix: regular torch optims (e.g., sgd) no longer error with closure spec
core
Changes to NeMo Core
#11189
opened Nov 6, 2024 by
terrykong
Loading…
8 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.