Fix freezing modules in Ghost Clipping #729

EnayatUllah · 2025-02-11T20:32:02Z

Summary:
Freezing modules with ghost clipping throws an error as corresponding per-sample norms are (not) calculated. Fix: keep in memory the list of all parameters and checking if corresponding requires_grad is True when calculating norms.

Further, unfreezing modules (with and without ghost clipping) wasn't supported because the hooks aren't present for the corresponding modules. Fix: rewrite requires_grad_ to add the hook.

facebook-github-bot · 2025-02-11T20:32:12Z

This pull request was exported from Phabricator. Differential Revision: D68656459

Summary: Freezing modules with ghost clipping throws an error as corresponding per-sample norms are (not) calculated. Fix: keep in memory the list of all parameters and checking if corresponding requires_grad is True when calculating norms. Further, unfreezing modules (with and without ghost clipping) wasn't supported because the hooks aren't present for the corresponding modules. Fix: rewrite `requires_grad_' to add the hook. Facebook We initially used a `trainable_parameters(module)` to traverse the list of trainable modules upon norm computation. It was slow because `trainable_parameters(module)` is a generator and it traverses the neural network graph overtime. We replaced it with a list of trainable parameters fixed during model creation time. This is what lead to issues with freezing modules as this list is not updated. Fix: Use **all parameters** **list** -- not a generator, so no traversal happens. Further, we check `requires_grad` when calculating per-sample norm to ascertain whether to compute it or not. This is how this check is done in (non-private) [optimizer](https://github.com/pytorch/pytorch/blob/5725462cd8679dd1dea8a469b1bf2e71f226b664/torch/optim/optimizer.py#L963) to determine which parameters are frozen or not. Differential Revision: D68656459

facebook-github-bot · 2025-02-11T20:53:12Z

This pull request was exported from Phabricator. Differential Revision: D68656459

Summary: Freezing modules with ghost clipping throws an error as corresponding per-sample norms are (not) calculated. Fix: keep in memory the list of all parameters and checking if corresponding requires_grad is True when calculating norms. Further, unfreezing modules (with and without ghost clipping) wasn't supported because the hooks aren't present for the corresponding modules. Fix: rewrite `requires_grad_' to add the hook. Facebook We initially used a `trainable_parameters(module)` to traverse the list of trainable modules upon norm computation. It was slow because `trainable_parameters(module)` is a generator and it traverses the neural network graph overtime. We replaced it with a list of trainable parameters fixed during model creation time. This is what lead to issues with freezing modules as this list is not updated. Fix: Use **all parameters** **list** -- not a generator, so no traversal happens. Further, we check `requires_grad` when calculating per-sample norm to ascertain whether to compute it or not. This is how this check is done in (non-private) [optimizer](https://github.com/pytorch/pytorch/blob/5725462cd8679dd1dea8a469b1bf2e71f226b664/torch/optim/optimizer.py#L963) to determine which parameters are frozen or not. Differential Revision: D68656459

facebook-github-bot · 2025-02-11T21:47:04Z

This pull request was exported from Phabricator. Differential Revision: D68656459

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 11, 2025

facebook-github-bot added the fb-exported label Feb 11, 2025

EnayatUllah force-pushed the export-D68656459 branch from 6c3ee58 to 60b7522 Compare February 11, 2025 20:53

EnayatUllah force-pushed the export-D68656459 branch from 60b7522 to 9eb7875 Compare February 11, 2025 21:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix freezing modules in Ghost Clipping #729

Fix freezing modules in Ghost Clipping #729

EnayatUllah commented Feb 11, 2025 •

edited

Loading

facebook-github-bot commented Feb 11, 2025

facebook-github-bot commented Feb 11, 2025

facebook-github-bot commented Feb 11, 2025

Fix freezing modules in Ghost Clipping #729

Are you sure you want to change the base?

Fix freezing modules in Ghost Clipping #729

Conversation

EnayatUllah commented Feb 11, 2025 • edited Loading

facebook-github-bot commented Feb 11, 2025

facebook-github-bot commented Feb 11, 2025

facebook-github-bot commented Feb 11, 2025

EnayatUllah commented Feb 11, 2025 •

edited

Loading