Multiple Improvements for mmengine #1629

Open · wants to merge 25 commits into main
Conversation

@MGAMZ commented Jan 17, 2025

Motivation

While using the mmengine framework in depth, I fixed several subtle issues, hoping to make the project more compatible with the latest PyTorch versions.

Modification

Check the ‘disable’ parameter in the compile configuration

In the current PyTorch compile configuration (PyTorch Compile Doc), there is a ‘disable’ parameter. The mmengine compile implementation does not check this parameter: as long as compile is set to a dict, mmengine will always compile the model.
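A minimal sketch of the intended behaviour (my own illustration, not the exact patch): read the compile config and skip torch.compile when disable is set.

```python
# Hypothetical helper, assuming the compile option may be a bool or a dict of
# torch.compile keyword arguments.
import torch

def maybe_compile(model, compile_cfg):
    if not compile_cfg:
        return model
    cfg = {} if compile_cfg is True else dict(compile_cfg)
    if cfg.pop('disable', False):
        # Respect torch.compile's own 'disable' switch instead of compiling anyway.
        return model
    return torch.compile(model, **cfg)
```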

Fix an optimizer state loading bug when using FSDP

The torch.distributed.fsdp.fully_sharded_data_parallel.FullyShardedDataParallel.optim_state_dict_to_load method requires the following parameters:

  • model
  • optim
  • optim_state_dict
  • is_named_optimizer
  • load_directly
  • group

mmengine's FSDP strategy does not call this method correctly, which causes an error; a corrected call is sketched below.
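A sketch of the corrected call, assuming PyTorch >= 2.1 where the first three parameters are model, optim, and optim_state_dict; the helper name is illustrative, not the actual mmengine code.

```python
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

def load_fsdp_optim_state(model, optim, optim_state_dict):
    # Convert the saved optimizer state dict into the sharded form expected by
    # this rank's optimizer, then load it.
    converted = FSDP.optim_state_dict_to_load(
        model=model,
        optim=optim,
        optim_state_dict=optim_state_dict,
    )
    optim.load_state_dict(converted)
```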

Update GradScaler to align with the latest PyTorch version

from torch.cuda.amp import GradScaler raises a PyTorch FutureWarning; this import path will be deprecated in the future.
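For reference, the replacement import on recent PyTorch releases:

```python
# Deprecated import path (emits a FutureWarning on recent PyTorch):
#   from torch.cuda.amp import GradScaler
#   scaler = GradScaler()

# Device-agnostic replacement:
from torch.amp import GradScaler

scaler = GradScaler('cuda')
```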

Update Adafactor to align with the latest PyTorch version

The Adafactor optimizer from transformers is now also implemented by PyTorch itself, so it no longer needs to be registered via OPTIMIZERS.register_module.
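One way to handle this (an assumption on my part, not necessarily the exact change in this PR) is to guard the registration so it only happens when PyTorch does not already ship torch.optim.Adafactor:

```python
import torch
from mmengine.registry import OPTIMIZERS

if not hasattr(torch.optim, 'Adafactor'):
    try:
        from transformers import Adafactor
        OPTIMIZERS.register_module(module=Adafactor)
    except ImportError:
        # transformers is optional; skip registration if it is not installed.
        pass
```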

Add Pure-Python style config for OptimWrapperConstructor

The current mmengine does not support pure-Python style configs for OptimWrapperConstructor.
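A sketch of what such a config could look like once supported (the field layout is assumed from the usual optim_wrapper config; the class object replaces the registry type string):

```python
from mmengine.optim import DefaultOptimWrapperConstructor
from torch.optim import AdamW

optim_wrapper = dict(
    type='OptimWrapper',
    optimizer=dict(type=AdamW, lr=1e-4, weight_decay=0.01),
    # Pure-Python style: pass the constructor class itself instead of a string.
    constructor=DefaultOptimWrapperConstructor,
    paramwise_cfg=dict(norm_decay_mult=0.0),
)
```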

Update torch.load to align with the latest PyTorch version

torch.load will require the weights_only parameter in the future; omitting it currently raises warnings.
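Passing the parameter explicitly silences the warning (the path below is a placeholder):

```python
import torch

# weights_only=True is the safer choice; checkpoints that pickle arbitrary
# Python objects may still need weights_only=False.
state = torch.load('checkpoint.pth', map_location='cpu', weights_only=True)
```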

Add Pure-Python style config for model_wrapper

The current mmengine does not support pure-Python style configs for model_wrapper.
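A sketch of a pure-Python style model_wrapper config (layout assumed, mirroring the string-based version):

```python
from mmengine.model import MMDistributedDataParallel

model_wrapper_cfg = dict(
    # Pure-Python style: reference the wrapper class directly.
    type=MMDistributedDataParallel,
    find_unused_parameters=True,
)
```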

Improve the warning information in Visualization

The improvement is minor; it just adds more hints.

MGAMZ and others added 25 commits July 21, 2024 17:44
FutureWarning: `torch.cuda.amp.GradScaler(args...)` is deprecated. Please use `torch.amp.GradScaler('cuda', args...)` instead.
FutureWarning: `torch.cuda.amp.GradScaler(args...)` is deprecated. Please use `torch.amp.GradScaler('cuda', args...)` instead.
FSDP.optim_state_dict_to_load requires the following parameters:

model: Module,
optim: Optimizer,
optim_state_dict: Dict[str, Any]
…tions

The current runner implementation does not yet support pure-Python style configurations for the model wrapper class. I follow the mainstream implementation to support this feature.
This may be due to a version conflict. Newer PyTorch may have introduced this optimizer.
This reverts commit 7103c3e.
@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ MGAMZ
❌ 张贻钦


张贻钦 seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.
