[DO NOT MERGE] add all level buffer support when computing infer_auto_device_map #2663
What does this PR do?
This PR adds support for buffers at all module levels when computing `infer_auto_device_map`. This is especially important for persistent buffers, which appear in the `state_dict`: if we don't assign a device to those buffers, we get an error in transformers when we try to load the `state_dict`.
We might hit edge cases with CPU and disk offload. Usually, buffers are defined in an `nn.Module` that has no child modules. However, in a model such as RecurrentGemmaModel, the `normalize` buffer is defined at the level of the model itself. So when we have `model = RecurrentGemmaForCausalLM(...)`, the persistent buffer lives at `model.model.normalize`, but `model.model` has child modules such as `layers`.

cc @ArthurZucker
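The layout described above can be sketched with a minimal pair of hypothetical modules (the names `InnerModel`/`OuterModel` are illustrative, not the actual RecurrentGemma classes): the parent module owns both a persistent buffer and child submodules, so the buffer's key shows up in the `state_dict` under a prefix that no leaf module covers.

```python
import torch
import torch.nn as nn

class InnerModel(nn.Module):
    """Mimics RecurrentGemmaModel: a buffer defined on a module
    that also has child modules."""
    def __init__(self):
        super().__init__()
        # Persistent buffer registered at the parent level...
        self.register_buffer("normalize", torch.tensor(2.0))
        # ...while the parent also has child modules.
        self.layers = nn.ModuleList([nn.Linear(4, 4) for _ in range(2)])

class OuterModel(nn.Module):
    """Mimics RecurrentGemmaForCausalLM wrapping the inner model."""
    def __init__(self):
        super().__init__()
        self.model = InnerModel()

model = OuterModel()
# The buffer appears in the state_dict under the parent's prefix,
# so a device map must assign a device to "model.normalize" too.
print("model.normalize" in model.state_dict())  # True
```

A device map built only from leaf modules (e.g. `model.model.layers.0`, `model.model.layers.1`) would miss the `model.normalize` key, which is what this PR addresses.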
Another solution would be to just use non-persistent buffers all the time for these cases -> this is the solution we are going with for now. We can merge this PR later if it still makes sense.
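A quick sketch of the alternative mentioned above, assuming the same hypothetical module shape: registering the buffer with `persistent=False` keeps it out of the `state_dict` entirely, so it never needs an entry in the device map, while the tensor itself still exists on the module and moves with `.to(device)`.

```python
import torch
import torch.nn as nn

class InnerModel(nn.Module):
    def __init__(self):
        super().__init__()
        # Non-persistent: excluded from state_dict, so loading a
        # checkpoint never looks for a device for this key.
        self.register_buffer("normalize", torch.tensor(2.0), persistent=False)
        self.layers = nn.ModuleList([nn.Linear(4, 4)])

m = InnerModel()
print("normalize" in m.state_dict())  # False
print(m.normalize)                    # buffer still usable at runtime
```

The trade-off is that the buffer's value is no longer saved in checkpoints, so it must be recomputable from config at init time.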