Mimic `adamw_torch_4bit` and have `adamw_torch_8bit` #34893

fzyzcjy · 2024-11-23T13:32:14Z

Feature request

Hi thanks for the lib! Currently there is adamw_torch_4bit, but I hope to mimic it to have a adamw_torch_8bit that uses 8bit torchao adamw.

The reason is that, I would like to use deepspeed cpu offload for the optimizer, and also use 8bit adamw. However, the 8bit one in current hf transformers does not support cpu, so I need to use the torchao one.

Motivation

Your contribution

yes

The text was updated successfully, but these errors were encountered:

fzyzcjy added the Feature request Request for a new feature label Nov 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mimic `adamw_torch_4bit` and have `adamw_torch_8bit` #34893

Mimic `adamw_torch_4bit` and have `adamw_torch_8bit` #34893

fzyzcjy commented Nov 23, 2024 •

edited

Loading

Mimic adamw_torch_4bit and have adamw_torch_8bit #34893

Mimic adamw_torch_4bit and have adamw_torch_8bit #34893

Comments

fzyzcjy commented Nov 23, 2024 • edited Loading

Feature request

Motivation

Your contribution

Mimic `adamw_torch_4bit` and have `adamw_torch_8bit` #34893

Mimic `adamw_torch_4bit` and have `adamw_torch_8bit` #34893

fzyzcjy commented Nov 23, 2024 •

edited

Loading