qlora和lora训练的区别是啥 #4206

ChenBinfighting1 · 2024-06-11T08:56:20Z

Reminder

I have read the README and searched the existing issues.

System Info

qwen_lora_sft_gptq.yaml的配置信息如下：

model

model_name_or_path: /data/LLM/qwen/Qwen1.5-7B-Chat
quantization_bit: 4

method

stage: sft
do_train: true
finetuning_type: lora
lora_target: all

dataset

dataset: asia_data
template: qwen
cutoff_len: 1024
max_samples: 1000
val_size: 0.1
overwrite_cache: true
preprocessing_num_workers: 16

output

output_dir: saves/qwen1.5-7b-int4/all
logging_steps: 10
save_steps: 200
plot_loss: true
overwrite_output_dir: true

train

per_device_train_batch_size: 2
gradient_accumulation_steps: 2
learning_rate: 0.0001
num_train_epochs: 5.0
lr_scheduler_type: cosine
warmup_steps: 0.1
fp16: true

eval

per_device_eval_batch_size: 1
evaluation_strategy: steps
eval_steps: 500

Reproduction

CUDA_VISIBLE_DEVICES=1 python3 src/train.py examples/qlora_single_gpu/qwen_lora_sft_gptq.yaml

Expected behavior

看到有人问相同的问题，但貌似没有看到答复，所以也提交了issue。
看着配置文件lora和qlora是一样的，那我们怎么区分是Lora还是qlora的？
个人理解：quantization_bit: 4 增加这个参数，是不是模型加载时候用nf4格式加载【qlora的核心之一】，所以此时是启动了qlora方式训练？

Others

No response

hiyouga · 2024-06-11T09:02:46Z

通过 quantization_bit 区分

ChenBinfighting1 · 2024-06-11T09:31:10Z

通过 quantization_bit 区分

谢谢！

1，我的这个理解是否正确？

2，如果使用的gptq版本模型，quantization_bit=4存在与否都是lora训练吗？

ChenBinfighting1 · 2024-06-11T09:31:34Z

个人理解：quantization_bit: 4 增加这个参数，是不是模型加载时候用nf4格式加载【qlora的核心之一】，所以此时是启动了qlora方式训练？

hiyouga · 2024-06-11T09:51:38Z

gptq 不需要加额外参数

joyce0105-ops · 2024-06-24T07:41:57Z

个人理解：quantization_bit: 4 增加这个参数，是不是模型加载时候用nf4格式加载【qlora的核心之一】，所以此时是启动了qlora方式训练？

请问你懂了吗？我看gptq和普通lora的yaml文件是一样的，除了模型名字，那应该就是对gptq量化后的模型来进行lora？不知我这个理解对不对，想请教一下

ChenBinfighting1 · 2024-06-25T02:06:58Z

个人理解：quantization_bit: 4 增加这个参数，是不是模型加载时候用nf4格式加载【qlora的核心之一】，所以此时是启动了qlora方式训练？

请问你懂了吗？我看gptq和普通lora的yaml文件是一样的，除了模型名字，那应该就是对gptq量化后的模型来进行lora？不知我这个理解对不对，想请教一下

qlora是在加载模型时候采用nf4量化方式【其中涉及量化的技巧，比如参数服从（0，1）正态分布】，所以使用的模型需要非量化版本，并且增加参数quantization_bit: 4 ，以此区分是lora还是qlora

huangfu170 · 2024-09-20T02:40:54Z

个人理解：quantization_bit: 4 增加这个参数，是不是模型加载时候用nf4格式加载【qlora的核心之一】，所以此时是启动了qlora方式训练？

请问你懂了吗？我看gptq和普通lora的yaml文件是一样的，除了模型名字，那应该就是对gptq量化后的模型来进行lora？不知我这个理解对不对，想请教一下

qlora是在加载模型时候采用nf4量化方式【其中涉及量化的技巧，比如参数服从（0，1）正态分布】，所以使用的模型需要非量化版本，并且增加参数quantization_bit: 4 ，以此区分是lora还是qlora

所以佬，gptq的那个example是啥意思...从配置文件感觉像是对量化过的模型做lora

ChenBinfighting1 · 2024-09-24T08:14:17Z

个人理解：quantization_bit: 4 增加这个参数，是不是模型加载时候用nf4格式加载【qlora的核心之一】，所以此时是启动了qlora方式训练？

请问你懂了吗？我看gptq和普通lora的yaml文件是一样的，除了模型名字，那应该就是对gptq量化后的模型来进行lora？不知我这个理解对不对，想请教一下

qlora是在加载模型时候采用nf4量化方式【其中涉及量化的技巧，比如参数服从（0，1）正态分布】，所以使用的模型需要非量化版本，并且增加参数quantization_bit: 4 ，以此区分是lora还是qlora

所以佬，gptq的那个example是啥意思...从配置文件感觉像是对量化过的模型做lora

我理解是这个意思

ChenBinfighting1 · 2024-09-24T08:16:44Z

通过 quantization_bit 区分
你好，想咨询一下，我看yaml文件中是通过quantization_bit 区分qlora和lora，但是在lamafactory-cli train时候有一个参数--quantization_method bitsandbytes，这种方式的训练也是qlora吗？

Liufeiran123 · 2024-09-26T01:54:06Z

gptq量化后，可以继续QLORA微调吧。

github-actions bot added the pending This problem is yet to be addressed label Jun 11, 2024

hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Jun 11, 2024

hiyouga closed this as completed Jun 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

qlora和lora训练的区别是啥 #4206

qlora和lora训练的区别是啥 #4206

ChenBinfighting1 commented Jun 11, 2024

hiyouga commented Jun 11, 2024

ChenBinfighting1 commented Jun 11, 2024

ChenBinfighting1 commented Jun 11, 2024

hiyouga commented Jun 11, 2024

joyce0105-ops commented Jun 24, 2024 •

edited

Loading

ChenBinfighting1 commented Jun 25, 2024

huangfu170 commented Sep 20, 2024

ChenBinfighting1 commented Sep 24, 2024

ChenBinfighting1 commented Sep 24, 2024

Liufeiran123 commented Sep 26, 2024

qlora和lora训练的区别是啥 #4206

qlora和lora训练的区别是啥 #4206

Comments

ChenBinfighting1 commented Jun 11, 2024

Reminder

System Info

model

method

dataset

output

train

eval

Reproduction

Expected behavior

Others

hiyouga commented Jun 11, 2024

ChenBinfighting1 commented Jun 11, 2024

ChenBinfighting1 commented Jun 11, 2024

hiyouga commented Jun 11, 2024

joyce0105-ops commented Jun 24, 2024 • edited Loading

ChenBinfighting1 commented Jun 25, 2024

huangfu170 commented Sep 20, 2024

ChenBinfighting1 commented Sep 24, 2024

ChenBinfighting1 commented Sep 24, 2024

Liufeiran123 commented Sep 26, 2024

joyce0105-ops commented Jun 24, 2024 •

edited

Loading