Could not use SFT Trainer in qlora_finetuning.py #12356
Comments
Hi, I have also downgraded transformers to 4.36.0, and when I downgraded trl to 0.9.6 I got this error.
I built this Dockerfile (https://github.com/intel-analytics/ipex-llm/blob/main/docker/llm/finetune/xpu/Dockerfile), then manually ran pip install trl==0.9.6 inside the docker container and ran qlora_finetuning.py in LLM-Finetuning/QLoRA/trl-example. Is there anything I missed?
Hi @shungyantham , we have reproduced this issue in our local env. Please modify the SFTTrainer setup; the code should look like this:

trainer = SFTTrainer(
    model=model,
    train_dataset=train_data,
    args=transformers.TrainingArguments(
        per_device_train_batch_size=4,
        gradient_accumulation_steps=1,
        warmup_steps=20,
        max_steps=200,
        learning_rate=2e-5,
        save_steps=100,
        bf16=True,  # bf16 is more stable in training
        logging_steps=20,
        output_dir="outputs",
        optim="adamw_hf",  # paged_adamw_8bit is not supported yet
        gradient_checkpointing=True,  # can further reduce memory but slower
    ),
    dataset_text_field="instruction",
    data_collator=transformers.DataCollatorForSeq2Seq(
        tokenizer, pad_to_multiple_of=8, return_tensors="pt", padding=True
    ),
)
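Not from the thread, but as a quick illustration of what that data_collator does: DataCollatorForSeq2Seq pads each batch dynamically (here to a multiple of 8), and it can only do that if the tokenizer has a pad token, which is exactly where the error discussed below comes from. The snippet is a minimal sketch that uses GPT-2's tokenizer purely as a stand-in for a tokenizer that ships without a pad token:

import transformers

# Stand-in tokenizer; like many Llama-style tokenizers, GPT-2's ships without a pad token.
tokenizer = transformers.AutoTokenizer.from_pretrained("gpt2")

collator = transformers.DataCollatorForSeq2Seq(
    tokenizer, pad_to_multiple_of=8, return_tensors="pt", padding=True
)

features = [tokenizer("Explain QLoRA in one sentence."), tokenizer("Hi")]

try:
    collator(features)                        # fails: no pad token defined
except ValueError as err:
    print("padding error:", err)

tokenizer.pad_token = tokenizer.eos_token     # the fix proposed later in this thread
batch = collator(features)
print(batch["input_ids"].shape)               # both sequences padded to a multiple of 8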
Hi @qiyuangong , I have faced another issue after adding the padding to the Trainer.
Please provide your transformers and trl versions, as well as your finetune.py. For reference, these are the key lib versions in our test env; after merging that PR, it can finetune the model without modification:

transformers 4.36.0
trl 0.9.6
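Not part of the thread, but a quick way to confirm that the container actually picked up these versions:

import transformers
import trl

# In the working setup described above, these should print 4.36.0 and 0.9.6.
print("transformers:", transformers.__version__)
print("trl:", trl.__version__)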
Hi @qiyuangong , I set up a docker container following this link; inside this container, I manually installed trl==0.9.6 and ran qlora_finetuning.py in /LLM-Finetuning/QLoRA/trl-example. My transformers version is 4.36.0.
Essential lib versions are correct. Please share your launch command and ...
We can resolve this padding error by adding the following lines:

if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

I will submit a PR for this change.
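For reference, a minimal sketch of where those two lines would go, assuming the tokenizer is loaded with AutoTokenizer as in typical QLoRA examples (the model path below is a placeholder):

from transformers import AutoTokenizer

base_model_path = "meta-llama/Llama-2-7b-hf"  # placeholder; use the model path passed to qlora_finetuning.py
tokenizer = AutoTokenizer.from_pretrained(base_model_path)

# Llama-style tokenizers often ship without a pad token, which makes
# DataCollatorForSeq2Seq(..., padding=True) raise the padding error discussed in this thread.
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token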
I have installed trl<0.12.0 to run qlora_finetuning.py in the QLoRA/trl-example, but it requires transformers 4.46.2, which causes the error below.
So I downgraded trl from 0.11.4 to 0.9.6 and got another padding error.