Convert QLoRA trained model to AWQ #155
I see now that there is an incompatibility. MPT models used to have seamless compatibility, but updates were pushed to the model repositories that have not been merged into Hugging Face transformers. I will look into whether there is a workaround for this.
Thanks, I'm eagerly looking forward to your response! This AWQ conversion is a crucial step in my project :)
If I were you, an easy thing to try would be to step back through older releases and see if you can find a transformers + AutoAWQ combination that works.
Hi! I tried various versions of transformers, from 4.30 to 4.36, against different autoawq versions (up to 0.1.7). I'm still getting the error.
Hi there!
I have an MPT model that was fine-tuned with QLoRA. I merged the QLoRA weights into the original model and saved it in 16-bit (because transformers' save_pretrained() doesn't allow saving 4-bit versions yet).
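For context, the merge step looks roughly like this (a minimal sketch assuming the adapter was trained with peft; the paths are placeholders):

```python
import torch
from peft import AutoPeftModelForCausalLM

# Load the base MPT model with the QLoRA adapter attached (placeholder path)
model = AutoPeftModelForCausalLM.from_pretrained(
    "path/to/qlora-adapter",
    torch_dtype=torch.float16,
    trust_remote_code=True,
)

# Fold the LoRA weights into the base model and save a plain 16-bit checkpoint
merged = model.merge_and_unload()
merged.save_pretrained("path/to/merged-fp16-model")
```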
I am trying to convert this 16-bit model to AWQ. After loading the model, when I call model.quantize(tokenizer, quant_config=quant_config), I get the following error:

TypeError: forward() got an unexpected keyword argument 'output_attentions'
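For reference, the conversion code looks roughly like this (a minimal sketch against the AutoAWQ 0.1.x API; the model path and quantization settings are placeholders):

```python
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "path/to/merged-fp16-model"  # placeholder path
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

# Load the merged 16-bit model and its tokenizer (MPT needs trust_remote_code)
model = AutoAWQForCausalLM.from_pretrained(model_path, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# This is the call that raises the TypeError above
model.quantize(tokenizer, quant_config=quant_config)

# If quantization succeeds, save the AWQ checkpoint
model.save_quantized("path/to/awq-model")
tokenizer.save_pretrained("path/to/awq-model")
```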
How do I solve this?
I am using AutoAWQ 0.1.6 and have downgraded transformers to 4.34. Torch and the other libraries are at the versions specified in setup.py.