error: torch.isnan(scales).sum() == 0 #203

namtranase · 2023-11-17T04:01:40Z

Thank you for the awesome repo. I have successfully run and tried it on the LLama model.
The problem occurs when I try to run the MPT model (7B), here are some issues, I hope u can help me with this:

With the model including bias, this line will report the mismatch between bias and scales -> I play around with it and fixed it
With the first one fixed, I can run the MPT model with small layers (I tested with 2 layers), but when testing with full 32 layers, I got the nan error in this line and it from this line
Thank you a lot for the awesome repo.

casper-hansen · 2023-11-18T13:53:49Z

I just pushed a fix for MPT models in #206. Other than that, I am not sure what the issue is here?

namtranase · 2023-11-25T05:08:51Z

Thank you for your reply, I fixed it by changing the proper dataset. I will close the issues

namtranase · 2023-12-22T10:06:51Z

Hi @casper-hansen, thank you a lot for your repo. I learned a lot. I tried to apply awq to llama.cpp, here is my PR. If you have time, please give me your comment on it.
PR

namtranase closed this as completed Nov 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

error: torch.isnan(scales).sum() == 0 #203

error: torch.isnan(scales).sum() == 0 #203

namtranase commented Nov 17, 2023

casper-hansen commented Nov 18, 2023

namtranase commented Nov 25, 2023

namtranase commented Dec 22, 2023

error: torch.isnan(scales).sum() == 0 #203

error: torch.isnan(scales).sum() == 0 #203

Comments

namtranase commented Nov 17, 2023

casper-hansen commented Nov 18, 2023

namtranase commented Nov 25, 2023

namtranase commented Dec 22, 2023