Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

error: torch.isnan(scales).sum() == 0 #203

Closed
namtranase opened this issue Nov 17, 2023 · 3 comments
Closed

error: torch.isnan(scales).sum() == 0 #203

namtranase opened this issue Nov 17, 2023 · 3 comments

Comments

@namtranase
Copy link

Thank you for the awesome repo. I have successfully run and tried it on the LLama model.
The problem occurs when I try to run the MPT model (7B), here are some issues, I hope u can help me with this:

  1. With the model including bias, this line will report the mismatch between bias and scales -> I play around with it and fixed it
  2. With the first one fixed, I can run the MPT model with small layers (I tested with 2 layers), but when testing with full 32 layers, I got the nan error in this line and it from this line
    Thank you a lot for the awesome repo.
@casper-hansen
Copy link
Owner

I just pushed a fix for MPT models in #206. Other than that, I am not sure what the issue is here?

@namtranase
Copy link
Author

Thank you for your reply, I fixed it by changing the proper dataset. I will close the issues

@namtranase
Copy link
Author

Hi @casper-hansen, thank you a lot for your repo. I learned a lot. I tried to apply awq to llama.cpp, here is my PR. If you have time, please give me your comment on it.
PR

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants