-
Notifications
You must be signed in to change notification settings - Fork 459
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorrect result when running the README example #320
Comments
This is not a bug, but a limitation: Bend only has 24-bit numbers, so, results are always |
re-openning the issue regarding this problem:
can you please tell me which version of the cuda toolkit you're using? |
I'm facing the same issue, FWIW - run-cu says CUDA not available, but I do have CUDA and nvcc installed.
Interestingly, if I manually compile the output of gen-cu (same program as OP), I get an error:
I'm using WSL |
@pema99 oh sorry, can you try installing a newer version of the cuda toolkit to see if it works? (not the one from apt, apt's one is pretty old and lacks a lot of features) |
Installed 12.4, same thing
|
And still the same |
@pema99 can you try decreasing the number of (i'm assuming this is the root cause of the problem, considering we only catered to the 4090, since that is the best GPU there is currently, and the one we have) |
Currently using cuda 12.4, and the 550.67 NVIDIA drivers. |
@rubenjr0 not the driver, but the cuda compiler itself, can you type |
Sure, my bad. Here's the output:
|
Closing this and continuing on the HVM repo, considering this is an HVM issue. |
Hello! I think I've encountered a bug. When running this example from the readme:
The output is 0. I've tried
bend run
,bend run-c
, andbend gen-cu
(bend run-cu says cuda is not available, so I manually compile it with nvcc).The output on my machine when running
sum(24, 0)
is 8388608, but on equivalent Haskell and Python programs the programs return 140737479966720. The results start to diverge when depth>=13.I was wondering what could be causing these issues, both the incorrect result when depth>=13, and the result=0 when depth>=25.
My computer specs:
The text was updated successfully, but these errors were encountered: