-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: CUDA performances #6
Comments
Dear albop, |
No, I did not attempt to reactivate the code, mostly for lack of time and other distractions. One idea would be to tweak a bit the output of codegen. From this discussion, https://groups.google.com/a/continuum.io/forum/#!msg/numba-users/8Hn6GagrXXU/ivQa4JUzCgAJ , I gather there is nothing fundamentally wrong with this approach. |
Okay, that makes sense. I am a new to GPU coding, but would really like to try. So I will look into it the next few weeks and see what I can do. I will focus on the eval_linear function for the moment. |
Currently, the experiments with CUDA yield terrible performances on AWS/g2.
It may be because, calculations are made on 64 bits by default.
To do :
The text was updated successfully, but these errors were encountered: