First steps to enable SYCL backend in Python Interface #155

Open
wants to merge 6 commits into
base: sycl-develop

sommerlukas
Collaborator

First implementation steps towards supporting the SYCL backend in the CUTLASS Python Interface.

The two main additions from this PR are:

  • Generating a suitable GEMM template for the CUTLASS 3.x API and Intel PVC as target.
  • Calling DPC++ instead of nvcc to compile device and host code.

The support is not complete yet, i.e., it is not yet possible to launch a kernel. Notable missing components include data transfer handling via SYCL and kernel launch.

Compilation currently requires a development version of DPC++: the -fsycl-rtc-mode flag, which was added as part of this work to support CUTLASS nested parameter classes in free-function kernels, is not yet available in any release.

Activating the SYCL backend via an environment variable is a temporary solution; a follow-up will look into a cleaner approach.
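
As a rough illustration of the switch described above, the following minimal sketch picks between nvcc and DPC++ based on an environment variable. It is not the code from this PR: the variable name `EXAMPLE_CUTLASS_USE_SYCL`, the helper names, and the use of `clang++` as the DPC++ driver are assumptions made for the example. Only `-fsycl-rtc-mode` is taken from the description.

```python
import os

# Hypothetical variable name; the PR description does not say which
# environment variable activates the SYCL backend.
BACKEND_ENV_VAR = "EXAMPLE_CUTLASS_USE_SYCL"


def use_sycl(env=None):
    """Return True when the (hypothetical) switch holds a truthy value."""
    env = os.environ if env is None else env
    value = env.get(BACKEND_ENV_VAR, "0").strip().lower()
    return value in {"1", "true", "on", "yes"}


def compile_command(source, output, env=None):
    """Build the argument list for compiling one generated source file."""
    if use_sycl(env):
        # Assumption: the DPC++ development build is invoked as clang++.
        # -fsycl enables SYCL compilation; -fsycl-rtc-mode is the flag
        # named in the PR description (development builds only).
        return ["clang++", "-fsycl", "-fsycl-rtc-mode",
                "-c", source, "-o", output]
    # Default path: compile with nvcc, as before.
    return ["nvcc", "-c", source, "-o", output]
```

Passing a plain dictionary as `env` keeps the selection logic testable without touching the real process environment, for example `compile_command("gemm.cpp", "gemm.o", env={"EXAMPLE_CUTLASS_USE_SYCL": "1"})`.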

sommerlukas self-assigned this on Nov 27, 2024