Skip to content

Support cuda 12.8.0 and SBSA wheels #677

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 13 commits into
base: main
Choose a base branch
from

Conversation

johnnynunez
Copy link

@johnnynunez johnnynunez commented Jan 21, 2025

This pull request includes several updates to the publish.yaml workflow file to enhance compatibility and support for additional architectures, along with minor version updates and copyright changes.

Enhancements to the workflow file:

  • .github/workflows/publish.yaml: Updated the matrix to include ubuntu-22.04-arm for os and aarch64 for arch, adjusted the torch-version and cuda-version lists, and added exclusions to prevent incompatible combinations.
  • .github/workflows/publish.yaml: Added a new environment variable MATRIX_ARCH to capture the architecture.
  • .github/workflows/publish.yaml: Updated the actions for setting up swap space and installing CUDA to use the latest versions.
  • .github/workflows/publish.yaml: Modified the logic for determining the TORCH_CUDA_VERSION to include new versions and updated the hard-coded nightly build URLs.

Version updates:

Copyright updates:

  • Updated the year from 2024 to 2025 in multiple files, including tensor_parallel.py, block.py, mamba2.py, mamba2_simple.py, mha.py, mlp.py, ssd_minimal.py, k_activations.py, layer_norm.py, layernorm_gated.py, selective_state_update.py, ssd_bmm.py, ssd_chunk_scan.py, ssd_chunk_state.py, ssd_combined.py, and ssd_state_passing.py. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16]

Additional changes in setup.py:

  • setup.py: Added new imports, reorganized the code to include new functions for determining the build target and platform, and updated the logic for handling CUDA and ROCm builds. [1] [2] [3] [4]

Why this PR @tridao ?

Github: https://github.blog/changelog/2025-01-16-linux-arm64-hosted-runners-now-available-for-free-in-public-repositories-public-preview/
windows arm q2 2025: github/roadmap#1098
ubuntu 20.04 is deprecated from today
Devices: Digits, jetson thor, cuda arm laptops are coming
Nvidia is merging SBSA and ARM64 together
I add cuda 12.8.0 and arm runners on https://github.com/Jimver/cuda-toolkit/releases/tag/v0.2.21

@johnnynunez johnnynunez changed the title initial blackwell support Support cuda 12.8.0 and SBSA wheels Feb 25, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant