-
Notifications
You must be signed in to change notification settings - Fork 244
Pull requests: JuliaGPU/CUDA.jl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Trial support for thread-block clusters
cuda kernels
Stuff about writing CUDA kernels.
enhancement
New feature or request
needs changes
Changes are needed.
needs documentation
Documentation is requested.
Add a note suggesting users prefer PTX MMA over WMMA
cuda kernels
Stuff about writing CUDA kernels.
documentation
Improvements or additions to documentation
performance
How fast can we go?
#2816
opened Jul 23, 2025 by
kshyatt
Loading…
Use GPUArrays accumulation implementation
cuda kernels
Stuff about writing CUDA kernels.
#2813
opened Jul 20, 2025 by
christiangnrd
Loading…
1 task
Restore Enzyme to CI checks
ci
Everything related to continuous integration.
needs changes
Changes are needed.
#2807
opened Jul 1, 2025 by
wsmoses
Loading…
fix conversion of 0x0 CuSparseMatrixCSC <-> CSR
bugfix
This gets something working again.
cuda libraries
Stuff about CUDA library wrappers.
#2806
opened Jul 1, 2025 by
tam724
Loading…
fixes the
kron
implementation for sparse + diagonal matrix
#2804
opened Jun 27, 2025 by
tam724
Loading…
Expand eigen() and add eig[vals,vecs]()
cuda libraries
Stuff about CUDA library wrappers.
enhancement
New feature or request
needs tests
Tests are requested.
#2787
opened May 26, 2025 by
matteosecli
Loading…
Added new api and fixed type errors in cuStateVec
cuda libraries
Stuff about CUDA library wrappers.
enhancement
New feature or request
needs changes
Changes are needed.
Try fast linear indexes for KA
enhancement
New feature or request
needs changes
Changes are needed.
performance
How fast can we go?
Allow disabling the linking of libdevice in CUDACompilerParams
enhancement
New feature or request
needs changes
Changes are needed.
speculative
Not sure about this one yet.
make CUDA randn work with Zygote
enhancement
New feature or request
needs changes
Changes are needed.
Directed rounding
cuda kernels
Stuff about writing CUDA kernels.
enhancement
New feature or request
needs tests
Tests are requested.
[CUSPARSE] Fix constructor of sparse empty matrices
bugfix
This gets something working again.
cuda libraries
Stuff about CUDA library wrappers.
#2575
opened Dec 2, 2024 by
amontoison
•
Draft
WIP: Native I/O.
cuda kernels
Stuff about writing CUDA kernels.
speculative
Not sure about this one yet.
High Level Wrapper for Fused Matmul + Bias + Activation
cuda libraries
Stuff about CUDA library wrappers.
enhancement
New feature or request
Use PrecompileTools to warmup CUDA.jl
enhancement
New feature or request
needs changes
Changes are needed.
Add a dispatch for LinearAlgebra.norm2
cuda array
Stuff about CuArray.
good first issue
Good for newcomers
needs changes
Changes are needed.
Support FFT adjoint plans and test
cuda libraries
Stuff about CUDA library wrappers.
enhancement
New feature or request
#2073
opened Sep 4, 2023 by
gaurav-arya
•
Draft
Add contract through FastmathOverlays.jl
cuda kernels
Stuff about writing CUDA kernels.
enhancement
New feature or request
Previous Next
ProTip!
Follow long discussions with comments:>50.