[wip] Cubecl API and grayscale to rgb u8 kernel #239

edgarriba · 2025-02-24T10:00:40Z

No description provided.

omar-abdelgawad · 2025-04-01T21:33:18Z

Hey! I have a question since the free github actions can't support a GPU how would we be able to test the GPU backend for future implementations?

edgarriba · 2025-04-02T09:43:27Z

for now assume that we have to check with our own computers with a gpu

omar-abdelgawad · 2025-04-03T04:17:59Z

crates/kornia-imgproc/src/cubecl.rs

+#[cube(launch_unchecked)]
+fn gray_from_rgb_u8_cl_kernel(
+    src: &Array<Line<u8>>,
+    dst: &mut Array<Line<u8>>,
+    args: GrayFromRgbArgs,
+) {
+    let x = CUBE_POS_X * CUBE_DIM_X + UNIT_POS_X;
+    let y = CUBE_POS_Y * CUBE_DIM_Y + UNIT_POS_Y;
+
+    let cols = args.cols;
+    let rows = args.rows;
+
+    if x < cols && y < rows {
+        let idx = y * cols + x;
+        let r = u16::cast_from(src[3 * idx]);
+        let g = u16::cast_from(src[3 * idx + 1]);
+        let b = u16::cast_from(src[3 * idx + 2]);
+        let gray = u8::cast_from(((r * 77 + g * 150 + b * 29) + 128) >> 8);
+        dst[idx] = Line::new(gray);
+    }
+}


Edited:
I ran the same test successfully with the following implementation instead:

#[cube(launch_unchecked)] fn gray_from_rgb_u8_cl_kernel( src: &Array<Line<u8>>, dst: &mut Array<Line<u8>>, ) { let idx = ABSOLUTE_POS; if idx < dst.len() { let r = u16::cast_from(src[3 * idx]); let g = u16::cast_from(src[3 * idx + 1]); let b = u16::cast_from(src[3 * idx + 2]); let gray = u8::cast_from(((r * 77 + g * 150 + b * 29) + 128) >> 8); dst[idx] = Line::new(gray); } }

Should I make a PR to this branch or to the main branch? I am also revising the ImageCL abstraction layer for #251 from burn CubeTensor.

hi @omar-abdelgawad thanks for trying! Have you found a huge difference ? I tried at some point and was minimal. Happy to adopt your version in any case. I think for now I'll refactor this work into a separated experimental kornia-cubecl crate to isolate deps until it matures. This will happen between today and tomorrow

Have you found a huge difference ?

No The difference is a few microseconds in the mean time of 256x224 benchmark only since all the measurements are quite fast anyway. But I was just trying to write the kernel to be as concise as possible and also figure out what are the optimal cube count, cube dim, and vectorization to use. btw, I tried to write about my results in the burn Grayscale thread that you opened before. Is your current benchmark still slow?

with the recent changes of having the imagecl struct holding the runtime data everything got more clear, as the cubecl guys mentioned the bottleneck was in the host/device copy

I think for now I'll refactor this work into a separated experimental kornia-cubecl crate to isolate deps until it matures. This will happen between today and tomorrow

Maybe you can tell me the layout of what you want to be done and I will handle it if possible. I submitted my proposal for this project anyway so I would be more than happy to start making a PR. Also should I skip the gpu tests for CI passing?

hi, sorry for late reply here. Not sure if you want to keep contributing on this but possibly the best to move forward is to have it for now as separated experimental kornia-cubecl crate. As we are still figuring out the final api for the current Tensor/Image to be agnostic to the memory allocator and backend, we might want the cubecl Image as decoupled implementation

Sounds good. As for my contribution, I honestly don't have much free time lately but I don't mind contributing to this branch and keeping it updated with the main branch. for now, can I make PRs to this branch (cubecl-grayscale)?

edgarriba added 4 commits February 17, 2025 11:53

color: cubecl gray_from_color gpu

3bfb110

not crashing

c5e5ffc

simplify

84975dd

implement u8 kernels

ed29f66

edgarriba linked an issue Mar 9, 2025 that may be closed by this pull request

create ImageCL abstraction #251

Open

omar-abdelgawad reviewed Apr 3, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[wip] Cubecl API and grayscale to rgb u8 kernel #239

[wip] Cubecl API and grayscale to rgb u8 kernel #239

Uh oh!

edgarriba commented Feb 24, 2025

Uh oh!

omar-abdelgawad commented Apr 1, 2025

Uh oh!

edgarriba commented Apr 2, 2025

Uh oh!

omar-abdelgawad Apr 3, 2025 •

edited

Loading

Uh oh!

omar-abdelgawad Apr 3, 2025

Uh oh!

edgarriba Apr 3, 2025

Uh oh!

omar-abdelgawad Apr 4, 2025

Uh oh!

edgarriba Apr 4, 2025

Uh oh!

omar-abdelgawad Apr 5, 2025 •

edited

Loading

Uh oh!

edgarriba May 13, 2025

Uh oh!

omar-abdelgawad May 13, 2025

Uh oh!

Uh oh!

[wip] Cubecl API and grayscale to rgb u8 kernel #239

Are you sure you want to change the base?

[wip] Cubecl API and grayscale to rgb u8 kernel #239

Uh oh!

Conversation

edgarriba commented Feb 24, 2025

Uh oh!

omar-abdelgawad commented Apr 1, 2025

Uh oh!

edgarriba commented Apr 2, 2025

Uh oh!

omar-abdelgawad Apr 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

omar-abdelgawad Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

edgarriba Apr 3, 2025

Choose a reason for hiding this comment

Uh oh!

omar-abdelgawad Apr 4, 2025

Choose a reason for hiding this comment

Uh oh!

edgarriba Apr 4, 2025

Choose a reason for hiding this comment

Uh oh!

omar-abdelgawad Apr 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

edgarriba May 13, 2025

Choose a reason for hiding this comment

Uh oh!

omar-abdelgawad May 13, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

omar-abdelgawad Apr 3, 2025 •

edited

Loading

omar-abdelgawad Apr 5, 2025 •

edited

Loading