
add mamba_chunk_scan_combined and mamba_split_conv1d_scan_combined tests #670


Open · wants to merge 6 commits into main

Conversation

garrett361

This PR adds correctness tests for mamba_chunk_scan_combined and mamba_split_conv1d_scan_combined, which seemed to be missing. Forwards and backwards are tested against their reference implementations. Correctness when providing seq_idx is also tested.
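For context, the forward-correctness pattern such tests follow can be sketched as below. This is a minimal NumPy sketch with hypothetical stand-in functions (`fused_fn`, `reference_fn` are illustrative names, not the real `mamba_ssm` API, which needs a GPU): run the fused implementation and the reference on the same inputs and compare with explicit tolerances.

```python
import numpy as np

def reference_fn(x):
    # Hypothetical reference: a plain float64 cumulative sum, standing in
    # for the pure-PyTorch reference implementation of the fused kernel.
    return np.cumsum(x.astype(np.float64), axis=-1)

def fused_fn(x):
    # Hypothetical "fused kernel" stand-in: the same computation in
    # float32, so it accumulates rounding error like a real kernel would.
    return np.cumsum(x, axis=-1)

def test_forward_matches_reference(rtol=1e-3, atol=1e-2):
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4, 256)).astype(np.float32)
    out = fused_fn(x)
    ref = reference_fn(x).astype(np.float32)
    # Explicit tolerances: float32 reductions cannot match a float64
    # reference bit-for-bit, only to within accumulated rounding error.
    assert np.allclose(out, ref, rtol=rtol, atol=atol), \
        f"max abs diff: {np.abs(out - ref).max()}"

test_forward_matches_reference()
```

The backward tests follow the same shape, comparing gradients instead of outputs.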

@garrett361
Author

@tridao I know the kernels inside mamba_chunk_scan_combined and mamba_split_conv1d_scan_combined are individually tested, but I thought it would be worth adding these more end-to-end tests. Thoughts?

@peterbjorgensen

Any idea why the tolerances need to be that high?
Those tolerances seem very high for float32.
It is probably related to #683 #571

@garrett361
Author

Yes, concerningly high, at least for the backwards pass, where some tests need tol = 1e-1 and/or are sensitive to the random seed.

My first suspicion was that the issue lies in the tests rather than in the kernels, but I haven't found any problems yet. And since the forwards tests pass at reasonable-ish 1e-2/1e-3 levels, any error would have to be fairly subtle.

I have also found some non-determinism with the backwards passes for the D grads. Haven't posted about it yet; will try to today.
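A simple way to surface that kind of non-determinism is to run the same backward computation twice on identical inputs and compare bitwise, not with allclose. A hedged sketch with a deterministic NumPy stand-in (a real check would call the kernel's .backward() twice on a GPU):

```python
import numpy as np

def backward_stand_in(x):
    # Deterministic stand-in for a backward pass (derivative of tanh here).
    # In a real check this would be the kernel's .backward() on a GPU.
    t = np.tanh(x)
    return 1.0 - t ** 2

x = np.random.default_rng(0).standard_normal((8, 64)).astype(np.float32)
g1 = backward_stand_in(x)
g2 = backward_stand_in(x)

# Bitwise comparison, not allclose: any nonzero difference between two
# identical runs flags non-determinism.
assert np.array_equal(g1, g2), f"max abs diff: {np.abs(g1 - g2).max()}"
```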

@garrett361
Author

Also, this is relevant: non-determinism is expected in the backwards pass due to atomic adds, apparently.
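That matches how floating-point atomics behave: float32 addition is not associative, so when atomic adds land in a different order across runs, the accumulated value changes. A small self-contained illustration:

```python
import numpy as np

# float32 addition is not associative: tiny terms added one at a time into
# an already-large accumulator vanish entirely, while summing them first
# preserves them. Atomic adds can land in a different order on every run,
# so accumulated gradients can legitimately differ between runs.
vals = np.array([1.0] + [1e-8] * 10_000, dtype=np.float32)

def seq_sum(a):
    acc = np.float32(0.0)
    for v in a:
        acc = np.float32(acc + v)
    return acc

big_first = seq_sum(vals)          # 1.0 first: every 1e-8 rounds away
small_first = seq_sum(vals[::-1])  # tiny terms first: they survive

print(big_first, small_first)  # differ by roughly 1e-4
```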

@karannb

karannb commented Apr 15, 2025

Any idea why the tolerances need to be that high? Those tolerances seem very high for float32. It is probably related to #683 #571

Hi, thanks for mentioning this. I posted a solution for my case in #571; you might want to check that. I was able to get tolerances down to 1e-8 for all gradients and outputs.
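As a general illustration (not necessarily what the #571 fix does), the reachable tolerance tracks the accumulation dtype: the same order-dependent sum that moves by roughly 1e-4 in float32 is stable to roughly 1e-12 in float64, which is the regime where 1e-8 comparisons become realistic.

```python
import numpy as np

def seq_sum(a):
    # Sequential accumulation in the array's own dtype.
    acc = a.dtype.type(0.0)
    for v in a:
        acc = a.dtype.type(acc + v)
    return acc

vals = [1.0] + [1e-8] * 10_000
diff32 = abs(seq_sum(np.array(vals, dtype=np.float32))
             - seq_sum(np.array(vals[::-1], dtype=np.float32)))
diff64 = abs(seq_sum(np.array(vals, dtype=np.float64))
             - seq_sum(np.array(vals[::-1], dtype=np.float64)))

print(diff32)  # order-dependent by roughly 1e-4 in float32
print(diff64)  # order-independent to roughly 1e-12 or better in float64
```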
