Conversation

janfb
Contributor

@janfb janfb commented Sep 2, 2025

The slow vf tests were running for hours and were eventually killed by the GH Actions runners. I believe some nested fixture calls were causing this. This is now fixed, and the runtime is "down" to 30 min for the slow vf tests.

I also did some refactoring of the vf utils here and there to make them more transparent. IID inference for FMPE is working in parts, see #1656


codecov bot commented Sep 2, 2025

Codecov Report

❌ Patch coverage is 97.05882% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 86.88%. Comparing base (ebcd68e) to head (be335fc).
⚠️ Report is 16 commits behind head on main.
✅ All tests successful. No failed tests found.

Files with missing lines Patch % Lines
sbi/inference/posteriors/vector_field_posterior.py 95.00% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1657      +/-   ##
==========================================
+ Coverage   86.59%   86.88%   +0.28%     
==========================================
  Files         135      134       -1     
  Lines       10931    11909     +978     
==========================================
+ Hits         9466    10347     +881     
- Misses       1465     1562      +97     
Flag Coverage Δ
unittests 86.88% <97.05%> (+0.28%) ⬆️


Files with missing lines Coverage Δ
sbi/inference/potentials/score_fn_iid.py 88.62% <ø> (ø)
sbi/inference/potentials/vector_field_potential.py 87.62% <100.00%> (ø)
sbi/samplers/rejection/rejection.py 87.75% <ø> (ø)
sbi/samplers/score/diffuser.py 91.80% <100.00%> (+5.13%) ⬆️
sbi/inference/posteriors/vector_field_posterior.py 79.10% <95.00%> (+0.36%) ⬆️

... and 21 files with indirect coverage changes

@janfb janfb changed the title Fix vf tests Fix slow vector field tests Sep 2, 2025
@janfb
Contributor Author

janfb commented Sep 2, 2025

@@ -130,7 +94,7 @@ def set_x(
     self,
     x_o: Optional[Tensor],
     x_is_iid: Optional[bool] = False,
-    iid_method: Literal["fnpe", "gauss", "auto_gauss", "jac_gauss"] = "auto_gauss",
+    iid_method: Optional[str] = None,
Contributor

@manuelgloeckler manuelgloeckler Sep 3, 2025


Why remove the Literals?

Contributor Author


It was causing type checker issues. As this is mostly used internally, I think it's fine to relax the constraint to just str.
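For illustration, here is a minimal sketch of how the relaxed `Optional[str]` signature can still be validated at runtime. This is not the actual sbi code: the `VALID_IID_METHODS` tuple (taken from the old Literal type), the helper name, and the fallback-to-default logic are all assumptions.

```python
from typing import Optional

# Hypothetical sketch: method names come from the removed Literal type;
# the validation and fallback logic here are assumptions, not sbi's code.
VALID_IID_METHODS = ("fnpe", "gauss", "auto_gauss", "jac_gauss")

def resolve_iid_method(iid_method: Optional[str] = None) -> str:
    # None falls back to the previous default, "auto_gauss".
    if iid_method is None:
        return "auto_gauss"
    if iid_method not in VALID_IID_METHODS:
        raise ValueError(f"Unknown iid_method: {iid_method!r}")
    return iid_method
```

This keeps the public annotation simple for type checkers while still rejecting typos with a clear error at call time.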

if save_intermediate:
    intermediate_samples.append(samples)

# Check for NaN values after predictor
if torch.isnan(samples).any():
    raise RuntimeError(
Contributor


This is the new RuntimeError which fails the test, right?
It already triggers if a single sample in the batch becomes NaN, which might be too strict.

Contributor Author


Yes, good point. But it can happen that all samples are NaN, and it takes a very long time until accept_reject detects this. So it's probably better to fix that detection problem and allow NaNs for some samples here. Will look into that.

Contributor Author


I moved this check to the posterior level, where the final samples are passed on. I also changed it to .all(), because that was the main issue: FMPE with auto_gauss was returning only NaNs, whereas NPSE with iid-sampling only sometimes returns NaNs.
Therefore, the NPSE iid-score tests pass now. The FMPE iid-score tests are skipped, except for fnpe.
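To illustrate the .any() vs. .all() distinction being discussed, here is a small sketch (assumed, not the exact sbi code): fail only when the *whole* batch is NaN, and drop isolated NaN samples so that accept/reject-style filtering can proceed.

```python
import torch

# Hypothetical sketch of the relaxed check: raise only if every sample
# in the batch contains NaNs; otherwise filter out the bad samples.
def check_samples(samples: torch.Tensor) -> torch.Tensor:
    nan_mask = torch.isnan(samples).any(dim=-1)  # per-sample NaN flag
    if nan_mask.all():
        raise RuntimeError("All samples are NaN; the sampler diverged.")
    return samples[~nan_mask]  # keep only fully finite samples

batch = torch.tensor([[0.0, 1.0], [float("nan"), 2.0], [3.0, 4.0]])
valid = check_samples(batch)  # drops the middle row, keeps the other two
```

With the original .any()-style check, the single NaN row above would already raise; with .all(), the batch passes and only the degenerate all-NaN case errors out early.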

Contributor

@manuelgloeckler manuelgloeckler left a comment


Looks good overall, I will have a closer look into it later.

It could be that a single sample or so becomes NaN and the newly introduced RuntimeError is raised. Previously this wasn't a problem because it would be handled by accept_reject, no?

Contributor

@manuelgloeckler manuelgloeckler left a comment


Great thanks.

Mhh, from your description it could be some of the terminal points in FMPE, which can have a singularity in the drift (in diffusion we add a nugget, i.e. we avoid exactly 0, but I'm not sure for FMPE). Will look into this in #1656, but fine to merge this already.

Edit: Nvm, if it's prior-dependent then it's likely something numerical with the marginal moments given a Uniform distribution.
Edit: Nvm, it is this.

@janfb janfb merged commit eae9cc9 into main Sep 4, 2025
13 checks passed
@janfb janfb deleted the fix-vf-tests branch September 4, 2025 09:02