Avoid memory leak in unit test driver #4249

roystgnr · 2025-09-10T13:59:13Z

If we add only a subset of tests to the runner (via --re or --deny_re options), we're careful to move the other tests to a "rejects" suite so they'll get deleted there, but the "supertest", the suite holding all the other tests, wasn't getting deleted in those cases.

This (on top of my earlier fixes, and using the MOOSE suppressions file for third-party library issues) gets selective valgrind runs of our unit tests clean for me. (all-tests runs are clean either way)

This isn't urgent to merge; I still need to run through our examples to look for valgrind issues too. This tiny leak is only a problem because it upsets valgrind, and until we're sure we're ready to make that be a big deal (add a --error-exitcode= option to our valgrind recipes), "do our unit tests upset valgrind" isn't a critical question.

If we add only a subset of tests to the runner (via --re or --deny_re options), we're careful to move the other tests to a "rejects" suite so they'll get deleted there, but the "supertest", the suite holding all the other tests, wasn't getting deleted in those cases. This (on top of my earlier fixes, and using the MOOSE suppressions file for third-party library issues) gets selective valgrind runs of our unit tests clean for me.

roystgnr · 2025-09-10T14:03:07Z

Scratch "gets selective valgrind runs of our unit tests clean for me" - I'm still seeing something else, and I'm not sure if it's just something I missed before or a regression from this PR.

jwpeterson

I had some questions that could probably be cleared up if I read the cppunit docs, but I chose not to 🤷‍♂️

jwpeterson · 2025-09-10T14:18:41Z

tests/driver.C

                                 allow_regex_string, allow_regex,
                                 deny_regex_string, deny_regex,
                                 runner, rejects);
  if (n_tests_added >= 0)
    libMesh::out << "--- Running " << n_tests_added << " tests in total." << std::endl;
+  if (n_tests_added != -12345)


Minor comment, but since this is our own magic number that we now actually have to refer to, it might make sense to use libMesh::invalid_uint or some other named constant.

jwpeterson · 2025-09-10T14:22:31Z

tests/driver.C

                                 allow_regex_string, allow_regex,
                                 deny_regex_string, deny_regex,
                                 runner, rejects);
  if (n_tests_added >= 0)
    libMesh::out << "--- Running " << n_tests_added << " tests in total." << std::endl;
+  if (n_tests_added != -12345)
+    owned_suite.reset(suite);


I guess I don't understand why we don't need to clean up suite when there's no tests added? From a surface level reading of the code it just looks like registry.makeTest() returns a dumb pointer whose lifetime we are expected to manage, and we were always leaking it before...

This should clean up suite when there's no tests added. That'll return n_tests_added == 0, and 0 != -12345, so we put suite in our unique_ptr and it gets cleaned up.

moosebuild · 2025-09-10T18:26:01Z

Job Coverage, step Generate coverage on f452222 wanted to post the following:

Coverage

	7cab9a	#4249 f45222
	Total	Total	+/-	New
Rate	64.76%	64.76%	+0.00%	-
Hits	76352	76353	+1	0
Misses	41556	41555	-1	0

Diff coverage report

Full coverage report

This comment will be updated on new commits.

roystgnr · 2025-09-10T20:06:45Z

There's definitely something weird going on here. This patch cleans up the leaked suite from TestFactoryRegistry::makeTest(), but it starts complaining about a test suite destructor hitting invalid accesses to already-freed memory from TestSuiteFactory<AllSecondOrderTest>::makeTest() in particular.

If every test class was complaining then I'd be sure I'm trying to do a double-free here, once from the runner and then once from recursive destruction from the suite, but it's just AllSecondOrderTest complaining?

No, wait, it's too much of a coincidence that the failing AllSecondOrderTest is first alphabetically. I must be trying to do a double-free here, but something about the first UB causes the destructor to skip the rest.

roystgnr added the do not merge label Sep 10, 2025

jwpeterson approved these changes Sep 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Avoid memory leak in unit test driver #4249

Avoid memory leak in unit test driver #4249

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

jwpeterson left a comment

Uh oh!

jwpeterson Sep 10, 2025

Uh oh!

jwpeterson Sep 10, 2025

Uh oh!

roystgnr Sep 10, 2025

Uh oh!

moosebuild commented Sep 10, 2025

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

Uh oh!

Avoid memory leak in unit test driver #4249

Are you sure you want to change the base?

Avoid memory leak in unit test driver #4249

Uh oh!

Conversation

roystgnr commented Sep 10, 2025

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

jwpeterson left a comment

Choose a reason for hiding this comment

Uh oh!

jwpeterson Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

jwpeterson Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

roystgnr Sep 10, 2025

Choose a reason for hiding this comment

Uh oh!

moosebuild commented Sep 10, 2025

Coverage

Uh oh!

roystgnr commented Sep 10, 2025

Uh oh!

Uh oh!