Rename source format #525

Udayscode · 2025-03-28T16:45:21Z

Description

What is this PR

Bug fix
Addition of a new feature
Other

Why is this PR needed?

This PR addresses issue #422, which requested renaming the source_software attribute to source_format for better clarity and consistency in the movement package.

What does this PR do?

This PR renames source_software to source_format across the codebase, including all relevant Python files, test files, and documentation. It updates function arguments, attributes, and references to ensure consistency.

References

Issue Automatically infer file format without requiring source_software #422: Automatically infer file format without requiring source_software #422

How has this PR been tested?

I set up the project locally using Conda as per the contribution guidelines. Initially, running pytest --cov=movement showed 630 passed and 70 failed tests. After updating the tests to reflect the source_software to source_format change, the results improved to 682 passed and 18 failed. The remaining failures include:

14 failures due to numpy.linalg.LinAlgError in filtering tests (unrelated to my changes).
2 failures due to KeyError: 'source_format' in test_sample_data.py (minor issue, possibly needs a small tweak).

Is this a breaking change?

No, this PR should not break existing functionality—it’s a rename of an attribute and its references. Downstream code using source_software will need to update to source_format.

Does this PR require an update to the documentation?

Yes, I’ve updated the documentation in the docs folder to replace source_software with source_format.

Checklist:

The code has been tested locally
Tests have been added to cover all new functionality
The documentation has been updated to reflect any changes
The code has been formatted with pre-commit

for more information, see https://pre-commit.ci

.github/workflows/test_and_deploy.yml

This reverts commit 9e596be.

sonarqubecloud · 2025-03-28T19:30:07Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

niksirbi

Thanks for taking an interest in contributing to movement @Udayscode — we really appreciate it.

Regarding the test failures:

I can’t reproduce the linalg test failures locally. If you're seeing them on the main branch as well, it might be worth opening an issue and posting the full error message so we can investigate further.
The test_sample_data failures are happening because the field is still named source_software in the metadata file for our sample datasets. That’s not your fault — you don’t have write access to our data repository. I could update the metadata myself, but we’ll need to discuss it with the rest of the team first, since that change would affect CI across all open PRs.

For now, you can apply the two suggestions I left in the review to get the tests passing. We’ll revisit the metadata issue once we’ve resolved the other areas of concern.

That brings me to the broader changes needed for this PR. Right now, the edits don’t go far enough — this requires more than a basic search-and-replace. Here’s what still needs to be addressed:

Please search for all mentions of the word “software” across the codebase. This includes docstrings and the documentation. The surrounding sentences often need rephrasing to reflect the change in terminology — it's not just a matter of renaming.
We no longer need "LightningPose" as a source_format option, and the corresponding functions can be removed:
- load_poses.from_lp_file()
- _ds_from_lp_or_dlc_file() — its logic should be merged into load_poses.from_dlc_file()

The rationale here is that while LightningPose and DeepLabCut are different software packages, they both use the same underlying file format — the “DeepLabCut” format. Since we’re now making source format, rather than source software, the primary categorisation, there’s no need to treat LightningPose as a separate case. The docs/source/input_output.md file must be also updated to reflect this shift.

Let me know if any of that is unclear.

niksirbi · 2025-03-31T16:11:55Z

movement/sample_data.py

            ds = load_module.from_file(
                file_paths[key],
-                source_software=metadata[filename]["source_software"],
+                source_format=metadata[filename]["source_format"],


This is to make the test_sample_data tests pass, until we change the name of this field to "source_format" also on our data repository.

Suggested change

source_format=metadata[filename]["source_format"],

source_format=metadata[filename]["source_software"],

niksirbi · 2025-03-31T16:19:54Z

tests/test_unit/test_sample_data.py

        "sha256sum",
        "type",
-        "source_software",
+        "source_format",


Needs to be changed back to source_software to make the tests pass (until we update the corresponding field in the sample data repository).

Suggested change

"source_format",

"source_software",

Udayscode and others added 3 commits March 26, 2025 01:14

Ensure Windows CI uses D: drive for neuroinformatics-unit#499

9e596be

Renamed source_software to source_format in code, tests, and docs;

bc7a71a

[pre-commit.ci] auto fixes from pre-commit.com hooks

77feb42

for more information, see https://pre-commit.ci

niksirbi reviewed Mar 28, 2025

View reviewed changes

.github/workflows/test_and_deploy.yml Outdated Show resolved Hide resolved

Udayscode mentioned this pull request Mar 28, 2025

Automatically infer file format without requiring source_software #422

Open

Udayscode added 2 commits March 28, 2025 23:39

Revert test_and_deploy.yml to main branch version

59bfcad

Revert "Ensure Windows CI uses D: drive for neuroinformatics-unit#499"

7d6ad36

This reverts commit 9e596be.

Udayscode requested a review from niksirbi March 28, 2025 19:30

niksirbi requested changes Mar 31, 2025

View reviewed changes

Udayscode mentioned this pull request Mar 31, 2025

LinAlgError: SVD did not converge in tests on main branch #532

Closed

niksirbi mentioned this pull request Apr 1, 2025

Ensure Windows CI uses D: drive #518

Closed

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rename source format #525

Rename source format #525

Uh oh!

Udayscode commented Mar 28, 2025

Uh oh!

Uh oh!

sonarqubecloud bot commented Mar 28, 2025

Uh oh!

niksirbi left a comment

Uh oh!

niksirbi Mar 31, 2025

Uh oh!

niksirbi Mar 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	source_format=metadata[filename]["source_format"],
	source_format=metadata[filename]["source_software"],

Rename source format #525

Are you sure you want to change the base?

Rename source format #525

Uh oh!

Conversation

Udayscode commented Mar 28, 2025

Description

References

How has this PR been tested?

Is this a breaking change?

Does this PR require an update to the documentation?

Checklist:

Uh oh!

Uh oh!

sonarqubecloud bot commented Mar 28, 2025

Quality Gate passed

Uh oh!

niksirbi left a comment

Choose a reason for hiding this comment

Uh oh!

niksirbi Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

niksirbi Mar 31, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants