Fix tests for mps support by deathcoder · Pull Request #2005 · DLR-RM/stable-baselines3

deathcoder · 2024-09-14T16:15:12Z

Description

closes #914

When I started on the base branch feat/mps-support, there were 45 failing tests that I now consider fixed. A few things to note:

In most cases I added a check: if the MPS device is available, I apply casts so that tensors are float32 and remain float32. I'm not sure this approach is correct, but I'm happy to change it to something else that also works.
I decided to skip the test_float64_action_space tests entirely, since float64 is not supported.
The test test_save_load[True-SAC] only fails when running the full suite or all test_save_load tests (make pytest or python3 -m pytest -v -k 'test_save_load'). If I instead run the single failing test (python3 -m pytest -v -k 'test_save_load[True-SAC]'), it passes 🤷‍♂️. I also ran the test file in PyCharm and it passes there too, so I'm not sure what the issue is. I can add the stack trace of the failing test in a comment if needed.
I'm not sure about a few things regarding this template. I think these are not breaking changes, but, for example, I force a cast in vec_normalize:normalize_reward; maybe that is considered breaking?
I also looked into the changelog, but I couldn't figure out how to edit it.
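
For illustration, the device check described in the first note could look roughly like the sketch below. The helper name `cast_float64_for_device` and its `device_type` string argument are hypothetical and are not taken from this PR or from stable-baselines3; the sketch only shows the general idea of downcasting float64 arrays when the target device is MPS and leaving everything else untouched.

```python
import numpy as np


def cast_float64_for_device(array: np.ndarray, device_type: str) -> np.ndarray:
    """Downcast float64 arrays to float32 when targeting MPS (hypothetical helper)."""
    # Only float64 data headed to the "mps" device is converted.
    if device_type == "mps" and array.dtype == np.float64:
        return array.astype(np.float32)
    # Other dtypes and other devices keep their original array.
    return array


rewards = np.array([0.5, 1.5], dtype=np.float64)
mps_rewards = cast_float64_for_device(rewards, "mps")
cpu_rewards = cast_float64_for_device(rewards, "cpu")
```

Restricting the cast to float64 inputs keeps integer data and non-MPS code paths unchanged, which is in the same spirit as the later commit "only cast reward from float64 to float32".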

Here is the full list of fixed tests

Unsupported tests fixed by skipping

Motivation and Context

I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist

Note: You can run most of the checks using make commit-checks.

Note: we are using a maximum length of 127 characters per line

Attempt fix ci: only cast reward from float64 to float32

araffin · 2024-09-18T12:31:18Z

Hello,
thanks for having a look at that.
Apart from some tests failing, do the algorithms work in normal conditions? (For instance, PPO("MlpPolicy", "Pendulum-v1", device="mps").learn(10_000))

(In theory, if pytorch supports MPS properly, you would only need to specify the device)

deathcoder · 2024-09-18T15:09:42Z

Hey 👋 Yes, that works. I have also tested A2C, both on this branch. I'm still a beginner at this, so I can't really say whether all advanced use cases also work, but I think having the tests pass is a good indicator.


araffin · 2024-11-18T14:23:58Z

I think most issues are related to numpy v2, and should be fixed in #2041 too.

araffin · 2024-11-18T15:03:39Z

stable_baselines3/common/envs/bit_flipping_env.py

            # The internal state is the binary representation of the
            # observed one
-            return int(sum(state[i] * 2**i for i in range(len(state))))
+            return int(sum(int(state[i]) * 2**i for i in range(len(state))))


should not be needed anymore (because of the cast)
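
The change in the diff above can be tried in isolation with a small standalone sketch. The function name `bits_to_int` is hypothetical and is not part of stable-baselines3. Wrapping each element in `int()` means the arithmetic is done with Python's arbitrary-precision integers rather than with whatever fixed-width type the elements of `state` have. That this matters because of NumPy v2 scalar promotion is an assumption based on the earlier comment in this thread, not something the diff states.

```python
from typing import Iterable


def bits_to_int(state: Iterable[int]) -> int:
    """Interpret `state` as bits, with index 0 as the least significant bit."""
    # int(bit) turns each element (for example a NumPy integer scalar)
    # into a plain Python int before it is multiplied by 2**i.
    return sum(int(bit) * 2**i for i, bit in enumerate(state))


example = bits_to_int([1, 0, 1])
```

Here `example` is 5 (1·1 + 0·2 + 1·4). As the review comment notes, the per-element `int()` becomes redundant once the state has already been cast to a wide enough integer type.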

Fix tests

1c25053

deathcoder mentioned this pull request Sep 14, 2024

Use MPS device when available #951

Open

14 tasks

deathcoder and others added 3 commits September 17, 2024 17:41

Attempt fix ci: only cast reward from float64 to float32

f822ef5

allow running workflows from ui

1ac4a60

Merge pull request #2 from deathcoder/attempt-fix-ci

9970f51

Attempt fix ci: only cast reward from float64 to float32

Merge branch 'feat/mps-support' into feat/mps-support

5e7372d

araffin changed the base branch from feat/mps-support to master October 29, 2024 16:39

araffin changed the base branch from master to feat/mps-support October 29, 2024 16:40

Merge remote-tracking branch 'origin/feat/mps-support' into feat/mps-support

4c03a25

araffin added the mac os label Nov 18, 2024

Merge branch 'feat/mps-support' into feat/mps-support

0ec37d8

araffin reviewed Nov 18, 2024

deathcoder wants to merge 7 commits into DLR-RM:feat/mps-support from deathcoder:feat/mps-support
