fix state values update bug in ch04/policy_eval.py #20

cutePanda123 · 2025-02-27T21:52:31Z

Issue
The existing code has a bug where the state values(array V) is being updated within the loop. As a result, new calculated values are based on the already updated values rather than the original values from previous state. This leads to incorrect calculations.

Fix
This pull request modifies the code to create a copy of the values before performing calculations. Instead of using the updated values within the loop, the new values are now derived from the copied state, ensuring correctness.

calculate new value states with old value states

08a77be

cutePanda123 changed the title ~~fix state values update bug in ch04/policy_eval.py code~~ fix state values update bug in ch04/policy_eval.py Feb 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix state values update bug in ch04/policy_eval.py #20

fix state values update bug in ch04/policy_eval.py #20

Uh oh!

cutePanda123 commented Feb 27, 2025

Uh oh!

Uh oh!

Uh oh!

fix state values update bug in ch04/policy_eval.py #20

Are you sure you want to change the base?

fix state values update bug in ch04/policy_eval.py #20

Uh oh!

Conversation

cutePanda123 commented Feb 27, 2025

Uh oh!

Uh oh!