-
Notifications
You must be signed in to change notification settings - Fork 46
Open
Description
I would like to train the agent throughout multiple episodes and force it to go back to initial position at the beginning of each episode. I am not able to reset the environment, however. The following 'pseudocode'
`
num_episodes = 10
episode_len = 30
env = marlo.init(join_token)
for ep in range(num_episodes):
print("Running episode {}".format(ep))
observation = env.reset()
done = False
t_iter = 0
while (not done) and (t_iter <= episode_len):
print(" t iter = {}".format(t_iter))
_action = env.action_space.sample()
obs, reward, done, info = env.step(_action)
if done:
break
t_iter += 1
`
seems to execute once and then hangs with the error message:
Running episode 1
WARNING:marlo.base_env_builder:Error on attempting to start mission : A mission is already running.
WARNING:marlo.base_env_builder:Will attempt again after 3 seconds.
repeated multiple times until the whole simulation is killed.
How do I force the agent to make a 'clean start' at the beginning of each training episode?
Metadata
Metadata
Assignees
Labels
No labels