We have predefined the task settings mentioned in the paper in the ./envs/tasks/task_specs.yaml file. Each task includes configurations for three stages: data collection, world model and behavior learning, and testing. For example, the task Harvest log in plains is defined as collect_rollouts_harvest_log_in_plains, harvest_log_in_plains, and test_harvest_log_in_plains for these three stages, respectively.
This document introduces the meaning and configuration guidelines for each field in the task settings, enabling you to define your custom tasks by following these examples.
-
task_id: Generally set toharvest. For more details, refer to Task Customization. -
sim: Typically set tominedojo. -
fast_reset: For more information, refer to Reset Mode.- During world model and behavior learning,
fast_resetis often set to5or10to stabilize the training process. - During data collection and testing,
fast_resetis usually set to0to collect more diverse data.
- During world model and behavior learning,
-
sim_specs: Details of the environment setup. See Task Customization for more information.target_names: Specifies the names of the target items forharvesttasks.target_quantities: Specifies the quantities of the target items forharvesttasks.specified_biome: Defines the initial biome of the environment. See Specified Biome.break_speed_multiplier: Adjusts the breaking speed of attacks. See Set Breaking Speed. Commonly set to100to simplify training.initial_inventory: Specifies the initial inventory setup. See Initial Inventory.initial_mobs: Specifies the initial mobs in the environment. See Spawn Mobs.
-
clip_specs: Configuration for usingMineCLIPto calculate intrinsic rewards.prompts: Short textual descriptions of the task, specified aslist[str].
-
concentration_specs: Configuration for calculating intrinsic rewards based on affordance maps and jumping flags.prompts: Short textual descriptions of the task, specified aslist[str].unet_checkpoint_dir: Path to the multimodal U-Net weight files.
-
reward_specs: Specifies rewards for collecting different items. -
success_specs: Configures additional success criteria and actions upon success.reward: Additional reward upon success.any: Specifies that the task is considered successful if any of the listed conditions are met.all: Specifies that the task is considered successful if all listed conditions are met.
-
terminal_specs: Configures conditions for ending an episode.max_steps: Specifies the maximum number of steps per episode.
-
screenshot_specs: Configuration for saving observation images.reset_flag: Whether to save images duringreset. (Images during reset are often darker.)step_flag: Whether to save images during eachstep.save_freq: Interval between saved images on average.
-
LS_Imagine_specs: Settings for integrating the LS-Imagine algorithm with the MineDojo environment.repeat: Number of times a single action is repeated. Typically set to1.