Commit e3c9d54

fix: crash when pretraining_dataset with dispatch_batches is false (axolotl-ai-cloud#2558)
1 parent 9eba0ad commit e3c9d54

1 file changed: +3 -4 lines changed

src/axolotl/utils/data/sft.py

Lines changed: 3 additions & 4 deletions
@@ -134,10 +134,9 @@ def prepare_dataset(cfg, tokenizer, processor=None, preprocess_iterable=None):
                 "csv", data_files=f.name, split="train", streaming=True
             )
         else:
-            if is_local_main_process():
-                iter_ds = load_dataset(
-                    path, streaming=True, split=split, name=name, data_files=data_files
-                )
+            iter_ds = load_dataset(
+                path, streaming=True, split=split, name=name, data_files=data_files
+            )
 
         if skip:
             LOG.info(f"Skipping {skip} samples from the dataset")
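
The commit page does not include a traceback, so the following is only a plausible reading of the diff: with the `is_local_main_process()` guard in place, `iter_ds` is assigned on the local main process only, and any later use of `iter_ds` on another process would find the name unbound. The sketch below reproduces that shape in isolation. `fake_load_dataset`, `load_guarded`, `load_unguarded`, and `try_load` are hypothetical names invented for illustration; they are not part of axolotl or `datasets`.

```python
def fake_load_dataset(path):
    """Hypothetical stand-in for a streaming dataset loader."""
    return iter([f"{path}-sample-{i}" for i in range(3)])


def load_guarded(path, is_main):
    # Pre-fix shape: only the local main process assigns iter_ds.
    if is_main:
        iter_ds = fake_load_dataset(path)
    # On any other process the name was never bound, so this line raises.
    return iter_ds


def load_unguarded(path, is_main):
    # Post-fix shape: every process assigns iter_ds, regardless of rank.
    iter_ds = fake_load_dataset(path)
    return iter_ds


def try_load(loader, path, is_main):
    """Return the loaded samples, or the exception name if loading fails."""
    try:
        return list(loader(path, is_main))
    except UnboundLocalError as exc:
        return type(exc).__name__


if __name__ == "__main__":
    print("guarded, main:      ", try_load(load_guarded, "ds", True))
    print("guarded, non-main:  ", try_load(load_guarded, "ds", False))
    print("unguarded, non-main:", try_load(load_unguarded, "ds", False))
```

Under this reading, removing the guard trades one load per node for one load per process, which seems acceptable here because the dataset is opened with `streaming=True` and is not materialized up front.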
