1 parent 5a4b905 commit 27015fa
slm/pipelines/examples/contrastive_training/README.md
@@ -18,10 +18,11 @@ pip install -r slm/pipelines/examples/contrastive_training/requirements.txt
```
-Download the DuReader-Retrieval Chinese dataset:
+Download the DuReader-Retrieval and MMarco-Retrieval Chinese datasets:
cd data
wget https://paddlenlp.bj.bcebos.com/datasets/dureader_dual.train.jsonl
+python download_mmarco.py
## Training
slm/pipelines/examples/contrastive_training/data/download_mmarco.py
@@ -22,9 +22,7 @@
print(len(dataset["train"]))
-fw = open(
- "/141nfs/lizhuoqun/PaddleNLP_1022/PaddleNLP/slm/pipelines/examples/contrastive_training/data/mmarco.jsonl", "w"
-)
+fw = open("./mmarco.jsonl", "w")
i = 0
for data in tqdm.tqdm(dataset["train"]):
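The change above swaps a machine-specific absolute path for `./mmarco.jsonl`, which resolves against the current working directory (hence the `cd data` step in the README). As a minimal sketch of the same idea, the helper below writes records to a caller-supplied JSONL path and closes the file through a context manager. The function name `write_jsonl` and the record fields in the usage note are hypothetical and are not taken from `download_mmarco.py`.

```python
import json
from pathlib import Path


def write_jsonl(records, out_path):
    """Write an iterable of dicts to out_path, one JSON object per line.

    Hypothetical helper for illustration; not part of download_mmarco.py.
    Returns the number of records written.
    """
    out_path = Path(out_path)
    # Create the parent directory if it does not exist yet.
    out_path.parent.mkdir(parents=True, exist_ok=True)
    count = 0
    # The context manager closes the file even if serialization fails midway.
    with out_path.open("w", encoding="utf-8") as fw:
        for record in records:
            # ensure_ascii=False keeps Chinese text readable in the output file.
            fw.write(json.dumps(record, ensure_ascii=False) + "\n")
            count += 1
    return count
```

A call such as `write_jsonl([{"query": "q1", "pos": ["p1"]}], "./mmarco.jsonl")` would then produce the file in whatever directory the script is run from; the `query` and `pos` keys here are placeholders, not the script's actual schema.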