Skip to content

Conversation

@Li-Z-Q
Copy link
Contributor

@Li-Z-Q Li-Z-Q commented Nov 10, 2025

PR types

New features

PR changes

Models

Description

  1. 扩展训练代码
    paddlenlp/transformers/llm_embed/modeling.py
    paddlenlp/experimental/transformers/mistral/modeling.py
    slm/pipelines/examples/contrastive_training/train.py
    slm/pipelines/examples/contrastive_training/data/download_mmarco.py
    新增支持数据集:MMarcoRetrieval
    新增支持模型:bge-en-icl、LLARA-passage

  2. 扩展删层代码
    slm/pipelines/examples/contrastive_training/shortgpt_prune.py
    新增支持模型:NV-Embed-v1、bge-en-icl、LLARA-passage、Qwen3-embedding

  3. 更新README
    slm/pipelines/examples/contrastive_training/README.md

@paddle-bot
Copy link

paddle-bot bot commented Nov 10, 2025

Thanks for your contribution!

@Li-Z-Q Li-Z-Q changed the title final update expand training and shortgpt_prune code to support more model Nov 11, 2025
Copy link
Collaborator

@DrownFish19 DrownFish19 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Li-Z-Q Li-Z-Q requested a review from DrownFish19 November 11, 2025 11:26
@DrownFish19 DrownFish19 changed the title expand training and shortgpt_prune code to support more model [Embedding] expand training and shortgpt_prune code to support more model Nov 11, 2025
@DrownFish19 DrownFish19 changed the title [Embedding] expand training and shortgpt_prune code to support more model [Embedding] Expand training and shortgpt_prune code to support more model Nov 11, 2025
@swgu98 swgu98 merged commit 4eef107 into PaddlePaddle:develop Nov 12, 2025
8 of 11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants