
@Jason233333
Contributor

PR types
BugFix

PR changes
Models

Description
Adds support for PipelineParallel training of Qwen models in RLInfra and fixes a bug in sequence parallelism where the batch size could not be obtained.
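
For illustration only, here is a minimal sketch of the kind of fallback the original PR title describes ("Set default batch_size=1 when empty in sequence parallelism"). It is not taken from this PR's diff: the helper name `resolve_batch_size`, its signature, and the idea that the batch size arrives as `None`, an integer, or a shape-like sequence are all assumptions made for the example.

```python
from typing import Optional, Sequence, Union


def resolve_batch_size(
    batch_size: Union[int, Sequence[int], None],
    default: int = 1,
) -> int:
    """Hypothetical helper: return a usable batch size, or `default` if none is available.

    Assumption: under sequence parallelism the batch dimension may not be
    recoverable from the activations, so the value can arrive as None or as
    an empty shape-like sequence.
    """
    if batch_size is None:
        return default
    if isinstance(batch_size, int):
        return batch_size
    # An empty sequence means no batch dimension could be inferred.
    if len(batch_size) == 0:
        return default
    # Otherwise treat the first entry as the batch dimension.
    return int(batch_size[0])


print(resolve_batch_size(None))
print(resolve_batch_size([]))
print(resolve_batch_size([8, 128]))
```

Whether a silent default of 1 is appropriate depends on how the caller uses the value afterwards; raising an error instead would be the stricter alternative.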

@paddle-bot

paddle-bot bot commented Oct 22, 2025

Thanks for your contribution!

@Jason233333 Jason233333 force-pushed the develop branch 2 times, most recently from 8f6ab47 to 62321b7 Compare October 23, 2025 07:45
@Jason233333 Jason233333 changed the title from "BugFix: Set default batch_size=1 when empty in sequence parallelism" to "BugFix: qwen model sequence parallel can not get batch size" Oct 23, 2025
@Jason233333 Jason233333 force-pushed the develop branch 3 times, most recently from be376d6 to 26909d8 Compare October 24, 2025 02:06
@gongel gongel merged commit c621f0c into PaddlePaddle:develop Oct 24, 2025
9 of 11 checks passed
3 participants