Skip to content

Conversation

anon189Ty
Copy link
Contributor

@anon189Ty anon189Ty commented Sep 12, 2025

What this PR does / why we need it?

If we want a performance improvement by cast the format of weights from ND to NZ in unquantized linear, we need to transpose the weight shape too. But will case some errors in current code when running some models such as DeepSeek-V3-w4a8. Currently temporarily roll back.

Does this PR introduce any user-facing change?

How was this patch tested?

Copy link
Contributor

Note

Gemini is unable to generate a review for this pull request due to the file types involved not being currently supported.

Copy link

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

  • A PR should do only one thing, smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
  • Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

This reverts commit 7b2ecc1.

Signed-off-by: anon189Ty <Stari_Falcon@outlook.com>
Copy link

codecov bot commented Sep 12, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 74.89%. Comparing base (1bbb20e) to head (b3db04e).
⚠️ Report is 36 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2893      +/-   ##
==========================================
+ Coverage   74.76%   74.89%   +0.12%     
==========================================
  Files         150      154       +4     
  Lines       20891    21241     +350     
==========================================
+ Hits        15620    15908     +288     
- Misses       5271     5333      +62     
Flag Coverage Δ
unittests 74.89% <100.00%> (+0.12%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@wangxiyuan
Copy link
Collaborator

756b8a1 merged by this already.

@wangxiyuan wangxiyuan closed this Sep 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants