Skip to content

Conversation

shen-shanshan
Copy link
Collaborator

@shen-shanshan shen-shanshan commented Jul 25, 2025

What this PR does / why we need it?

  1. Move torchair_attention to torchair dir.
  2. Make AscendAttentionTorchairBackend extend AscendAttentionBackend to reduce duplicate methods.
  3. Make AscendTorchairMetadata extend AscendMetadata to reduce duplicate properties.

Does this PR introduce any user-facing change?

How was this patch tested?

Copy link

This pull request has conflicts, please resolve those before we can evaluate the pull request.

1 similar comment
Copy link

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@shen-shanshan shen-shanshan changed the title [Misc] Refactor TorchAir Attention [3/N][Refactor] Refactor TorchAir Attention Aug 10, 2025
@shen-shanshan shen-shanshan changed the title [3/N][Refactor] Refactor TorchAir Attention [3/N][Refactor] Refactor torchair attention Aug 10, 2025
@shen-shanshan shen-shanshan marked this pull request as ready for review August 11, 2025 09:48
@shen-shanshan shen-shanshan changed the title [3/N][Refactor] Refactor torchair attention [3/N][Refactor] Move torchair_attention to torchair dir Aug 11, 2025
@shen-shanshan shen-shanshan changed the title [3/N][Refactor] Move torchair_attention to torchair dir [3/N][Refactor] Move torchair_attention to torchair dir Aug 11, 2025
Copy link

This pull request has conflicts, please resolve those before we can evaluate the pull request.

@shen-shanshan shen-shanshan added ready-for-test start test by label for PR accuracy-test enable all accuracy test for PR labels Aug 13, 2025
Copy link

codecov bot commented Aug 13, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
⚠️ Please upload report for BASE (main@1b40665). Learn more about missing BASE report.
⚠️ Report is 11 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #2017   +/-   ##
=======================================
  Coverage        ?   76.18%           
=======================================
  Files           ?      120           
  Lines           ?    13532           
  Branches        ?        0           
=======================================
  Hits            ?    10310           
  Misses          ?     3222           
  Partials        ?        0           
Flag Coverage Δ
unittests 76.18% <100.00%> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Copy link
Collaborator

@wangxiyuan wangxiyuan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please rebase to main to let the CI happy.

Signed-off-by: shen-shanshan <467638484@qq.com>
Signed-off-by: shen-shanshan <467638484@qq.com>
Signed-off-by: shen-shanshan <467638484@qq.com>
Signed-off-by: shen-shanshan <467638484@qq.com>
@wangxiyuan wangxiyuan merged commit 83e0f41 into vllm-project:main Aug 19, 2025
21 of 22 checks passed
wangxiaoteng888 pushed a commit to LCAIZJ/vllm-ascend that referenced this pull request Sep 25, 2025
…ject#2017)

### What this PR does / why we need it?

1. Move `torchair_attention` to `torchair` dir.
2. Make `AscendAttentionTorchairBackend` extend `AscendAttentionBackend`
to reduce duplicate methods.
3. Make `AscendTorchairMetadata` extend `AscendMetadata` to reduce
duplicate properties.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?


- vLLM version: v0.10.0
- vLLM main:
vllm-project/vllm@0933f9d

---------

Signed-off-by: shen-shanshan <467638484@qq.com>
chopper0126 pushed a commit to chopper0126/vllm-ascend that referenced this pull request Sep 26, 2025
…ject#2017)

### What this PR does / why we need it?

1. Move `torchair_attention` to `torchair` dir.
2. Make `AscendAttentionTorchairBackend` extend `AscendAttentionBackend`
to reduce duplicate methods.
3. Make `AscendTorchairMetadata` extend `AscendMetadata` to reduce
duplicate properties.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?


- vLLM version: v0.10.0
- vLLM main:
vllm-project/vllm@0933f9d

---------

Signed-off-by: shen-shanshan <467638484@qq.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
accuracy-test enable all accuracy test for PR module:core module:tests ready-for-test start test by label for PR
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants