[Bugfix]:replace npu_incre_flash_attention with npu_fused_infer_atten… #7911
Artifacts
Produced during runtime
Name | Size | Digest
---|---|---
vllm-ascend-ubuntu-24.04-arm-py3.11-wheel | 504 KB | sha256:d6047176cdccf4cb5ec4408ddbcbb8172b8220f4714a334782c99d1aca3f3186
vllm-ascend-ubuntu-24.04-py3.11-wheel | 513 KB | sha256:d7c36875b970352f536b199604029038bd8e59343515fd96bbf9cd253bc43132