[Bugfix]:replace npu_incre_flash_attention with npu_fused_infer_atten… #7925
Artifacts
Produced during runtime
Name | Size | Digest | |
---|---|---|---|
vllm-ascend-ubuntu-24.04-arm-py3.11-wheel
|
504 KB |
sha256:43331057d5c0a6ea08380b587611ea26c0dd2fc0c11790481e6f56e6fb83b7c1
|
|
vllm-ascend-ubuntu-24.04-py3.11-wheel
|
513 KB |
sha256:6525f3ae68655d2035db4ea44e2a767b535225ccef4d76f0f7bd59e6a86f2ecc
|
|