
Conversation

@mxz297
Contributor

@mxz297 mxz297 commented Nov 3, 2025

Summary: #26443 adds a check for nvcc availability as a condition for enabling FlashInfer MoE. On devgpus we may have nvcc, so there is no issue there, but our deployment environment (tw jobs) has no nvcc, so FlashInfer MoE gets disabled.

Differential Revision: D86104899
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request aims to enable FlashInfer in environments where nvcc is not available by removing the nvcc availability check. While this addresses the issue for environments with pre-compiled kernels, it could introduce runtime crashes for users who lack both nvcc and pre-compiled kernels. I've suggested a safer alternative that makes the nvcc check conditional on the VLLM_HAS_FLASHINFER_CUBIN environment variable. This approach provides the desired flexibility for production environments while preserving the safeguard for other users.
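The safer alternative described above can be sketched roughly as follows. This is a minimal illustration, not vLLM's actual implementation: the `VLLM_HAS_FLASHINFER_CUBIN` environment variable comes from the review comment, while the helper names (`has_nvcc`, `can_enable_flashinfer_moe`) are hypothetical.

```python
import os
import shutil


def has_nvcc() -> bool:
    """Return True if the nvcc compiler is on PATH, i.e. JIT compilation is possible."""
    return shutil.which("nvcc") is not None


def can_enable_flashinfer_moe() -> bool:
    """Decide whether FlashInfer MoE can be enabled.

    If the deployment ships pre-compiled kernels (AOT), signalled here by
    VLLM_HAS_FLASHINFER_CUBIN=1, the nvcc check is skipped entirely.
    Otherwise we still require nvcc so that users without pre-compiled
    kernels do not hit a runtime crash when JIT compilation is attempted.
    """
    if os.environ.get("VLLM_HAS_FLASHINFER_CUBIN", "0") == "1":
        return True
    return has_nvcc()
```

Under this scheme, production environments without nvcc opt in explicitly via the environment variable, while the default behavior keeps the safeguard for everyone else.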

@mxz297
Contributor Author

mxz297 commented Nov 3, 2025

@mgoin our internal prod environment uses FlashInfer in an AOT fashion and does not have nvcc. So right now we are seeing FlashInfer MoE being disabled internally, causing a perf regression.

@alecsolder
Contributor

Is there a way we can add unit tests to ensure this doesn't get turned off accidentally again for the model?

@mxz297 mxz297 changed the title do not check nvcc availability [flashinfer][fix] do not check nvcc availability Nov 3, 2025
@heheda12345
Collaborator

Is nvcc required for the JIT compilation of FlashInfer?

