Do you plan to support heterogeneous schedules for FusedEmbeddingBag operations?

The [RecFlex](https://dl.acm.org/doi/10.1109/SC41406.2024.00047) paper points out that the embedding tables within a fused embedding bag collection can be heterogeneous, for example differing in embedding dimension or access pattern (one-hot vs. multi-hot). Applying the same code schedule to every table in the fused kernel can therefore lead to sub-optimal performance.

I’m wondering if there is any plan to support generating and compiling kernels at runtime, so that different tables can use different code schedules, for both inference and training?
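
To make the request concrete, here is a rough sketch of what "different schedules for different tables" could look like at the planning level. It is purely illustrative: `TableSpec`, `Schedule`, `pick_schedule`, and the heuristic itself are hypothetical names and assumptions of mine, not the RecFlex algorithm and not an existing FBGEMM API.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class TableSpec:
    """Hypothetical per-table description inside one fused collection."""
    name: str
    embedding_dim: int
    avg_pooling_factor: float  # 1.0 means one-hot, > 1.0 means multi-hot


@dataclass(frozen=True)
class Schedule:
    """Hypothetical schedule knobs a runtime code generator might specialize on."""
    threads_per_row: int
    unroll_bag: bool


def pick_schedule(spec: TableSpec) -> Schedule:
    # Assumed heuristic: one thread per 4 elements of a row, capped at a warp (32).
    threads_per_row = min(32, max(1, spec.embedding_dim // 4))
    # Assumed heuristic: only multi-hot tables benefit from unrolling the bag loop.
    unroll_bag = spec.avg_pooling_factor > 1.0
    return Schedule(threads_per_row, unroll_bag)


def group_by_schedule(tables):
    """Group table names by chosen schedule, so each group could get its own kernel variant."""
    groups = defaultdict(list)
    for spec in tables:
        groups[pick_schedule(spec)].append(spec.name)
    return dict(groups)


tables = [
    TableSpec("user_id", embedding_dim=128, avg_pooling_factor=1.0),
    TableSpec("clicked_items", embedding_dim=32, avg_pooling_factor=40.0),
    TableSpec("country", embedding_dim=8, avg_pooling_factor=1.0),
]
groups = group_by_schedule(tables)
```

With these three example tables, each one ends up with a different schedule, whereas a single fused kernel today would apply one schedule to all of them.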

Do you plan to support heterogeneous schedules for FusedEmbeddingBag operations? #4152
