Skip to content

support nvidia gpu mig resources #258

@pisko1979

Description

@pisko1979

Hi all

thanks for time to make tools, great job!
we are using your tool to track gpu resources that are allocated to pods/nodes. it works great with standard
nvidia.com/gpu resource description.

we have nVidia H100 GPU and configured it to use MIG partition so we can run multiple pods on one gpu. it works great.
but after change, noticed that kubectl-view-allocation doesnt "find new virtual gpus" as they are registered on node as new Capacity nvidia.com/mig-xxx

this is describe of node having one H100 and applied mixed, all-balanced mig profile.

nvidia.com/gpu: 0
nvidia.com/mig-1g.12gb: 2
nvidia.com/mig-2g.24gb: 1
nvidia.com/mig-3g.47gb: 1

as there are no gpu (nvidia.com/gpu count 0) kubectl-view-allocation doesnt even list node that have GPU in it.
please check is there way to incude also nvidia.com/mig-*?

thanks
regards

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions