# Add fmperf Library and Update Dependencies #42
Conversation
Signed-off-by: Chen Wang <Chen.Wang1@ibm.com>
[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: wangchen615

The full list of commands accepted by this bot can be found here. The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing
Thanks for sending this out! Having the ability to deploy the model server and benchmark with different configurations makes sense. It would be good to get this working with the inference-perf library and clean up additional logic like report generation that is handled separately by inference-perf.
from kubernetes import client
from fmperf.ModelSpecs import ModelSpec, TGISModelSpec, vLLMModelSpec
Should we call this library something else instead of fmperf? Maybe a name that makes it clear that it simplifies deployment of the model server and the benchmarking tool?
@achandrasekar, what would be a good library name?
@achandrasekar , how about deployer?
Deployer sounds good to me.
from fmperf.Cluster import DeployedModel

class WorkloadSpec:
Can we have this deploy the inference-perf tool instead?
Will take a look at your tool. Thanks, @achandrasekar
pd.set_option("future.no_silent_downcasting", True)

def parse_results(results, print_df=False, print_csv=False):
Would be good to replace this with the reportgen in inference-perf.
- Combined dependencies from both branches
- Preserved fmperf package configuration
- Updated to latest upstream changes, including new features and bug fixes
- Resolved conflicts in pyproject.toml and pdm.lock
I am closing this for now to wait for the community's decision on whether we still want to expand inference-perf to include orchestration, given that llm-d-benchmark already provides a harness for orchestration.
This PR adds the fmperf library and updates project dependencies to support it.

## Changes

### Added

- `fmperf` library with its core components:
  - `Cluster.py`: Kubernetes cluster management
  - `ModelSpecs.py`: model specification handling
  - `WorkloadSpecs.py`: workload configuration
  - `utils/`: utility modules for benchmarking, logging, and data processing
- Updated project dependencies in `pyproject.toml`:
  - `pandas>=2.2.0` for data processing
  - `kubernetes>=29.0.0` for cluster management
  - `pyyaml>=6.0.1` for configuration handling
- Fixed code quality issues:
  - `__all__` exports for better module organization

## Features

## Testing

## Notes
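For reference, the dependency bumps listed under Changes would land in `pyproject.toml` roughly as follows (the exact table layout is an assumption, not copied from the PR):

```toml
[project]
dependencies = [
    "pandas>=2.2.0",      # data processing
    "kubernetes>=29.0.0", # cluster management
    "pyyaml>=6.0.1",      # configuration handling
]
```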