Issues: kubernetes-sigs/gateway-api-inference-extension

v0.4 Release Tracker
#681 opened Apr 13, 2025 by kfswain
Open 4

Issues list

Align 002-api-proposal with current api/v1alpha2 Go Type Definitions (labels: needs-triage)
#816 opened May 10, 2025 by SinaChavoshi
e2e: Add /chat/completions Test Case (labels: good first issue, help wanted, triage/accepted)
#814 opened May 9, 2025 by danehans
Add a generic AddPlugin option to configure the scheduler (labels: needs-triage)
#813 opened May 9, 2025 by liu-cong
Proposal: Add Explicit Status Conditions to InferencePool API Spec (labels: needs-triage)
#806 opened May 8, 2025 by SinaChavoshi
Prefix Aware Scorer (labels: needs-triage)
#801 opened May 8, 2025 by oglok
Add Inference Extension to vLLM Integrations Doc (labels: needs-triage)
#794 opened May 7, 2025 by danehans
Add the ability for scheduling plugins to add/modify requests during request processing (labels: needs-triage)
#791 opened May 7, 2025 by shmuelk
EPP cannot serve /chat/completions API (labels: kind/bug, needs-triage)
#790 opened May 7, 2025 by delavet
Add Performance Benchmarking to Release Doc (labels: documentation, triage/accepted)
#787 opened May 6, 2025 by danehans
Support Semantic Processing using NLP models (labels: needs-triage)
#770 opened May 1, 2025 by rootfs
Docs: YAML Example with multiple inference pools (labels: needs-triage)
#769 opened May 1, 2025 by sriumcp
API Discrepancy: InferencePoolSpec pod selector field name mismatch between API Proposal 002 and current Go type definition (labels: documentation, kind/bug, triage/accepted)
#766 opened Apr 30, 2025 by SinaChavoshi
Enable Conformance Testing for Standalone (Non-Gateway API) EPP Implementations (labels: needs-triage)
#753 opened Apr 28, 2025 by SinaChavoshi
metrics dashboard should be documented for options other than Google Managed Prometheus (labels: good first issue, help wanted, triage/accepted)
#747 opened Apr 26, 2025 by nirrozenbaum
Docs: Create EPP Operations Guide (labels: documentation, help wanted, triage/accepted)
#735 opened Apr 24, 2025 by danehans
Refactor Ext-proc server logic to better reflect EPP Layers (labels: triage/accepted)
#733 opened Apr 23, 2025 by kfswain
Benchmark Test Harness (labels: triage/accepted)
#732 opened Apr 23, 2025 by kfswain
Pod Metrics test flaked (labels: kind/bug, triage/accepted)
#719 opened Apr 21, 2025 by kfswain
make image-build takes long to build inference extension docker image (labels: triage/needs-information)
#717 opened Apr 21, 2025 by nirrozenbaum
replace InferenceModel uniqueness check in code with admission validation webhook (labels: needs-triage)
#716 opened Apr 20, 2025 by nirrozenbaum
Proposal: EPP should support heterogeneous pods across the pool (labels: triage/needs-information)
#715 opened Apr 20, 2025 by nirrozenbaum
InferencePool: Add BBR Config (labels: triage/accepted)
#711 opened Apr 18, 2025 by danehans
Implement Lightweight Scheduler Simulation Tests for Inference Gateway (labels: needs-triage)
#709 opened Apr 18, 2025 by kaushikmitr
Tools: Add BBR Extension Metrics to Dashboards (labels: triage/accepted)
#706 opened Apr 17, 2025 by danehans
Tools: Add Scheduler Plugin Metrics to Dashboards (labels: triage/accepted)
#705 opened Apr 17, 2025 by danehans