NeurIPS 2025 paper "MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees"
online-optimization llm-routing sla-management lyapunov-optimization cost-aware-routing sla-guarantees
-
Updated
Sep 22, 2025 - Jupyter Notebook