Skip to content

Commit 8baf85e

Browse files
authored
[Doc] Compatibility matrix for mutual exclusive features (#8512)
Signed-off-by: Wallas Santos <wallashss@ibm.com>
1 parent 1a18238 commit 8baf85e

File tree

13 files changed

+467
-0
lines changed

13 files changed

+467
-0
lines changed

docs/source/index.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -86,6 +86,7 @@ Documentation
8686
serving/usage_stats
8787
serving/integrations
8888
serving/tensorizer
89+
serving/compatibility_matrix
8990
serving/faq
9091

9192
.. toctree::

docs/source/models/performance.rst

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,8 @@ If you frequently encounter preemptions from the vLLM engine, consider the follo
2222

2323
You can also monitor the number of preemption requests through Prometheus metrics exposed by the vLLM. Additionally, you can log the cumulative number of preemption requests by setting disable_log_stats=False.
2424

25+
.. _chunked-prefill:
26+
2527
Chunked Prefill
2628
---------------
2729
vLLM supports an experimental feature chunked prefill. Chunked prefill allows to chunk large prefills into smaller chunks and batch them together with decode requests.

0 commit comments

Comments
 (0)