
Adds information about the importance of adaptive allocations #1454

Status: Open. Wants to merge 7 commits into base: main.
Conversation

@kosabogi (Contributor) commented May 22, 2025

📸 Preview

Description

This PR updates the Inference integration documentation to:

  • Clearly state that not enabling adaptive allocations can result in unnecessary resource usage and higher costs.
  • Expand the scope of the page to cover not only third-party service integrations, but also the Elasticsearch service.

Related issue: #1393

@szabosteve (Contributor) left a comment:

It looks great! Left a couple of comments and suggestions.

Co-authored-by: István Zoltán Szabó <szabosteve@gmail.com>
@kosabogi (Contributor, Author):

> It looks great! Left a couple of comments and suggestions.

Thank you! I applied your suggestions in my latest commit.

@ppf2 (Member) commented May 23, 2025

Thanks! I think there are a few different aspects to this we will want to cover (cc: @arisonl @shubhaat )

Adaptive resources enabled (from the UI):

  • Depending on the selected usage level, whether the deployment is optimized for search or ingest, and the platform type (ECH/ECE vs. Serverless), it may or may not autoscale down to 0 allocations when the load is low.

Adaptive resources disabled (from the UI):

  • Even at the low usage level, there will still be at least 1 or 2 allocations, depending on whether the deployment is optimized for search or ingest.

Adaptive allocations enabled (from the API):

  • If enabled, model allocations can scale down to 0 when the load is low unless the user has explicitly specified a >0 min_number_of_allocations setting.

Adaptive allocations disabled (from the API):

  • User defines the num_allocations used by the model.
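For the two API-side cases above, the request bodies differ only in whether an `adaptive_allocations` object or a fixed `num_allocations` is supplied. As an illustrative sketch against the Elasticsearch inference API (the endpoint ID `my-elser-endpoint` and the specific numbers are placeholders, not values from this thread):

```json
PUT _inference/sparse_embedding/my-elser-endpoint
{
  "service": "elasticsearch",
  "service_settings": {
    "model_id": ".elser_model_2",
    "num_threads": 1,
    "adaptive_allocations": {
      "enabled": true,
      "min_number_of_allocations": 0,
      "max_number_of_allocations": 4
    }
  }
}
```

With `enabled: true` and `min_number_of_allocations` at 0 (or omitted), allocations can scale down to 0 under low load; setting it above 0 keeps that many allocations warm at all times. With adaptive allocations disabled, the user would instead set `num_allocations` directly in `service_settings` and the deployment stays at that fixed size.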

@leemthompo (Contributor):

Some things struck me here:

  • the separation between UI and API tabbed sections seems somewhat arbitrary since both are constrained by the same platform-specific infrastructure realities
  • the format forces readers to mentally cross-reference three variables (usage level, optimization type, platform) across multiple paragraphs
  • perhaps we could replace the entire tabbed prose section with a single table?
    • some of the prose is vague and requires guesswork; it might be better defined explicitly

Please disregard if the linked page contains the full details and we're happy to have general overview here :)

@kosabogi (Contributor, Author):

> Some things struck me here:
>
>   • the separation between UI and API tabbed sections seems somewhat arbitrary since both are constrained by the same platform-specific infrastructure realities
>   • the format forces readers to mentally cross-reference three variables (usage level, optimization type, platform) across multiple paragraphs
>   • perhaps we could replace the entire tabbed prose section with a single table?
>     • some of the prose is vague and requires guesswork; it might be better defined explicitly
>
> Please disregard if the linked page contains the full details and we're happy to have general overview here :)

Thank you @ppf2 and @leemthompo for all of your suggestions!
I've updated the Adaptive allocations section by rewriting the content as a table to make it easier to scan and compare configurations across platform, usage level, and optimization type.
Let me know what you think!

@alaudazzi (Contributor) left a comment:

I left a minor suggestion, otherwise LGTM.

Co-authored-by: Arianna Laudazzi <46651782+alaudazzi@users.noreply.github.com>
@leemthompo (Contributor):

Thanks @kosabogi, it might be nice to get a final 👀 from @ppf2 and @shubhaat before merging :)
