[Serverless]: Provide information on how users can manage ML VCU costs #229

ppf2 · 2024-12-04T21:13:36Z

Serverless Docs

Welcome to Elastic Serverless

Description

It can be helpful to add another bullet point under this section https://www.elastic.co/guide/en/serverless/current/elasticsearch-billing.html#elasticsearch-billing-managing-elasticsearch-costs that talks about the two ways to control the ML VCU costs:

Set adaptive resources to Low to allow ML to scale down to 0 # of allocations when there are no active inference requests
When using the inference API for Elasticsearch or ELSER, enable adaptive_allocations which will allow ML to scale down the models to 0 # of allocations when there are no active inference requests

Resources and additional context

https://www.elastic.co/guide/en/serverless/current/elasticsearch-billing.html#elasticsearch-billing-managing-elasticsearch-costs

The text was updated successfully, but these errors were encountered:

shainaraskas self-assigned this Dec 10, 2024

shainaraskas linked a pull request Dec 10, 2024 that will close this issue

Machine learning trained model autoscaling recommendations for cost reduction #236

Merged

shainaraskas closed this as completed in #236 Dec 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Serverless]: Provide information on how users can manage ML VCU costs #229

[Serverless]: Provide information on how users can manage ML VCU costs #229

ppf2 commented Dec 4, 2024

[Serverless]: Provide information on how users can manage ML VCU costs #229

[Serverless]: Provide information on how users can manage ML VCU costs #229

Comments

ppf2 commented Dec 4, 2024

Serverless Docs

Description

Resources and additional context