[Serverless]: Provide information on how users can manage ML VCU costs #229

Closed
ppf2 opened this issue Dec 4, 2024 · 0 comments · Fixed by #236

ppf2 commented Dec 4, 2024

Serverless Docs

Welcome to Elastic Serverless

Description

It would be helpful to add another bullet point under this section (https://www.elastic.co/guide/en/serverless/current/elasticsearch-billing.html#elasticsearch-billing-managing-elasticsearch-costs) that covers the two ways to control ML VCU costs:

  • Set adaptive resources to Low, which allows ML to scale down to 0 allocations when there are no active inference requests
  • When using the inference API for Elasticsearch or ELSER, enable adaptive_allocations, which allows ML to scale the models down to 0 allocations when there are no active inference requests
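
For the second option, a minimal sketch of what the request body might look like when creating an inference endpoint with adaptive allocations enabled. It assumes the `elasticsearch` service with `adaptive_allocations` nested under `service_settings`; the endpoint name `my-elser-endpoint`, the model ID `.elser_model_2`, and the maximum of 4 allocations are illustrative placeholders, not values from this issue. The helper `build_adaptive_endpoint_body` is hypothetical and only builds the JSON; it does not call a cluster.

```python
import json


def build_adaptive_endpoint_body(model_id: str, max_allocations: int) -> dict:
    """Build a request body for an inference endpoint that can scale to 0.

    Assumption: the field names follow the inference API's `elasticsearch`
    service settings; verify them against the docs for your version.
    """
    if max_allocations < 1:
        raise ValueError("max_allocations must be at least 1")
    return {
        "service": "elasticsearch",
        "service_settings": {
            "model_id": model_id,
            "num_threads": 1,
            "adaptive_allocations": {
                "enabled": True,
                # 0 lets the deployment scale down fully when idle.
                "min_number_of_allocations": 0,
                "max_number_of_allocations": max_allocations,
            },
        },
    }


if __name__ == "__main__":
    # Placeholder endpoint name and model ID, for illustration only.
    body = build_adaptive_endpoint_body(".elser_model_2", max_allocations=4)
    print("PUT _inference/sparse_embedding/my-elser-endpoint")
    print(json.dumps(body, indent=2))
```

The printed body could then be pasted into the Dev Tools console. Keeping the minimum at 0 is what lets the deployment stop consuming ML VCUs while idle, at the cost of a delay on the first request after it has scaled down.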

Resources and additional context

https://www.elastic.co/guide/en/serverless/current/elasticsearch-billing.html#elasticsearch-billing-managing-elasticsearch-costs
