Commit

Update "Download and deploy ELSER" snippet with adaptive allocations (#2878) (#2880)

(cherry picked from commit 5740148)

Co-authored-by: Liam Thompson <[email protected]>
mergify[bot] and leemthompo authored Nov 27, 2024
1 parent e72d950 commit 29a10ed
Showing 1 changed file with 6 additions and 1 deletion.
7 changes: 6 additions & 1 deletion docs/en/stack/ml/nlp/ml-nlp-elser.asciidoc
@@ -124,14 +124,19 @@ PUT _inference/sparse_embedding/my-elser-model
 {
   "service": "elasticsearch",
   "service_settings": {
-    "num_allocations": 1,
+    "adaptive_allocations": {
+      "enabled": true,
+      "min_number_of_allocations": 1,
+      "max_number_of_allocations": 10
+    },
     "num_threads": 1,
     "model_id": ".elser_model_2_linux-x86_64"
   }
 }
 ----------------------------------
 --
 The API request automatically initiates the model download and then deploy the model.
+This example uses <<ml-nlp-auto-scale,autoscaling>> through adaptive allocation.

 Refer to the {ref}/infer-service-elser.html[ELSER {infer} service documentation] to learn more about the available settings.
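
For readers who want to issue the updated request from a script rather than the Console, the following is a minimal sketch of how the post-change request could be assembled in Python. It is not part of this commit. The helper name `build_elser_endpoint_request`, its parameters, and the `http://localhost:9200` base URL are assumptions made for illustration; the endpoint path and body fields are taken from the snippet in the diff. The sketch only builds the request and does not send it.

```python
import json
import urllib.request


def build_elser_endpoint_request(base_url, inference_id,
                                 min_allocations=1, max_allocations=10):
    """Build (but do not send) a PUT request mirroring the updated snippet.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    if min_allocations > max_allocations:
        raise ValueError("min_allocations must not exceed max_allocations")

    # Body fields copied from the snippet after this change:
    # "adaptive_allocations" replaces the fixed "num_allocations" setting.
    body = {
        "service": "elasticsearch",
        "service_settings": {
            "adaptive_allocations": {
                "enabled": True,
                "min_number_of_allocations": min_allocations,
                "max_number_of_allocations": max_allocations,
            },
            "num_threads": 1,
            "model_id": ".elser_model_2_linux-x86_64",
        },
    }

    url = f"{base_url.rstrip('/')}/_inference/sparse_embedding/{inference_id}"
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )


# Assumed local cluster address; replace with your own deployment URL.
request = build_elser_endpoint_request("http://localhost:9200", "my-elser-model")
payload = json.loads(request.data)
```

Sending the request (for example with `urllib.request.urlopen(request)`) would additionally need a reachable cluster and whatever authentication your deployment requires, which this sketch leaves out.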
