install-embeddings

Requirements

eksctl (min. version 0.190.0)
aws cli
helm cli
kubectl
glasskube
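
Before going further, it can help to confirm that every tool above is on your PATH and that eksctl meets the minimum version. The following is a minimal sketch, not part of this repository: missing_tools and version_ge are hypothetical helper names, the version comparison assumes a sort that supports -V, and it assumes eksctl version prints a plain version string.

```shell
# Sketch only: missing_tools and version_ge are hypothetical helpers,
# not part of this repository.

# Print every named tool that is not on PATH, one per line.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# Succeed when version $1 is greater than or equal to version $2.
# Assumes a sort that supports -V (version sort), such as GNU coreutils.
version_ge() {
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Report anything from the requirements list that is missing.
missing="$(missing_tools eksctl aws helm kubectl glasskube)"
if [ -n "$missing" ]; then
  echo "Missing tools:" $missing >&2
fi

# Assumption: `eksctl version` prints a plain version string such as 0.190.0.
if command -v eksctl >/dev/null 2>&1; then
  version_ge "$(eksctl version)" 0.190.0 || echo "eksctl is older than 0.190.0" >&2
fi
```

The script only prints warnings and never exits, so it is safe to paste into an interactive shell.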

Check AWS quota

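Ensure your account has quota for:

At least ${gpu_count}*4 vCPUs under "Running On-Demand G and VT instances" in your chosen region (this quota is counted in vCPUs, and the default g4dn.xlarge has 4)
At least 1 load balancer for each model you want to serve (not one per running server)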
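
The vCPU arithmetic above can be sketched as a small shell helper. required_vcpus is a hypothetical name, not part of this repository, and the sketch assumes every GPU node has the same vCPU count (4 by default, matching g4dn.xlarge). The commented aws command is one way to look up the current quota value; it assumes the quota is listed under the name shown in the AWS console.

```shell
# Sketch only: required_vcpus is a hypothetical helper, not part of this repository.
# Usage: required_vcpus GPU_COUNT [VCPUS_PER_NODE]
# VCPUS_PER_NODE defaults to 4, the vCPU count of a g4dn.xlarge.
required_vcpus() {
  count="$1"
  per_node="${2:-4}"
  echo $(( count * per_node ))
}

required_vcpus 1     # prints 4
required_vcpus 2 8   # prints 16

# To compare against the current quota (assumes the quota name below matches
# what your account shows; adjust --region as needed):
# aws service-quotas list-service-quotas --service-code ec2 --region us-east-2 \
#   --query "Quotas[?QuotaName=='Running On-Demand G and VT instances'].Value" \
#   --output text
```

If the reported quota is lower than the number printed by required_vcpus, request an increase before creating the cluster.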

Create EKS cluster and install needed plugins

Modify the following lines in create_cluster.sh

To get your account ID, run

aws sts get-caller-identity

install-embeddings/create_cluster.sh, lines 7 to 12 in d55047e:

    account_id=555555555555
    region=us-east-2
    cluster_name=trieve-gpu
    main_instance_type=t3.small
    gpu_instance_type=g4dn.xlarge
    gpu_count=1
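
To avoid copying the account ID by hand, the AWS CLI can print just that field. The sketch below is illustrative only: current_account_id and is_account_id are hypothetical helper names, not part of this repository, and the validation simply checks for the 12-digit form that AWS account IDs use.

```shell
# Sketch only: both helpers are hypothetical, not part of this repository.

# Print only the Account field of the caller identity.
# --query and --output are standard AWS CLI options.
current_account_id() {
  aws sts get-caller-identity --query Account --output text
}

# Succeed when $1 is exactly 12 decimal digits.
is_account_id() {
  case "$1" in
    ''|*[!0-9]*) return 1 ;;
  esac
  [ "${#1}" -eq 12 ]
}

# Example usage (requires configured AWS credentials):
# id="$(current_account_id)"
# is_account_id "$id" && echo "account_id=$id"
```

The printed account_id=... line can then be pasted into create_cluster.sh in place of the placeholder value.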

Run ./create_cluster.sh to create the cluster

Specify your embedding models

Edit embedding_models.yaml to list the models that you want to serve

Install the helm chart

helm upgrade -i embedding-release oci://registry-1.docker.io/trieve/embeddings-helm -f embedding_models.yaml

Get your model endpoints

kubectl get ing
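
If you only want each ingress name next to its load balancer hostname, a JSONPath query can trim the output. This is a minimal sketch: list_endpoints is a hypothetical helper, not part of this repository, and it assumes the ingress controller has already populated status.loadBalancer with a hostname, which can take a few minutes after the chart is installed.

```shell
# Sketch only: list_endpoints is a hypothetical helper, not part of this repository.
# Prints one "name<TAB>hostname" line per ingress in the current namespace.
# KUBECTL can be overridden to point at a different binary or a stub.
list_endpoints() {
  "${KUBECTL:-kubectl}" get ingress -o \
    jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.loadBalancer.ingress[0].hostname}{"\n"}{end}'
}

# Example usage (requires access to the cluster):
# list_endpoints
```

An empty second column usually means the load balancer is still being provisioned.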

Cleanup

helm uninstall embedding-release
./delete_cluster.sh
