By default, the Docker Compose files for the examples deploy Milvus as the vector database with CPU-only support. You must install the NVIDIA Container Toolkit to use Milvus with GPU acceleration. You can also extend the code to add support for any vector store.
- Configure Milvus with GPU Acceleration
- Configure Support for an External Milvus Database
- Add a Custom Vector Store
Use the following procedure to configure Milvus with GPU acceleration.
- In the `deploy/vectordb.yaml` file, change the image tag to include the `-gpu` suffix:

  ```yaml
  milvus:
    container_name: milvus-standalone
    image: milvusdb/milvus:v2.4.4-gpu
    ...
  ```
- In the `deploy/vectordb.yaml` file, add the GPU resource reservation:

  ```yaml
  ...
  depends_on:
    - "etcd"
    - "minio"
  deploy:
    resources:
      reservations:
        devices:
          - driver: nvidia
            capabilities: ["gpu"]
            device_ids: ['${VECTORSTORE_GPU_DEVICE_ID:-0}']
  profiles: ["nemo-retriever", "milvus", ""]
  ```
Use the following procedure to add support for an external Milvus database.
- In the `docker-compose.yaml` file, remove or comment out the `include` path to the `vectordb.yaml` file:

  ```yaml
  include:
    - path:
      # - ./vectordb.yaml
      - ./nims.yaml
  ```
- In the `docker-compose.yaml` file, specify the connection information:

  ```yaml
  environment:
    APP_VECTORSTORE_URL: "http://<milvus-hostname-or-ipaddress>:19530"
    APP_VECTORSTORE_NAME: "milvus"
    ...
  ```
- Restart the containers.
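The exact restart commands depend on how you launched the stack. Assuming the services were started from `deploy/compose/docker-compose.yaml`, a typical restart looks like this (adjust the file path to match your deployment):

```shell
# Stop the running services, then start them again so the
# new vector store settings take effect.
docker compose -f deploy/compose/docker-compose.yaml down
docker compose -f deploy/compose/docker-compose.yaml up -d
```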
Use the following procedure to extend the code to add support for any vector store.
- Navigate to the `src/utils.py` file in the project's root directory.
- Modify the `create_vectorstore_langchain` function to handle your new vector store, and implement the logic for creating your vector store object within it:

  ```python
  def create_vectorstore_langchain(document_embedder, collection_name: str = "") -> VectorStore:
      # existing code
      elif config.vector_store.name == "chromadb":
          from langchain_chroma import Chroma
          import chromadb

          logger.info(f"Using Chroma collection: {collection_name}")
          persistent_client = chromadb.PersistentClient()
          vectorstore = Chroma(
              client=persistent_client,
              collection_name=collection_name,
              embedding_function=document_embedder,
          )
      return vectorstore
  ```
- Update the `get_docs_vectorstore_langchain` function to retrieve the list of documents from your new vector store, and implement your retrieval logic within it:

  ```python
  def get_docs_vectorstore_langchain(vectorstore: VectorStore) -> List[str]:
      # existing code
      elif settings.vector_store.name == "chromadb":
          chroma_data = vectorstore.get()
          filenames = set(
              extract_filename(metadata) for metadata in chroma_data.get("metadatas", [])
          )
          return list(filenames)
  ```
- Update the `del_docs_vectorstore_langchain` function to handle document deletion in your new vector store:

  ```python
  def del_docs_vectorstore_langchain(vectorstore: VectorStore, filenames: List[str]) -> bool:
      # existing code
      elif settings.vector_store.name == "chromadb":
          chroma_data = vectorstore.get()
          for filename in filenames:
              ids_list = [
                  chroma_data.get("ids")[idx]
                  for idx, metadata in enumerate(chroma_data.get("metadatas", []))
                  if extract_filename(metadata) == filename
              ]
              vectorstore.delete(ids_list)
          return True
  ```
- In your `chains.py` implementation, import the preceding functions from `utils.py`. The sample `chains.py` already imports them:

  ```python
  from .utils import (
      create_vectorstore_langchain,
      get_docs_vectorstore_langchain,
      del_docs_vectorstore_langchain,
      get_vectorstore
  )
  ```
- Update `requirements.txt` and add any additional packages that the vector store requires:

  ```
  # existing dependency
  langchain-chroma
  ```
- Navigate to the root directory.
- Set the `APP_VECTORSTORE_NAME` environment variable for the `rag-server` microservice to the name of your newly added vector store:

  ```shell
  export APP_VECTORSTORE_NAME="chromadb"
  ```
- Build and deploy the microservices:

  ```shell
  docker compose -f deploy/compose/docker-compose.yaml up -d --build
  ```