# A FastAPI-based Web Application for Retrieval-Augmented Generation (RAG) using Qdrant and Local LlamaFile
This project updates an existing Azure-based Retrieval-Augmented Generation (RAG) web application by integrating the Qdrant vector database and a local LlamaFile model. The updates include:
- Replacing Azure services with a local LlamaFile for text generation.
- Generating and storing embeddings in Qdrant.
- Using FastAPI to verify the functionality of the RAG implementation via an interactive web interface.
## Features

- **Qdrant Integration**: Qdrant serves as the vector database for storing and querying text embeddings.
- **Local LlamaFile**: A lightweight, local alternative to Azure for generating responses.
- **FastAPI Interface**: An intuitive API for interacting with the RAG setup.
- **Easy Verification**: Access the `/docs` URL to test the `/ask` endpoint.
## Prerequisites

Ensure you have the following installed:
- Python 3.8+
- Required Python dependencies (listed in `requirements.txt`)
## Installation

- Clone the repository:

  ```bash
  git clone https://github.com/APsenpai42/llamafile-qdrant-rag
  cd llamafile-qdrant-rag
  ```
- Create a virtual environment and activate it:

  ```bash
  python -m venv .venv
  source .venv/bin/activate  # On Windows, use .venv\Scripts\activate
  ```
- Install the required dependencies:

  ```bash
  pip install -r requirements.txt
  ```
- Run the local LlamaFile server.
- Update the `.env` file to point to your local LlamaFile server (a quick connectivity check is sketched after these steps). Example:

  ```bash
  LLAMAFILE_API_URL=http://127.0.0.1:8080  # Update with your LlamaFile server URL
  LLAMAFILE_API_KEY="your-llamafile-api-key"  # Replace with your actual API key if required
  ```
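Once the server is running and `.env` is in place, a quick connectivity check can save debugging time later. This is a minimal sketch, assuming the llamafile server exposes the OpenAI-compatible `/v1/chat/completions` route (true for recent llamafile builds, though the path and accepted model name can vary by version) and that `requests` and `python-dotenv` are installed:

```python
import os

import requests
from dotenv import load_dotenv

# Read LLAMAFILE_API_URL and LLAMAFILE_API_KEY from the .env file.
load_dotenv()
base_url = os.getenv("LLAMAFILE_API_URL", "http://127.0.0.1:8080")
api_key = os.getenv("LLAMAFILE_API_KEY", "")

# Assumption: the llamafile server speaks the OpenAI-compatible chat API;
# the exact path and model name handling can differ between llamafile versions.
response = requests.post(
    f"{base_url}/v1/chat/completions",
    headers={"Authorization": f"Bearer {api_key}"} if api_key else None,
    json={
        "model": "LLaMA_CPP",  # most llamafile builds accept any model name here
        "messages": [{"role": "user", "content": "Reply with the word: ready"}],
    },
    timeout=60,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```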
## Qdrant Setup

For this project, we are using an in-memory Qdrant instance, so no separate Qdrant server is required. The embeddings are generated and loaded automatically when the application starts.
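For reference, this is roughly what an in-memory Qdrant setup looks like with the `qdrant-client` package. The collection name, vector size, and sample vectors below are illustrative placeholders, not this project's actual values:

```python
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, PointStruct, VectorParams

# ":memory:" runs Qdrant entirely in-process; data is lost when the app stops.
client = QdrantClient(":memory:")

client.create_collection(
    collection_name="documents",  # illustrative name
    vectors_config=VectorParams(size=384, distance=Distance.COSINE),  # size must match the embedding model
)

# Store an embedding alongside its source text.
client.upsert(
    collection_name="documents",
    points=[PointStruct(id=1, vector=[0.1] * 384, payload={"text": "Qdrant stores vectors."})],
)

# Retrieve the nearest neighbours for a query embedding.
hits = client.search(collection_name="documents", query_vector=[0.1] * 384, limit=3)
print(hits)
```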
## Usage

- Start the application using Uvicorn:

  ```bash
  uvicorn main:app --reload
  ```
- Open your browser and navigate to `http://127.0.0.1:8000`.
- Interact with the `/ask` endpoint by providing a query.
## Verification

- Use the `/ask` endpoint to send queries and verify responses (a minimal client sketch follows below).
- Confirm that embeddings are correctly stored and retrieved from Qdrant.
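A minimal client sketch for exercising the endpoint from Python. The request shape is an assumption (a POST with a JSON `question` field); check the interactive `/docs` page for the endpoint's actual method and schema:

```python
import requests

# Hypothetical payload; the "question" field name is an assumption, see /docs for the real schema.
response = requests.post(
    "http://127.0.0.1:8000/ask",
    json={"question": "What does this project do?"},
    timeout=60,
)
response.raise_for_status()
print(response.json())
```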