Open-Source RAG Assistant
DocsGPT is an open-source genAI tool that helps users get reliable answers from any knowledge source, while avoiding hallucinations. It enables quick and reliable information retrieval, with tooling and agentic system capability built in.
- Wide Format Support: Reads PDF, DOCX, CSV, XLSX, EPUB, MD, RST, HTML, MDX, JSON, PPTX, and images.
- Web & Data Integration: Ingests from URLs, sitemaps, Reddit, GitHub and web crawlers.
- Reliable Answers: Get accurate, hallucination-free responses with source citations viewable in a clean UI.
- Actionable Tooling: Connect to APIs, tools, and other services to enable LLM actions.
- Pre-built Integrations: Use readily available HTML/React chat widgets, search tools, Discord/Telegram bots, and more.
- Flexible Deployment: Works with major LLMs (OpenAI, Google, Anthropic) and local models (Ollama, llama_cpp).
- Secure & Scalable: Run privately and securely with Kubernetes support, designed for enterprise-grade reliability.
You can find our roadmap here. Please don't hesitate to contribute or create issues; it helps us improve DocsGPT!
We're eager to provide personalized assistance when deploying your DocsGPT to a live environment.
Name | Base Model | Requirements (or similar) |
---|---|---|
Docsgpt-7b-mistral | Mistral-7b | 1xA10G GPU |
Docsgpt-14b | llama-2-14b | 2xA10 GPUs |
Docsgpt-40b-falcon | falcon-40b | 8xA10G GPUs |
If you don't have enough resources to run a model, you can use bitsandbytes to quantize it.
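To get a feel for how much quantization can help, a back-of-the-envelope estimate of weight memory is often enough. The sketch below assumes the parameter counts implied by the model names (7B, 14B, 40B) and counts weights only, ignoring activations, the KV cache, and framework overhead. The helper name `weight_memory_gb` is illustrative and is not part of DocsGPT or bitsandbytes.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# Parameter counts are assumptions read off the model names in the table.
MODELS = {
    "Docsgpt-7b-mistral": 7e9,
    "Docsgpt-14b": 14e9,
    "Docsgpt-40b-falcon": 40e9,
}

if __name__ == "__main__":
    for name, params in MODELS.items():
        fp16 = weight_memory_gb(params, 16)
        int8 = weight_memory_gb(params, 8)
        int4 = weight_memory_gb(params, 4)
        print(f"{name}: fp16 ~{fp16:.1f} GB, 8-bit ~{int8:.1f} GB, 4-bit ~{int4:.1f} GB")
```

Real memory use will be higher than these figures, so treat them as a lower bound when choosing hardware.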
- Cloud Version
- Join our Discord
- Guides
- How to use any other documentation
- How to host it locally (so all data will stay on-premises)
- Application - Flask app (main application).
- Extensions - Chrome extension.
- Scripts - Script that creates a similarity search index for other libraries.
Note
Make sure you have Docker installed.
On macOS or Linux, run:
./setup.sh
It will install all the dependencies and let you choose between downloading the local model, using OpenAI, or using our LLM API.
Otherwise, refer to this Guide for Windows:
- Download and open this repository with
git clone https://github.com/arc53/DocsGPT.git
- Create a .env file in your root directory and set the env variables and VITE_API_STREAMING to true or false, depending on whether you want streaming answers or not. It should look like this inside:
LLM_NAME=[docsgpt or openai or others]
VITE_API_STREAMING=true
API_KEY=[if LLM_NAME is openai]
See optional environment variables in the /.env-template and /application/.env_sample files.
- Navigate to http://localhost:5173/.
To stop, press Ctrl + C.
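Before starting the containers, it can help to sanity-check the .env file. The following is a minimal sketch, assuming the file uses plain KEY=VALUE lines and only the three variables mentioned in the setup steps; the function names `parse_env` and `check_env` are hypothetical helpers, not part of DocsGPT.

```python
def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, so values may contain "=".
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def check_env(values: dict[str, str]) -> list[str]:
    """Return a list of human-readable problems; empty means it looks fine."""
    problems = []
    if not values.get("LLM_NAME"):
        problems.append("LLM_NAME is not set")
    if values.get("VITE_API_STREAMING", "").lower() not in ("true", "false"):
        problems.append("VITE_API_STREAMING should be true or false")
    if values.get("LLM_NAME") == "openai" and not values.get("API_KEY"):
        problems.append("API_KEY is required when LLM_NAME is openai")
    return problems


if __name__ == "__main__":
    # Sample content with a deliberately missing API_KEY.
    sample = "LLM_NAME=openai\nVITE_API_STREAMING=true\n"
    for problem in check_env(parse_env(sample)):
        print(problem)
```

To check a real file, pass its contents instead of the sample, for example `check_env(parse_env(open(".env").read()))`.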
For development, only two containers from docker-compose.yaml are used: Redis and Mongo (all other services are removed). See the file docker-compose-dev.yaml.
Run
docker compose -f docker-compose-dev.yaml build
docker compose -f docker-compose-dev.yaml up -d
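Once the development containers are up, a quick connectivity check can confirm that Redis and Mongo are reachable before starting the backend. This sketch assumes the services are exposed on localhost at their default ports (6379 for Redis, 27017 for MongoDB); the compose file may map different ports, so adjust `SERVICES` accordingly.

```python
import socket


def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Default ports for Redis and MongoDB; an assumption about the local setup.
SERVICES = {"redis": 6379, "mongo": 27017}

if __name__ == "__main__":
    for name, port in SERVICES.items():
        state = "reachable" if port_open("localhost", port) else "not reachable"
        print(f"{name} on localhost:{port} is {state}")
```

A successful TCP connection only shows that something is listening on the port; it does not verify that the service itself is healthy.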
Note
Make sure you have Python 3.12 installed.
- Export required environment variables or prepare a .env file in the project folder:
  - Copy .env-template and create .env.
(Check out application/core/settings.py if you want to see more config options.)
- (optional) Create a Python virtual environment: You can follow the Python official documentation for virtual environments.
a) On macOS and Linux
python -m venv venv
. venv/bin/activate
b) On Windows
python -m venv venv
venv/Scripts/activate
- Download the embedding model and save it in the model/ folder: You can use the script below, or download it manually from here, unzip it, and save it in the model/ folder.
wget https://d3dg1063dc54p9.cloudfront.net/models/embeddings/mpnet-base-v2.zip
unzip mpnet-base-v2.zip -d model
rm mpnet-base-v2.zip
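On systems without wget or unzip (for example, a stock Windows install), the same download can be done with the Python standard library. This is a sketch of an equivalent, not a script shipped with the project; the function name `download_and_extract` is hypothetical, and the URL is the one used in the commands above.

```python
import shutil
import tempfile
import urllib.request
import zipfile
from pathlib import Path

MODEL_URL = "https://d3dg1063dc54p9.cloudfront.net/models/embeddings/mpnet-base-v2.zip"


def download_and_extract(url: str, dest_dir: str = "model") -> Path:
    """Download a zip archive from url and unpack it into dest_dir."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tempfile.TemporaryDirectory() as tmp:
        archive = Path(tmp) / "archive.zip"
        # Stream the response to disk instead of holding it in memory.
        with urllib.request.urlopen(url) as response, open(archive, "wb") as out:
            shutil.copyfileobj(response, out)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
    # The temporary archive is deleted when the block above exits.
    return dest
```

Calling `download_and_extract(MODEL_URL)` from the project root should leave the unpacked files in the model/ folder, mirroring the three shell commands.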
- Install dependencies for the backend:
pip install -r application/requirements.txt
- Run the app using
flask --app application/app.py run --host=0.0.0.0 --port=7091
- Start the worker with
celery -A application.app.celery worker -l INFO
Note
Make sure you have Node version 16 or higher.
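If you are unsure which Node version is on your PATH, a small check like the one below can confirm the requirement. It is a sketch that assumes the `node` executable is on the PATH and prints its version in the usual `v18.17.0` form; the helper names are illustrative only.

```python
import subprocess


def parse_node_major(version_output: str) -> int:
    """Extract the major version from output such as 'v18.17.0'."""
    return int(version_output.strip().lstrip("v").split(".")[0])


def node_is_supported(min_major: int = 16) -> bool:
    """Return True if `node --version` reports at least min_major."""
    try:
        result = subprocess.run(
            ["node", "--version"], capture_output=True, text=True, check=True
        )
    except (FileNotFoundError, subprocess.CalledProcessError):
        return False  # Node is missing or failed to run.
    return parse_node_major(result.stdout) >= min_major


if __name__ == "__main__":
    print("Node OK" if node_is_supported() else "Node 16 or higher is required")
```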
- Navigate to the /frontend folder.
- Install the required packages husky and vite (ignore if already installed).
npm install husky -g
npm install vite -g
- Install dependencies by running
npm install --include=dev
- Run the app using
npm run dev
Please refer to the CONTRIBUTING.md file for information about how to get involved. We welcome issues, questions, and pull requests.
We as members, contributors, and leaders, pledge to make participation in our community a harassment-free experience for everyone, regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, religion, or sexual identity and orientation. Please refer to the CODE_OF_CONDUCT.md file for more information about contributing.
The source code license is MIT, as described in the LICENSE file.
This project is supported by: