- What is Gurubase
- Features
- Quick Install
- How to Create a Guru
- How to Claim a Guru
- Showcase Your Guru
- How to Update Datasources
- License
- Help
- Used By
- Frequently Asked Questions
Gurubase is an open-source RAG system that lets you create AI-powered Q&A assistants ("Gurus") for any topic or need. Create a new Guru by adding:
- 📄 Webpages
- 📑 PDFs
- 🎥 YouTube videos
- 💻 GitHub repositories
Start asking questions directly on Gurubase, or embed it on your website to let your users ask questions about your product. It's already being used by hundreds of open-source repositories. You can also install the entire system on your server, check INSTALL.md for instructions on how to self-host Gurubase.
- 🤖 AI-Powered Q&A: Advanced LLM-based question answering, including instant evaluation mechanism to minimize hallucination as much as possible
- 🔄 RAG System: Retrieval Augmented Generation for accurate, context-aware responses
- 📚 Multiple Data Sources: Add web pages, PDFs, videos, and GitHub repositories as data sources for your Guru.
- 🔌 Easy Integration: Embeddable widget for your website. Discord and Slack Bots coming soon
- 🎯 Custom Gurus: Create specialized AI assistants for specific topics
- 🔄 Real-time Updates: Keep the data sources up to date by reindexing them with one click
- ⛬ Binge: Visualize your learning path while talking with a Guru. You can navigate through it and create a personalized path
- 🛠 Self-hosted Option: Full control over your deployment. Install the entire system on your servers
If you prefer not to use Gurubase.io, you can install the entire system on your own servers.
curl -fsSL https://raw.githubusercontent.com/Gurubase/gurubase/refs/heads/master/gurubase.sh -o gurubase.sh
bash gurubase.sh
See INSTALL.md for detailed installation instructions and prerequisites.
Currently, only the Gurubase team can create a Guru on Gurubase.io. Please open an issue on this repository with the title "Guru Creation Request" and include the GitHub repository link in the issue content. We prioritize Guru creation requests from the maintainers of the tools. Please mention whether you are the maintainer of the tool. If you are not the maintainer, it would be helpful to obtain the maintainer's permission before opening a creation request for the tool.
Although you can't create a Guru on Gurubase.io, you can manage it on Gurubase. For example, you can add, remove, or reindex the datasources. To claim a Guru, you must have a Gurubase account and be one of the tool's maintainers. Please open an issue with the title "Guru Claim Request". Include the link to the Guru (e.g., https://gurubase.io/g/anteon
), your Gurubase username, and a link proving you are one of the maintainers of the tool, such as a PR merged by you.
Add an "Ask AI" widget to your website by importing a small JS script. For an example, check the Anteon docs.
Like hundreds of GitHub repositories, add a badge to your README to guide your users to learn about your tool on Gurubase.
[![Gurubase](https://img.shields.io/badge/Gurubase-Ask%20OpenCost%20Guru-006BFF)](https://gurubase.io/g/opencost)
Datasources can include your tool's documentation webpages, YouTube videos, or PDF files. You can add new ones, remove existing ones, or reindex them. Reindexing ensures your Guru is updated based on changes to the indexed datasources. For example, if you update your tool's documentation, you can reindex those pages so your Guru generates answers based on the latest data.
Once you claim your Guru, you will see your Gurus in the "My Gurus" section.
Click the Guru you want to update. On the edit page, click "Reindex" for the datasource you want to reindex.
You can also see the "Last Index Date" on the URL pages.
Licensed under the Apache 2.0 License.
All the content generated by gurubase.io aligns with the license of the datasources used to generate answers. More details can be found on the Terms of Usage page, Section 2.
We prefer Discord for written communication. Join our channel! To stay updated on new features, you can follow us on X, Mastodon, and Bluesky.
Gurubase currently hosts hundreds of Gurus, and it grows every day. Here are some repositories that showcase their Gurus in their READMEs or documentation.
Gurubase is an open-source RAG system that creates AI-powered Q&A assistants ("Gurus"). It processes various data sources like web pages, videos, PDFs, and GitHub code repositories to provide context-aware answers.
Gurubase uses a modern RAG architecture:
- Indexing: Processes and chunks data sources
- Embedding: Converts text into vector representations
- Storage: Stores vectors in Milvus for efficient similarity search
- Retrieval: Finds relevant context when questions are asked
- Generation: Uses LLMs to generate accurate answers based on retrieved context
- Evaluation: Evaluates the contexts to prevent hallucinations
Check the ARCHITECTURE.md file for more details.
Gurubase supports multiple data source types:
- 📄 Web Pages
- 📑 PDF Documents
- 🎥 YouTube Videos
- 💻 GitHub repositories for codebase indexing
- More formats coming soon! Open an issue if you want a new data source type.
Gurubase follows a microservices architecture, deployed as Docker compose.
- Frontend: Next.js 14 with TailwindCSS
- Backend: Django REST framework
- Vector Store: Milvus
- Message Queue: RabbitMQ
- Cache: Redis
- Database: PostgreSQL See ARCHITECTURE.md for details.
Minimum requirements:
- CPU: 4 cores
- RAM: 8GB
- Storage: 10GB SSD
- OS: Linux or macOS (Windows via WSL2) See INSTALL.md for detailed requirements.
- You can use it on Gurubase.io (or on Gurubase Self-hosted if you’ve installed it on your servers).
- You can embed an Ask AI widget into your website.
- You can add a Gurubase badge to your GitHub repository README.
- We will release an API soon.
Discord and Slack integrations are currently in development. Join our Discord for updates.
Binge lets you:
- Create personalized learning paths on any Guru.
- Ask follow-up questions to dive deeper into the content.
- Visualize your learning path on the Binge Map and navigate it easily and efficiently.
- Save your progress to pick up where you left off.
- Manual reindexing available anytime. Check How to Update Datasources section to learn more
- Periodic reindexing will be available soon
A public API is in development. Features will include:
- Question answering
- Data source management
- Analytics and usage stats Join our Discord for API release updates.
- Code is licensed under Apache 2.0
- All data is stored locally in self-hosted deployments including the API keys
- No data is sent to external servers except LLM API calls
- Optional telemetry can be disabled
Gurubase.io is a hosted version of Gurubase. It's a great way to get started with Gurubase without the hassle of self-hosting.