Video RAG

Video Retrieval Extension for VLC
Video Retrieval feature for Shotcut(coming soon)

There are two parts to setting up this project. First is the setup of docker and python on Windows Subsystem for Linux. The second part is the setup of the Video Rag extension for VLC/Shotcut on Windows.

PART ONE: Visual RAG Setup on Window Subsystem for Linux (WSL)

In this part, you will setup WSL, install docker, and run VDMS in docker container.

A. Setup WSL for model server

https://learn.microsoft.com/en-us/windows/wsl/install

You can now install everything you need to run WSL with a single command. Open PowerShell or Windows Command Prompt in administrator mode by right-clicking and selecting "Run as administrator", enter the wsl --install command, then restart your machine.

wsl --install

Set up your Linux user info

B. Install Docker

https://docs.docker.com/desktop/install/ubuntu/

Set up Docker's apt repository

# Add Docker's official GPG key:
sudo apt-get update
sudo apt-get install ca-certificates curl
sudo install -m 0755 -d /etc/apt/keyrings
sudo curl -fsSL https://download.docker.com/linux/ubuntu/gpg -o /etc/apt/keyrings/docker.asc
sudo chmod a+r /etc/apt/keyrings/docker.asc

# Add the repository to Apt sources:
echo \
  "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.asc] https://download.docker.com/linux/ubuntu \
  $(. /etc/os-release && echo "$VERSION_CODENAME") stable" | \
  sudo tee /etc/apt/sources.list.d/docker.list > /dev/null
sudo apt-get update

sudo apt-get install docker-ce docker-ce-cli containerd.io docker-buildx-plugin docker-compose-plugin

Verify that the Docker Engine installation is successful by running the hello-world image.

sudo docker run hello-world

If you are behind a proxy make sure to configure docker for proxies:

sudo mkdir -p /etc/systemd/system/docker.service.d
sudo nano /etc/systemd/system/docker.service.d/http-proxy.conf

Add the following configuration, replacing the proxy URL and port with your actual proxy details:

[Service]
Environment="HTTP_PROXY=http://your.proxy.address:port"
Environment="HTTPS_PROXY=http://your.proxy.address:port"
Environment="NO_PROXY=localhost,127.0.0.1"

Reload Systemd and Restart Docker

sudo systemctl daemon-reload
sudo systemctl restart docker

C. Start VDMS docker container (on WSL)

start VDMS DB as docker container. The folder "vdms_db" will store the database on the host system to enable persistent memory :

sudo docker run -d -p 55555:55555 --rm --name vdms-rag -v ./vdms_db:/vdms/build/db intellabs/vdms:latest

D. Set up visual rag repo (Windows OS)

Git clone the repo

git clone https://github.com/intel/openvino-ai-video-retrieval-analysis.git video_retrieval

create virtual env in the home directory: If python is not yet installed on the system, install python 3.12
```
python -m venv rag_env
```

Activate the virtual env and install the requirements:

rag_env\Scripts\activate
cd video_retrieval
pip install -r docs/requirements.txt

Install landchain-ai

git clone https://github.com/langchain-ai/langchain
pip install -e langchain/libs/community

pip install onnx==1.16.0 openvino==2024.3.0 openvino-dev==2024.3.0

Part TWO: Video RAG Server Setup (Windows OS)

A. Start video RAG

Go to the command window where you have rag_env activated
cd video_retrieval
Update videos path in docs/config.yaml to point to the folder where you have the videos to perform Rag on.
Start Video Rag server.
- For the very first time we will need to generate and store embeddings of the videos in VDMS. Use "-g generate" flag. This step will take few minutes depending on the number of videos. Also, if it doesn't find the OpenVINO clip IRs then wait for it to convert from pytorch to onnx to openvino. Currently NPU compilation takes 29 min (This is a one time step)
```
python VLC_video-retrieval-ui.py -c docs/config.yaml -g generate 
```
- For any subsequent runs, we just need to start the server as we already have the embeddings in the DB.
```
python VLC_video-retrieval-ui.py -c docs/config.yaml
```

B. Setup vlc rag lua extension

Install VLC
copy video_rag.lua and client_Rag to C:\Program Files\VideoLAN\VLC\lua\extensions.
After installation in a new terminal run
```
python -m pip install pywin32
```
Open VLC
Go to View -> Video Retrieval
Two ways to use vlc rag extension:
A. Pass the video folder and retrieve a video clip based on the prompt:
- Type a prompt and click on "Search"
B. Seek to a different position in playing video based on the prompt
- Open a video
- Hit Pause
- Enter a prompt that you want to look for and hit the search button

C. Setup video rag feature in shotcut (coming soon)

Acknowledgements

Video embeddings - https://github.com/ttrigui/GenAIExamples/tree/client_inference

License

There are two licenses associated with this project. The VLC lua sample code is under MIT and everything else is under Apache-2.0

Disclaimer

Ethical Guidelines for GenerativeAI: The Responsible AI (RAI) guidance for internal use of generative AI is to follow the Intel Generative AI Guidelines as they continue to evolve, reference the Ethics Guidance for Generative AI 1.0.docx that the RAI Council put together, and connect with the Generative AI Community of Practice to share BKMs as we learn more about this emerging technology. Please consult this guidance to develop guidelines for your participants.

Human Rights Disclaimer

Intel is committed to respecting human rights and avoiding causing or directly contributing to adverse impacts on human rights. See Intel’s Global Human Rights Policy. See Intel’s Global Human Rights Policy. The software or model from Intel is intended for socially responsible applications and should not be used to cause or contribute to a violation of internationally recognized human rights

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
docs		docs
embedding		embedding
utils		utils
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
LICENSE_vlc_lua_script.md		LICENSE_vlc_lua_script.md
README.md		README.md
SECURITY.md		SECURITY.md
VLC_video-retrieval-ui.py		VLC_video-retrieval-ui.py
client_RAG.py		client_RAG.py
compile_model_npu.py		compile_model_npu.py
video_rag.lua		video_rag.lua

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video RAG

PART ONE: Visual RAG Setup on Window Subsystem for Linux (WSL)

A. Setup WSL for model server

B. Install Docker

C. Start VDMS docker container (on WSL)

D. Set up visual rag repo (Windows OS)

Part TWO: Video RAG Server Setup (Windows OS)

A. Start video RAG

B. Setup vlc rag lua extension

C. Setup video rag feature in shotcut (coming soon)

Acknowledgements

License

Disclaimer

Human Rights Disclaimer

About

Releases

Packages

Contributors 2

Languages

License

intel/openvino-ai-video-retrieval-analysis

Folders and files

Latest commit

History

Repository files navigation

Video RAG

PART ONE: Visual RAG Setup on Window Subsystem for Linux (WSL)

A. Setup WSL for model server

B. Install Docker

C. Start VDMS docker container (on WSL)

D. Set up visual rag repo (Windows OS)

Part TWO: Video RAG Server Setup (Windows OS)

A. Start video RAG

B. Setup vlc rag lua extension

C. Setup video rag feature in shotcut (coming soon)

Acknowledgements

License

Disclaimer

Human Rights Disclaimer

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages