Performance improvements #139

Closed
svenseeberg opened this issue Jan 15, 2025 · 1 comment · Fixed by #184

Labels
component:chat (Chat Back End), enhancement (New feature or request)

Comments

svenseeberg (Member) commented Jan 15, 2025

We can improve performance by parallelizing long-running tasks, for example language classification and the initial message checks. The document relevance checks can also be parallelized.

A relevance check with 3 documents currently takes about 20 s.
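
A minimal sketch of the idea, assuming the checks are independent of each other and can be awaited. The function names (`classify_language`, `check_message`, `is_relevant`) are hypothetical placeholders, not the project's actual API, and the sleeps only simulate slow model calls:

```python
import asyncio
import time

# Hypothetical stand-ins for the real checks; each sleep simulates a slow model call.
async def classify_language(message: str) -> str:
    await asyncio.sleep(0.1)
    return "en"

async def check_message(message: str) -> bool:
    await asyncio.sleep(0.1)
    return True

async def is_relevant(message: str, document: str) -> bool:
    await asyncio.sleep(0.1)
    return "munich" in document.lower()

async def process(message: str, documents: list[str]) -> dict:
    # The two pre-checks do not depend on each other, so run them concurrently.
    language, accepted = await asyncio.gather(
        classify_language(message), check_message(message)
    )
    # One relevance check per document, all started at the same time.
    flags = await asyncio.gather(*(is_relevant(message, d) for d in documents))
    relevant = [d for d, keep in zip(documents, flags) if keep]
    return {"language": language, "accepted": accepted, "relevant": relevant}

def run_demo():
    docs = ["Sights in Munich", "Munich public transport", "Waste disposal in Berlin"]
    start = time.perf_counter()
    result = asyncio.run(process("What can I do in Munich?", docs))
    return result, time.perf_counter() - start
```

With five simulated calls of 0.1 s each, the sequential version would need about 0.5 s, while this version needs roughly two rounds of 0.1 s. If the real checks are blocking calls, wrapping each one in `asyncio.to_thread` would be one way to get the same effect.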

svenseeberg added the enhancement (New feature or request) and component:chat (Chat Back End) labels Jan 15, 2025
svenseeberg changed the title from "Parallelize classification requests" to "Performance improvements" Jan 15, 2025
dasgoutam (Collaborator) commented

Here are the results of the tests I ran with a single query:

Query: "What can I do in Munich?"
Documents retrieved (avg.): 3
Documents remaining after relevance check (avg.): 2

Single request
Avg. time: 29 s

[Screenshot 2025-01-28 at 3:24:37 PM]

Two parallel requests

  1. Example 1

Request 1: 50 s
[Screenshot 2025-01-28 at 3:31:32 PM]

Request 2: 1.02 min
[Screenshot 2025-01-28 at 3:30:57 PM]

  2. Example 2

Request 1: 52 s
[Screenshot 2025-01-28 at 3:32:51 PM]

Request 2: 1.03 min
[Screenshot 2025-01-28 at 3:32:33 PM]

Three parallel requests

  1. Example 1

Request 1: 1.06 min
[Screenshot 2025-01-28 at 3:39:28 PM]

Request 2: 1.19 min
[Screenshot 2025-01-28 at 3:39:50 PM]

Request 3: 1.32 min
[Screenshot 2025-01-28 at 3:40:16 PM]
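
For repeatable measurements, a small script can fire N requests at once and record each latency. This is only a sketch: `fake_chat_request` is a hypothetical stand-in for the real HTTP call to the chat back end, and the sleep simulates its response time.

```python
import time
from concurrent.futures import ThreadPoolExecutor

def fake_chat_request(query: str) -> str:
    # Hypothetical placeholder for an HTTP request to the chat back end.
    time.sleep(0.05)
    return f"answer to: {query}"

def timed_call(query: str) -> float:
    # Return the wall-clock duration of one request in seconds.
    start = time.perf_counter()
    fake_chat_request(query)
    return time.perf_counter() - start

def measure_parallel(query: str, n: int) -> list[float]:
    # Start n requests at the same time, one worker thread per request.
    with ThreadPoolExecutor(max_workers=n) as pool:
        return list(pool.map(timed_call, [query] * n))

latencies = measure_parallel("What can I do in Munich?", 3)
for i, seconds in enumerate(latencies, start=1):
    print(f"Request {i}: {seconds:.2f} s")
```

Replacing `fake_chat_request` with a real client call would yield per-request timings comparable to the ones listed above.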
