Adjust LLM parameters to speed things up #361

MrOrz · 2025-02-07T20:07:31Z

The most recent model, gemini-2.0-flash-001, suffers from slow response issue, compared to its predecessor, the experimental gemini-2.0-flash-exp.

On the other hand, I found that gemini-1.5-pro-002 in asia-east1 (Taiwan) and asia-northeast1 (Tokyo) is actually as fast as gemini-2.0-flash-exp on us-cental1, and its quality is also pretty good.

This PR sets up the LLM model settings for LLM transcript to prioritize gemini-1.5-pro-002 in Asia, then gemini-2.0-flash-exp. For now, we ignore the latest gemini-2.0-flash-001 until it gets a significant speed boost.

…ation

coveralls · 2025-02-07T20:13:52Z

coverage: 83.059% (+0.006%) from 83.053%
when pulling 784b3f5 on llm-adjustment
into e188e81 on master.

MrOrz added 4 commits February 8, 2025 04:04

refactor: Update Gemini model list and VertexAI location configuration

87335c3

feat: Add model-location configuration for Vertex AI transcript gener…

b684004

…ation

style: Fix linting issues in util.js

f01dcd7

feat(util): select fast region with fast model that can transcribe well

784b3f5

MrOrz requested review from nonumpa, andyy0216 and bil4444 February 8, 2025 07:09

MrOrz self-assigned this Feb 8, 2025

MrOrz marked this pull request as ready for review February 8, 2025 07:09

bil4444 approved these changes Feb 8, 2025

View reviewed changes

MrOrz merged commit 51658aa into master Feb 8, 2025
4 checks passed

MrOrz deleted the llm-adjustment branch February 8, 2025 07:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adjust LLM parameters to speed things up #361

Adjust LLM parameters to speed things up #361

MrOrz commented Feb 7, 2025 •

edited

Loading

coveralls commented Feb 7, 2025

Adjust LLM parameters to speed things up #361

Adjust LLM parameters to speed things up #361

Conversation

MrOrz commented Feb 7, 2025 • edited Loading

coveralls commented Feb 7, 2025

MrOrz commented Feb 7, 2025 •

edited

Loading