Add dtw preset for large-v3-turbo #2481

Merged 1 commit on Oct 15, 2024

@rotemdan rotemdan commented Oct 15, 2024

See issue #2480 for more information.

The reference alignment heads for large-v3-turbo are:

44, 51, 63, 66, 71, 74

(extracted from the reference Python implementation; see #2480 for details)

Assuming each flat index equals layer * n_text_head + head, n_text_head = 20 gives these {layer, head} pairs:

{2, 4}, {2, 11}, {3, 3}, {3, 6}, {3, 11}, {3, 14}

(Note: I'm not 100% sure. This is an extrapolation based on how the other whisper.cpp alignment head index lists relate to the reference.)
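
For reviewers who want to check the arithmetic, here is a minimal sketch of the conversion I assumed: integer division of the flat index by n_text_head gives the layer, and the remainder gives the head. The `ahead_pair` struct and `split_flat_heads` function are hypothetical names for illustration only; they are not part of whisper.cpp, and the mapping itself is the same extrapolation described in the note above.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a {layer, head} pair (not the whisper.cpp type).
struct ahead_pair {
    int layer; // index of the text decoder layer
    int head;  // index of the attention head within that layer
};

// Assumption: flat index = layer * n_text_head + head.
// Splits each flat alignment head index into a {layer, head} pair.
static std::vector<ahead_pair> split_flat_heads(const std::vector<int> & flat, int n_text_head) {
    assert(n_text_head > 0);
    std::vector<ahead_pair> out;
    out.reserve(flat.size());
    for (int idx : flat) {
        out.push_back({ idx / n_text_head, idx % n_text_head });
    }
    return out;
}
```

Calling `split_flat_heads({44, 51, 63, 66, 71, 74}, 20)` yields the six pairs listed above, starting with {2, 4} and ending with {3, 14}.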

I added a new preset:

static const whisper_ahead g_aheads_large_v3_turbo[]  = { {2, 4}, {2, 11}, {3, 3}, {3, 6}, {3, 11}, {3, 14} };
{ WHISPER_AHEADS_LARGE_V3_TURBO,  { 6, g_aheads_large_v3_turbo  } },
if (params.dtw == "large.v3.turbo")  cparams.dtw_aheads_preset = WHISPER_AHEADS_LARGE_V3_TURBO;

I ran a few tests (in Echogarden) and it seemed to work correctly. The timing accuracy seemed consistent with other models.

Note: I named the preset large.v3.turbo to stay consistent with how periods are used in all the other presets. If you prefer something else, such as large.v3-turbo, feel free to change it.

@ggerganov ggerganov merged commit b604906 into ggerganov:master Oct 15, 2024
44 checks passed