Skip to content

TLG for k2 #1025

Answered by csukuangfj
homink asked this question in Q&A
Apr 28, 2023 · 3 comments · 13 replies
Discussion options

You must be logged in to vote

This is really helpful. I would like to share what I am doing. I like to connect wav2vec2 output with TLG in K2.

This wav2vec2 was fine-tuned with the vocab:

Is the model fine-tuned with CTC loss?

If not, then you cannot use TLG or T to decode its output.

If yes, then congratulations you can use either T or TLG to decode its output.

The following is an example to decode a Wav2Vec 2.0 model fine-tuned with CTC loss from torchaudio using a T graph:
k2-fsa/k2#1096 (comment)

We also have a C++ runtime in sherpa to support it. Please see
https://k2-fsa.github.io/sherpa/cpp/pretrained_models/offline_ctc/torchaudio.html


By the way, if you only want to T for decoding, you don't need #0, #1, …

Replies: 3 comments 13 replies

Comment options

You must be logged in to vote
5 replies
@homink
Comment options

@csukuangfj
Comment options

@homink
Comment options

@csukuangfj
Comment options

@homink
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by homink
Comment options

You must be logged in to vote
8 replies
@csukuangfj
Comment options

@homink
Comment options

@csukuangfj
Comment options

@csukuangfj
Comment options

@homink
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants