You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed while transcribing some of my own audio that near-silence doesn't get removed during VAD. In fact, running noisereduce actually made the problem dramatically worse, making 10 seconds of falsely detected speech into a minute and a half of falsely detected speech.
Apologies if I'm referring to the wrong version of Silero but it seems like this a known issue / feature(tm). snakers4/silero-vad#396
Preforming a volume filter along with VAD might solve a fair number of hallucinations and might even remove the need to set condition_on_previous_text to False prevent the hallucinations from ruining the rest (section) of the transcription.
I'm down to try it out and open a PR if you all are welcome to it. Before I do though, I'm curious if this came up when adding the hallucination detection logic.
It'd be really nice to have in this library so that I don't have to preform a second layer of timestamp adjustments.
The text was updated successfully, but these errors were encountered:
I noticed while transcribing some of my own audio that near-silence doesn't get removed during VAD. In fact, running
noisereduce
actually made the problem dramatically worse, making 10 seconds of falsely detected speech into a minute and a half of falsely detected speech.Apologies if I'm referring to the wrong version of Silero but it seems like this a known issue / feature(tm). snakers4/silero-vad#396
Preforming a volume filter along with VAD might solve a fair number of hallucinations and might even remove the need to set
condition_on_previous_text
toFalse
prevent the hallucinations from ruining the rest (section) of the transcription.I'm down to try it out and open a PR if you all are welcome to it. Before I do though, I'm curious if this came up when adding the hallucination detection logic.
It'd be really nice to have in this library so that I don't have to preform a second layer of timestamp adjustments.
The text was updated successfully, but these errors were encountered: