About usage #17
Comments
You can obtain the predicted diarization results by following these steps.
Then you can obtain the decision results by applying a threshold, as in the image below, for single-speaker speech extraction.
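The thresholding step described above can be sketched as follows. This is only an illustration, not the repository's code: the function name `probs_to_segments`, the 0.5 threshold, and the `frame_shift` value are assumptions, and the input is assumed to be a frames-by-speakers table of activity probabilities. Each speaker is thresholded independently, so overlapping speech simply produces segments that overlap in time.

```python
from typing import List, Sequence, Tuple

Segment = Tuple[int, float, float]  # (speaker index, start in seconds, end in seconds)


def probs_to_segments(
    probs: Sequence[Sequence[float]],
    threshold: float = 0.5,    # assumed decision threshold
    frame_shift: float = 0.1,  # assumed seconds per output frame; check your config
) -> List[Segment]:
    """Turn per-frame speaker activity probabilities into time segments."""
    segments: List[Segment] = []
    if not probs:
        return segments
    n_frames = len(probs)
    n_speakers = len(probs[0])
    for spk in range(n_speakers):
        start = None  # index of the first frame of the currently open segment
        for t in range(n_frames):
            active = probs[t][spk] > threshold
            if active and start is None:
                start = t
            elif not active and start is not None:
                segments.append((spk, start * frame_shift, t * frame_shift))
                start = None
        if start is not None:  # close a segment that runs to the end
            segments.append((spk, start * frame_shift, n_frames * frame_shift))
    return segments


# Toy example: 4 frames, 2 speakers, both active in frame 1 (overlap).
example = [[0.9, 0.1], [0.8, 0.7], [0.2, 0.9], [0.1, 0.6]]
print(probs_to_segments(example, threshold=0.5, frame_shift=0.5))
```

Keeping one on/off track per speaker is what lets the same frame belong to several speakers at once.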
Hi, what if spk1 and spk2 overlap? I just want code that takes an audio file as input and outputs the timestamps.
Sorry, I misunderstood and thought your input was non-overlapping multi-speaker speech. FS-EEND can naturally handle overlapping speech. An example of the output is shown in the figure below. The code for receiving a WAV file and outputting a Rich Transcription Time Marked (RTTM) file has been updated. Before running inference, set val_data_dir in conf/xxx_infer.yaml to your own WAV directory.
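For readers unfamiliar with RTTM, here is a minimal sketch of how speaker segments map onto RTTM `SPEAKER` lines. It is not the repository's writer: the function name `segments_to_rttm`, the `spk<N>` speaker labels, and the channel value `1` are assumptions made for illustration. Each line carries the onset and duration of one turn, so overlapping speakers appear as lines whose time ranges intersect.

```python
from typing import Iterable, Tuple


def segments_to_rttm(file_id: str, segments: Iterable[Tuple[int, float, float]]) -> str:
    """Format (speaker, start_sec, end_sec) tuples as RTTM SPEAKER lines."""
    lines = []
    # Sort by onset time, then by speaker index, for a stable, readable file.
    for spk, start, end in sorted(segments, key=lambda s: (s[1], s[0])):
        duration = end - start
        lines.append(
            f"SPEAKER {file_id} 1 {start:.3f} {duration:.3f} "
            f"<NA> <NA> spk{spk} <NA> <NA>"
        )
    return "\n".join(lines)


# Two speakers overlapping between 0.5 s and 1.0 s.
print(segments_to_rttm("rec1", [(1, 0.5, 2.0), (0, 0.0, 1.0)]))
```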
I'm wondering if there is a function, as simple as possible, to do this, for example: I looked at the train_dia_pred code, and it is way too complicated and coupled with all kinds of training code. Would you consider making simple inference-only code that users can use out of the box?
Sure, I understand your point. Thank you for your suggestion. We will simplify the inference code and update it in the repo.
Thank you so much for the consideration! Hoping for a strong base diarization model with overlap support that can be used with ease.
Hello, I'd like to use this model in a scenario where the audio has already been segmented by VAD, but a segment might contain more than one speaker. Is this model able to handle that?
In detail, like:
They are different speakers, but VAD cannot separate them. I need their individual voices.
If it can, is there a simple code snippet I can refer to?
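As a rough sketch of what "individual voices" could look like once diarization timestamps are available, the snippet below cuts one speaker's time ranges out of a waveform. Everything here is hypothetical and not part of the repository: the function name `cut_speaker_audio`, the `(speaker, start_sec, end_sec)` tuple layout, and the `skip_overlap` flag. Note that diarization only tells you who speaks when; in overlapped regions the cut audio still contains both voices, which is why the sketch offers the option to drop those regions.

```python
from typing import List, Sequence, Tuple


def cut_speaker_audio(
    samples: Sequence[float],
    sample_rate: int,
    segments: Sequence[Tuple[int, float, float]],  # (speaker, start_sec, end_sec)
    speaker: int,
    skip_overlap: bool = False,
) -> List[float]:
    """Concatenate the samples that fall inside one speaker's segments."""
    others = [(s, e) for spk, s, e in segments if spk != speaker]
    out: List[float] = []
    for spk, start, end in segments:
        if spk != speaker:
            continue
        lo = int(round(start * sample_rate))
        hi = min(int(round(end * sample_rate)), len(samples))
        for i in range(lo, hi):
            t = i / sample_rate
            # Optionally drop samples where another speaker is also active.
            if skip_overlap and any(s <= t < e for s, e in others):
                continue
            out.append(samples[i])
    return out


# Toy waveform: 8 samples at 4 Hz, speakers overlap between 0.5 s and 1.0 s.
wave = list(range(8))
segs = [(0, 0.0, 1.0), (1, 0.5, 2.0)]
print(cut_speaker_audio(wave, 4, segs, speaker=0))
print(cut_speaker_audio(wave, 4, segs, speaker=0, skip_overlap=True))
```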