Vowels and consonants in viIPA. #2

Open
drlor2k opened this issue Jul 22, 2024 · 3 comments

Comments


drlor2k commented Jul 22, 2024

Hello @v-nhandt21, I have a question; please help me!

Based on the following quote:

A universal monosyllabic phoneme system has "C(m)-V-C(n)" (m,n >= 0) phoneme patterns.

Based on the MFA author's dictionary I was able to find a list of vowels and consonants.

With viIPA, do you have a specific list for vowels and consonants?
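To make the quoted "C(m)-V-C(n)" pattern concrete, here is a minimal sketch that splits one syllable's phonemes into onset consonants, the single vowel, and coda consonants. The vowel set and the `split_cvc` helper are hypothetical placeholders for illustration, not the actual viIPA inventory or any library's API.

```python
# Hypothetical vowel inventory for illustration only; NOT the real viIPA list.
SHORT_O = "ɤ\u0306"  # "ɤ" followed by a combining breve
EXAMPLE_VOWELS = {"a", "e", "i", "o", "u", SHORT_O}


def split_cvc(phonemes, vowels=EXAMPLE_VOWELS):
    """Split one syllable into (onset, vowel, coda) following C(m)-V-C(n).

    Assumes the syllable contains exactly one vowel token; raises otherwise.
    """
    vowel_positions = [i for i, p in enumerate(phonemes) if p in vowels]
    if len(vowel_positions) != 1:
        raise ValueError(f"expected exactly one vowel, got {len(vowel_positions)}")
    i = vowel_positions[0]
    # Onset has m >= 0 consonants, coda has n >= 0 consonants.
    return phonemes[:i], phonemes[i], phonemes[i + 1:]


onset, vowel, coda = split_cvc(["b", SHORT_O, "n"])
print(onset, vowel, coda)
```

With a real vowel list in place of `EXAMPLE_VOWELS`, everything that is not a vowel or a tone label would fall out as a consonant.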

@v-nhandt21
Owner

Yep, for viphoneme you can get the list of phonemes from the package: https://pypi.org/project/viphoneme/1.0.5/

image

Author

drlor2k commented Jul 23, 2024

Hi @v-nhandt21, I have some questions:

  1. Each phonetic word in your IPA system has only one vowel, right?
  2. If I merge the tone with the vowel to create another viIPA.txt (e.g. bận -> b ɤ̆ n 6 -> b ɤ̆6 n), will the whole viMFA process be affected?
  3. The MFA author's Vietnamese model has a fairly limited vocabulary, so I want to train on my own dataset. I see that version 3.0 from the original author was trained on about 40 hours of data, but my dataset is only about 6 hours. Does the dataset size affect the final TextGrid quality?

Thank you :3

Owner

v-nhandt21 commented Jul 24, 2024

Hi @drlor2k,

  1. Yep :))
  2. I think "b ɤ̆6 n" would be more effective than "b ɤ̆ n 6".
  3. I am not sure about the size and quality of your dataset, hihi, so try to use as much data as possible.
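For reference, the tone merge discussed in point 2 could be sketched roughly as below. This is only an illustration under assumptions: the `merge_tone` helper is hypothetical, tone labels are assumed to be the single digits 1 to 6 (only "6" appears in this thread), and the vowel set must be supplied by the caller.

```python
def merge_tone(pronunciation, vowels, tones=frozenset("123456")):
    """Attach the tone label to the vowel, e.g. "b V n 6" -> "b V6 n".

    Assumes space-separated phonemes with exactly one vowel and one tone token.
    The digit tone labels are an assumption, not a confirmed viIPA convention.
    """
    tokens = pronunciation.split()
    tone_tokens = [t for t in tokens if t in tones]
    if len(tone_tokens) != 1:
        raise ValueError(f"expected exactly one tone, got {len(tone_tokens)}")
    # Drop the standalone tone token, then glue it onto the vowel.
    phonemes = [t for t in tokens if t not in tones]
    vowel_positions = [i for i, p in enumerate(phonemes) if p in vowels]
    if len(vowel_positions) != 1:
        raise ValueError(f"expected exactly one vowel, got {len(vowel_positions)}")
    i = vowel_positions[0]
    phonemes[i] = phonemes[i] + tone_tokens[0]
    return " ".join(phonemes)


SHORT_O = "ɤ\u0306"  # "ɤ" followed by a combining breve
print(merge_tone(f"b {SHORT_O} n 6", vowels={SHORT_O}))
```

Applied to every line of a pronunciation file, this turns each vowel-plus-tone pair into a separate phone symbol, so the phone inventory grows with the number of vowel and tone combinations.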
