Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dogtooth dilemma #18

Open
m0rphed opened this issue Dec 6, 2024 · 0 comments
Open

dogtooth dilemma #18

m0rphed opened this issue Dec 6, 2024 · 0 comments

Comments

@m0rphed
Copy link

m0rphed commented Dec 6, 2024

Wrong phonetic IPA transcription of word dogtooth (probably because it's missing in lexicon.db)
image

When running:

from gruut import sentences

text_1 = """
She was wearing a dogtooth check,
two-piece suit which she had made herself,
red shoes and a small red handbag.    
"""

for sent in sentences(
    text_1,
    lang="en-us",
    minor_breaks=False,
    major_breaks=False,
    punctuations=False,
    break_phonemes=False,
    espeak=True
):
    for word in sent:
        if word.phonemes:
            print(word.text, "".join(word.phonemes), sep="; ")
            print("\t=>", *word.phonemes) 

Dogtooth transcribes to "dˈɑːtˈuːθ", however espeak-ng returns:

espeak-ng -v en --ipa -q "She was wearing a dogtooth check, two-piece suit which she had made herself, red shoes and a small red handbag."
ʃiː wɒz wˈeəɹɪŋ ɐ dˈɒɡtuːθ tʃˈɛk
tˈuːpˈiːs sˈuːt wˌɪtʃ ʃiː had mˌeɪd hɜːsˈɛlf
ɹˈɛd ʃˈuːz and ɐ smˈɔːl ɹˈɛd hˈandbaɡ

-- dˈɒɡtuːθ which is suppose to be correct pronunciation for en-us

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant