Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using an African Language corpus #2

Open
skaniaru opened this issue Nov 30, 2021 · 1 comment
Open

Using an African Language corpus #2

skaniaru opened this issue Nov 30, 2021 · 1 comment

Comments

@skaniaru
Copy link

Hey @tosingithub, I'm currently working on a school project about chatbots in Africa. How exactly did you go about building your chatbot using an African Language corpus?

@tosingithub
Copy link
Contributor

HI @skaniaru, We currently have about 7 African languages in this project and it was almost impossible getting conversation data for most so we translated the MultiWOZ dataset (1,500 turns - including training, 250 dev and 250 test sets) for each, except Yoruba, which we got data for, largely. The model used is DialoGPT for open domain conversational agents. So transfer learning was used to fine-tune the models. We will be reporting the results soon and publishing the paper. We haven't finished yet and we're also planning on extending the work to 15 languages.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants