Hi @KryxoLV, thanks for your request. If you want to detect the language of individual botanical terms, adding all of those terms to some kind of dictionary is not the right approach. My library does not work like that. It determines the language mainly by calculating statistics on the distribution of letter combinations (n-grams) in a text.
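To make the n-gram idea concrete, here is a minimal sketch in Python. It is not the library's actual code or API: the function names (`char_ngrams`, `train`, `detect`), the choice of trigrams, the add-one smoothing, and the tiny word lists are all assumptions made for illustration only.

```python
import math
from collections import Counter

N = 3  # trigram size; an arbitrary choice for this sketch


def char_ngrams(text, n=N):
    """Split text into overlapping character n-grams, padded with spaces."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def train(samples, n=N):
    """Count how often each n-gram occurs in the training samples."""
    counts = Counter()
    for sample in samples:
        counts.update(char_ngrams(sample, n))
    return counts


def log_likelihood(text, counts, vocab_size, n=N):
    """Sum of log probabilities of the text's n-grams, with add-one smoothing."""
    total = sum(counts.values())
    return sum(
        math.log((counts[gram] + 1) / (total + vocab_size))
        for gram in char_ngrams(text, n)
    )


def detect(text, models, n=N):
    """Return the language whose n-gram statistics fit the text best."""
    # Shared vocabulary size (+1 for unseen n-grams) keeps scores comparable.
    vocab_size = len(set().union(*models.values())) + 1
    scores = {
        lang: log_likelihood(text, counts, vocab_size, n)
        for lang, counts in models.items()
    }
    return max(scores, key=scores.get)


# Toy training data, hypothetical and far too small for real use.
models = {
    "lv": train(["parastā priede", "baltā ābele", "meža zemene",
                 "purva bērzs", "lielā nātre"]),
    "la": train(["pinus sylvestris", "betula pubescens", "fragaria vesca",
                 "urtica dioica", "malus domestica"]),
}

print(detect("betula pendula", models))
print(detect("purva priede", models))
```

The point of the sketch is that the result depends on how typical a word's letter combinations are for each language, not on whether the exact word was seen before. Adding more terms therefore only helps to the extent that it shifts these statistics.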
Hello, maybe someone can explain to me how to update the existing language models? I am writing a bachelor's thesis on botanical terms in Latvian, and I have already added a couple of thousand of these botanical terms to the .txt file, but what should I do next to teach the AI how to tell them apart? I have added 1500 lines of text to "Lv.txt" as well as "la.txt". How can I rebuild the language models?