-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorrect processing result for keywords having symbols #5
Comments
Thank you for reporting the issue. I'm in the first stage of troubleshooting the problem. It seems to be caused by function update_next_words_indices, which returns a wrong list of next words to be parsed. I will keep this issue updated when I have any new findings. |
From my side,
Take
However, for As I'm very busy with my study in this period, I won't be able to fix this bug anytime soon in ~1 month. |
This is considered a major development for the library, which I wouldn't be able to do this in the near future, due to a tight schedule as a last-year student. A suggestion on how to fix is to create a separated wordlist for |
Can't you just run the check on the text first, then if there is no detect, use regex to remove duplicates, and try again? |
Use word "s&m", "s & m", "2 girls 1 cups" ... to run
profanity.censor
with the default config got the incorrect result.for example:
why ?
The text was updated successfully, but these errors were encountered: