Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Generate alternative typos with a translation table
This way we can catch misspellings with alternative characters, typically typographic apostrophe or acute accent U+2019 (´) instead of typewriter apostrophe U+0027 ('). In this case, the alternative character is a valid character and will be used both in the misspelling and the fix(es). The above is different from detecting Unicode phishing, where some characters like `A` are intentionally, or not, replaced by lookalikes such as `A`, `Α`, `А`, `ᗅ`, `ᴀ`, `A`. In that case, the alternative character is invalid and should be replaced by its valid counterpart in the fix. We do not address that case here.
- Loading branch information