Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider separating words based on /\W/ instead of whitespace. #65

Open
lewang opened this issue Sep 21, 2014 · 2 comments
Open

Consider separating words based on /\W/ instead of whitespace. #65

lewang opened this issue Sep 21, 2014 · 2 comments

Comments

@lewang
Copy link
Owner

lewang commented Sep 21, 2014

Consider:

(flx-get-heatmap-str "*group*") ; [84 -2 -3 -4 -5 -6 82]

The first "*" is considered the beginning of the word instead of "g". This should be changed to group non-word non-space characters together as words.

See: #63

@jeancroy
Copy link

The problem with \w is that in most implementation it is not unicode friendly.
You'll split on accented letters like é, û, à, and probably any non latin character.

whitelist of a few separator will be more unicode friendly.

@PythonNut
Copy link
Collaborator

The problem with \w is that in most implementation it is not unicode friendly.
You'll split on accented letters like é, û, à, and probably any non latin character.

Emacs has excellent Unicode support, and handles non-latin characters correctly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants