Compile liblouis with 32 bit widechars #9544

LeonarddeR · 2019-05-07T14:39:13Z

Link to issue number:

Summary of the issue:

Liblouis currently uses a 2 byte encoding to process braille. This is pretty annoying when displaying emoji, as they are 32 bit unicode characters. For example, 😉 is usually printed as '\xd83d''\xde09'.

More importantly though, using a 2 byte encoding with Python 3 is subject to break things in a major way. The braille module uses brailleToRawPos and rawToBraillePos to mape braille characters to real characters. In python 2, unicode strings are internally saved with a two byte encoding. Therefore, 32 bit unicode characters take two indexes or offsets in a string. In python 3, one index/offset corresponds with a code point. Liblouis 2 byte wide characters played pretty nicely with Python 2 unicode strings, but with 16 bit wide characters on python 3, the rawToBraillePos and brailleToRawPos mappings do no longer match, as liblouis reads 😉 as two characters whether Python 3 reads them as one.

Description of how this pull request fixes the issue:

This compiles liblouis with 32 bit wide characters instead of 16. This means only one replacement pattern is printed for 32 bit characters instead of two, and it also should ensure that brailleToRawPos and rawTobraillePos mappings are correct, as both Python 3 and Liblouis UCS4 assume that all characters in the wild only take one offset in a string.

Testing performed:

This pr is pretty theoretically. Testing can be performed as soon as #9543 is merged. Therefore, I will mark this a draft until that's the case.

Known issues with pull request:

None known as of yet

Change log entry:

Bug fixes
+Emoji and other 32 bit unicode characters now take less space on a braille display when they are shown as hexadecimal values. (Compiling Liblouis with 32-bit Unicode support #6695)

Adriani90 · 2019-05-07T14:55:30Z

cc: @DrSooom you have spent lot of work improving displaying of unicode characters. I think your thoughts are here also very apreciated.

DrSooom · 2019-05-08T02:43:10Z

@Adriani90: Thanks for the notification. It seems that @LeonarddeR split PR #9044 into PR #9544 and #9545 and updated both.

@LeonarddeR: Please also see #8702 and liblouis/liblouis#730.

+Emoji and other 32 bit unicode characters now take less space on a braille display when they are undefined in a braille translation table. (#6695)

This isn't fully correct due to the definition (e.g. "undefined 0") in some braille tables. Undefined Unicode characters can also be displayed just as ⠀ (dot 0). Please read the HUC Braille Tables documentation for further details.

The one and only thing I have to know here is if I have to change all yhhhhh definitions to zhhhhhhhh definitions in the HUC Braille Tables. But these replacements are done quite quickly – in compare of the whole creating process of the HUC Braille Tables. Well, after NVDA fully supports UTF-32 characters I have to update the HUC Braille Tables documentation as well, because it references to NVDA 2019.1 yet.

Personally I really want to see the UTF-32 support in NVDA, because by using the HUC Braille Tables the amount of necessary braille characters for an undefined Unicode character between U+10000 and U+10FFFF is reduced from 16 to 3 8-dot braille characters. That would be great.

PS: In less than four hours I'm sitting in the train to the SightCity 2019 where I'm going to inform some people about the existence of the HUC Braille Tables. You can find A7 handouts ((cc) by-sa in EN and DE) regarding the HUC Braille Tables here on my website.

LeonarddeR · 2019-05-08T07:41:47Z

@LeonarddeR: Please also see #8702 and liblouis/liblouis#730.

These are out of scope for this pr. This pr aims at fixing braille issues introduced when switching to Python 3, nothing more than that.

+Emoji and other 32 bit unicode characters now take less space on a braille display when they are undefined in a braille translation table. (#6695)

Thanks, I will fix this entry.

nvdaHelper/liblouis/sconscript

source/NVDAObjects/IAccessible/__init__.py

michaelDCurran · 2019-05-31T07:53:14Z

Fair enough. I understand now. Leave this change in.

LeonarddeR · 2019-06-13T11:18:27Z

I changed the base branch to threshold_py3_staging. I think threshold_py3_staging is now in a state where it might even need this for braille unit tests to pass at some point.

Leonard de Ruijter added 2 commits May 7, 2019 16:19

Build liblouis with 32-bit unicode support (UCS-4)

5818e9b

Remove the /WX flag when compiling liblouis

85c89b0

Fix character offsets for IA2

e6ab975

LeonarddeR requested a review from michaelDCurran May 30, 2019 19:22

michaelDCurran requested changes May 31, 2019

View reviewed changes

nvdaHelper/liblouis/sconscript Outdated Show resolved Hide resolved

source/NVDAObjects/IAccessible/__init__.py Show resolved Hide resolved

Leonard de Ruijter added 2 commits May 31, 2019 08:49

Merge branch 'threshold' into liblouis-ucs4

0740a3f

Review action

8085fed

Leonard de Ruijter added 2 commits June 7, 2019 08:16

Merge remote-tracking branch 'origin/threshold' into liblouis-ucs4

04c7af1

No longer remove /WX flag for liblouis builds

46e5014

LeonarddeR marked this pull request as ready for review June 13, 2019 11:15

LeonarddeR changed the base branch from threshold to threshold_py3_staging June 13, 2019 11:16

LeonarddeR requested a review from michaelDCurran June 13, 2019 11:16

michaelDCurran approved these changes Jun 17, 2019

View reviewed changes

michaelDCurran merged commit c312713 into nvaccess:threshold_py3_staging Jun 17, 2019

nvaccessAuto added this to the 2019.3 milestone Jun 17, 2019

Adriani90 mentioned this pull request Jun 18, 2019

Compiling Liblouis with 32-bit Unicode support #6695

Closed

LeonarddeR deleted the liblouis-ucs4 branch June 20, 2019 07:19

school510587 mentioned this pull request Jun 22, 2019

[Feature Request] Allow optionally enabled rules liblouis/liblouis#664

Closed

josephsl mentioned this pull request Jul 23, 2019

What's new and readme: we are moving to Python 3.7 #9942

Merged

DrSooom mentioned this pull request Jul 26, 2019

Braille display auto-detection feature causes errors on loading large braille tables on NVDA startup #9982

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compile liblouis with 32 bit widechars #9544

Compile liblouis with 32 bit widechars #9544

LeonarddeR commented May 7, 2019 •

edited

Loading

Adriani90 commented May 7, 2019

DrSooom commented May 8, 2019

LeonarddeR commented May 8, 2019

michaelDCurran commented May 31, 2019 via email

LeonarddeR commented Jun 13, 2019

Compile liblouis with 32 bit widechars #9544

Compile liblouis with 32 bit widechars #9544

Conversation

LeonarddeR commented May 7, 2019 • edited Loading

Link to issue number:

Summary of the issue:

Description of how this pull request fixes the issue:

Testing performed:

Known issues with pull request:

Change log entry:

Adriani90 commented May 7, 2019

DrSooom commented May 8, 2019

LeonarddeR commented May 8, 2019

michaelDCurran commented May 31, 2019 via email

LeonarddeR commented Jun 13, 2019

LeonarddeR commented May 7, 2019 •

edited

Loading