Correctly handle UTF-16 offsets #7

fhs · 2019-05-31T19:52:28Z

LSP uses UTF-16 offsets:

A position inside a document (see Position definition below) is expressed as a zero-based line and character offset. The offsets are based on a UTF-16 string representation. So a string of the form a𐐀b the character offset of the character a is 0, the character offset of 𐐀 is 1 and the character offset of b is 3 since 𐐀 is represented using two code units in UTF-16.

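Acme uses rune offsets. Currently, we treat the UTF-16 offsets as rune offsets (and vice versa) to keep the implementation simple. This is wrong for any line containing characters outside the Basic Multilingual Plane, since each such character is one rune but two UTF-16 code units.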
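To make the mismatch concrete, here is a minimal sketch of converting between the two offset kinds within a single line. The function names (utf16ToRuneOffset, runeToUTF16Offset, utf16Len) are hypothetical and not taken from this repository, and the sketch assumes the line's text is already available as a Go string.

```go
package main

import "fmt"

// utf16Len reports how many UTF-16 code units are needed to encode r.
// Runes outside the Basic Multilingual Plane need a surrogate pair.
func utf16Len(r rune) int {
	if r >= 0x10000 {
		return 2
	}
	return 1
}

// utf16ToRuneOffset converts a UTF-16 code unit offset within line
// (as used by LSP) to a rune offset (as used by Acme).
// An offset that falls inside a surrogate pair is rounded up to the
// next rune; an offset past the end is clamped to the rune count.
func utf16ToRuneOffset(line string, u16 int) int {
	runes, units := 0, 0
	for _, r := range line {
		if units >= u16 {
			break
		}
		units += utf16Len(r)
		runes++
	}
	return runes
}

// runeToUTF16Offset converts a rune offset within line to a UTF-16
// code unit offset. An offset past the end is clamped to the line's
// length in UTF-16 code units.
func runeToUTF16Offset(line string, runeOff int) int {
	runes, units := 0, 0
	for _, r := range line {
		if runes >= runeOff {
			break
		}
		units += utf16Len(r)
		runes++
	}
	return units
}

func main() {
	line := "a\U00010400b" // the a𐐀b example from the LSP text above

	fmt.Println(utf16ToRuneOffset(line, 3)) // 2: b is the third rune
	fmt.Println(runeToUTF16Offset(line, 2)) // 3: b starts at UTF-16 offset 3
}
```

Both functions work on one line at a time, which matches how LSP positions are expressed (line plus character offset). Mapping a position to an Acme address would additionally require the rune offset of the start of the line, which is not shown here.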

fhs added the bug Something isn't working label May 31, 2019
