Skip to content
This repository has been archived by the owner on Dec 15, 2022. It is now read-only.

Problem with Unicode support #971

Open
digital-carver opened this issue May 26, 2018 · 2 comments · May be fixed by #972
Open

Problem with Unicode support #971

digital-carver opened this issue May 26, 2018 · 2 comments · May be fixed by #972
Labels

Comments

@digital-carver
Copy link

Description

Auto-completion doesn't work properly when writing Tamil text. This is probably because the unicode-helpers.js file has only the codepoints from the Letter Unicode category, where it should actually have all codepoints that have the Alphabetic property. So for eg. it has 0B95 for க, but not 0BC0 which is a vowel mark that combines with க to make கீ.

To give an idea of how weird this is, Atom gives completion when I type the equivalent of "SaRaGaMa", but not when I type the equivalent Tamil text of "SaReGaMa" - having any vowel other than a (அ) in the prefix disables the autocompletion.

A lot of codepoints for many Indic scripts (and some other Asian scripts) are placed in the M (Mark) categories in Unicode, and then given the Other_Alphabetic property (find for Other_Alphabetic in https://www.unicode.org/Public/UCD/latest/ucd/PropList.txt). This means the current completion probably doesn't work properly for any of those languages. The solution, afaict, is just to add all the Other_Alphabetic codepoints from that page to unicode-helpers.js.

Steps to Reproduce

  1. Type (or paste) எழுத்துக்கள் on the first line of a file
  2. Type (or paste) எழுத்து on the second line

Expected behavior: Completion to எழுத்துக்கள் should appear

Actual behavior: No completion appears - and pressing Ctrl-Space does nothing either

Reproduces how often: 100%

Versions

Version 1.27.1 on Windows 7 64-bit

@digital-carver
Copy link
Author

digital-carver commented May 26, 2018

(This is all after enabling 'Extended Unicode Support' - without checking that option, no completion happens at all; after checking it, completion happens only for 'a' vowel like mentioned in the second para above.

Also, changing between the Sequence and Symbol modes doesn't seem to make any difference. )

@digital-carver digital-carver linked a pull request May 27, 2018 that will close this issue
@rsese
Copy link

rsese commented May 31, 2018

Thanks for the report! I can reproduce with your steps on macOS 10.12.6 and Atom 1.29.0-dev-e31c972d3.

@rsese rsese added the triaged label May 31, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants