Created attachment 230398 [details]
Turkish test data

In a case-insensitive search (Ctrl+F) in the Orion Editor:
  ı (dotless small i) does not match I (dotless capital i)
  İ (dotted capital i) does not match i (dotted small i)

Steps:
1. Create a text in the Orion Editor (please copy and paste from the attached text).
2. Search for the Turkish dotless small i "ı" => it does not hit "I".

cf. In the Eclipse editor, "ı" hits both the dotted and dotless forms: "ı I i İ".
IIRC we rely on the browser's native RegExps for dealing with case sensitivity, and RegExp doesn't perform any intricate i18n case mapping. For example:

> new RegExp("İ", "i" /*case insensitive*/).exec("i") // null

We'd have to move to something like [1] to get the correct behavior for non-regex searches.

[1] http://ecma-international.org/ecma-402/1.0/
Actually, I guess JavaScript does have what we need already, independent of ECMA-402. We'll have to fix the editor Find implementation to use locale-aware methods of String.
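For illustration, a minimal sketch of what a locale-aware plain-text find could look like, using String.prototype.toLocaleLowerCase with an explicit locale. The helper name findInsensitive and its signature are hypothetical, not Orion's API, and the locale argument only has an effect in engines that ship locale data for it.

```javascript
// Hypothetical helper (not Orion's API): case-insensitive plain-text find
// that lowercases both strings with locale-aware rules before comparing.
// `locale` is a BCP 47 language tag such as "tr" or "en".
// Caveat: lowercasing can change a string's length in some locales
// (for example "İ" outside Turkish), so the returned index refers to the
// lowercased text and may need mapping back to the original.
function findInsensitive(text, query, locale) {
  var haystack = text.toLocaleLowerCase(locale);
  var needle = query.toLocaleLowerCase(locale);
  return haystack.indexOf(needle);
}
```

With the Turkish locale, "I" lowercases to "ı" and "İ" to "i", so a search for "ı" finds "I" but not "i". With an English locale, "I" lowercases to "i" instead, which is why the result depends on knowing the user's language.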
One approach is to convert everything to lowercase, but this has performance implications for large files. There doesn't seem to be anything else that the browser RegExp engine can do to help.
We released a workaround for this problem; please give it a try.
Hi, I tested on I20130530-2250. The behavior has changed, but it is still not correct for case-insensitive searches with "i" and "I".

Result on I20130530-2250:

search word (case insensitive) | ı | I | i | İ
===============================================
ı (dot less small i)           | o | o | o |
I (dot less capital i)         |   | o | o |
i (dot small i)                |   | o | o |
İ (dot capital i)              |   | o | o | o

Expected result (ideal):

search word (case insensitive) | ı | I | i | İ
===============================================
ı (dot less small i)           | o | o |   |
I (dot less capital i)         | o | o |   |
i (dot small i)                |   |   | o | o
İ (dot capital i)              |   |   | o | o

On IES4.3:

search word (case insensitive) | ı | I | i | İ
===============================================
ı (dot less small i)           | o | o | o | o
I (dot less capital i)         | o | o | o | o
i (dot small i)                | o | o | o | o
İ (dot capital i)              | o | o | o | o

If the ideal result is difficult to implement, I think it is OK to follow the IES result.
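As a rough sketch of how the IES4.3 result above (all four forms of i matching each other) could be obtained with the native RegExp engine: expand every i variant in the search word into one character class. This is an assumption for illustration only, not necessarily how any Orion patch works; escapeRegExp and buildInsensitivePattern are hypothetical helper names.

```javascript
// Escape RegExp metacharacters so the search word is matched literally.
function escapeRegExp(s) {
  return s.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Hypothetical helper: build a case-insensitive pattern in which
// ı, I, i and İ are treated as one equivalence class.
function buildInsensitivePattern(query) {
  var source = "";
  for (var k = 0; k < query.length; k++) {
    var ch = query.charAt(k);
    // Replace any of the four i forms with a class listing all of them.
    source += /[ıIiİ]/.test(ch) ? "[ıIiİ]" : escapeRegExp(ch);
  }
  return new RegExp(source, "i");
}
```

Because the class is language independent, English text still matches as before (i and I match each other), at the cost of also matching the Turkish forms that the ideal table keeps apart.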
OK, we released a patch to match the IES behavior. As you know, the IES behavior is not strictly correct: dotless i should match only I, and i should match only capital dotted I. Unfortunately, we aren't able to implement the real behavior without first knowing the language of the browser (well, we can, but it breaks search for English text). We looked at various ways of determining the browser language and weren't able to find a method that works for all browsers on all platforms.

For future reference, one method that others use, and that could possibly work, is to add some sort of backend support, since all browsers send the language as part of the request header. Perhaps there is some way for the file client to forward that language info along with the actual file contents.
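To make the backend idea concrete, here is a small sketch of picking the preferred language out of an Accept-Language header value, which is the standard HTTP request header carrying the browser's language preferences. How the value would reach the editor is left open; primaryLanguage is a hypothetical helper, not part of Orion.

```javascript
// Hypothetical helper: return the language tag with the highest q-value
// from an Accept-Language header value, or null if none is usable.
// Example input: "tr-TR,tr;q=0.9,en;q=0.8"
function primaryLanguage(acceptLanguage) {
  var best = null;
  var bestQ = -1;
  acceptLanguage.split(",").forEach(function (part) {
    var pieces = part.trim().split(";");
    var tag = pieces[0].trim();
    var q = 1; // a missing q parameter means weight 1
    for (var k = 1; k < pieces.length; k++) {
      var m = /^\s*q=([0-9.]+)\s*$/.exec(pieces[k]);
      if (m) {
        q = parseFloat(m[1]);
      }
    }
    // Skip empty entries and the "*" wildcard; keep the first best match.
    if (tag && tag !== "*" && q > bestQ) {
      best = tag;
      bestQ = q;
    }
  });
  return best;
}
```

A tag such as "tr-TR" or "tr" returned here could then select Turkish case mapping, with the language-independent matching kept as the fallback when no language is known.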
verified with I20130606-2230