
Bug 343716

Summary: DBCS4.1: High surrogate characters are not found by regular expression \uXXXX
Product: [Eclipse Project] Platform
Reporter: Masaihko Maedera <maedera>
Component: Search
Assignee: Platform-Search-Inbox <platform-search-inbox>
Status: RESOLVED NOT_ECLIPSE
QA Contact:
Severity: normal
Priority: P3
CC: daniel_megert, pwebster
Version: 4.1   
Target Milestone: ---   
Hardware: PC   
OS: Linux   
Whiteboard:
Attachments:
Description: uD842_uDF9F.txt, encoded with UTF-8
Flags: none

Description Masaihko Maedera CLA 2011-04-25 02:56:44 EDT
Build Identifier: I20110419-2200

OS: Red Hat Linux 6
Eclipse SDK 4.1
JRE: IBM Java 1.6.0

High surrogate characters cannot be found with the regular expression \uXXXX
from either the File Search dialog or the Find/Replace dialog.
Low surrogate characters, on the other hand, can be found.
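
For reference, the relationship between a supplementary character and its surrogate pair can be sketched as below. This assumes the attachment holds the single character whose UTF-16 form is the pair D842 DF9F, which is inferred only from the file name; the class name SurrogatePair is hypothetical.

```java
public class SurrogatePair {
    // Assumed content of the attachment, inferred only from its file name.
    static final String SAMPLE = "\uD842\uDF9F";

    // The code point that the two UTF-16 code units encode together.
    static int codePoint() {
        return SAMPLE.codePointAt(0);
    }

    public static void main(String[] args) {
        // One character as the user sees it, but two UTF-16 code units.
        System.out.printf("UTF-16 units: %d%n", SAMPLE.length());
        System.out.printf("code points:  %d%n",
                SAMPLE.codePointCount(0, SAMPLE.length()));
        System.out.printf("code point:   U+%X%n", codePoint());

        // Split the code point back into its high and low surrogates.
        char[] units = Character.toChars(codePoint());
        System.out.printf("high: %04X  low: %04X%n",
                (int) units[0], (int) units[1]);
    }
}
```

A pattern such as \uD842 therefore targets only the first half of one character, which is why the high and low halves can behave differently in a regex engine.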

Reproducible: Always

Steps to Reproduce:
1. Import uD842_uDF9F.txt, attached to this bug report, encoded with UTF-8.
2. [Search] -> [Search] -> [File Search], then check Regular expressions.
3. Type "\uD842" as "Containing text:".
4. Click [Search]. The character is not found.
or
5. Open uD842_uDF9F.txt.
6. [Edit] -> [Find/Replace], then check Regular expressions.
7. Type "\uD842" as "Find:".
8. Click [Find]. The character is not found.
Comment 1 Masaihko Maedera CLA 2011-04-25 02:59:33 EDT
Created attachment 193979
uD842_uDF9F.txt, encoded with UTF-8
Comment 2 Dani Megert CLA 2011-04-25 11:49:20 EDT
We use the search support from the JRE. Please file the bug against the JRE you are using.
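
A minimal sketch of how the behavior could be checked directly against java.util.regex, outside Eclipse, before filing against the JRE. It assumes the attachment contains the single character encoded as the surrogate pair D842 DF9F (inferred from the file name); the class name SurrogateRegexProbe and the helper finds are hypothetical. Results for the lone-surrogate patterns may differ between JRE vendors and versions, so none are stated here.

```java
import java.util.regex.Pattern;

public class SurrogateRegexProbe {
    // Assumed content of the attachment, inferred only from its file name.
    static final String SAMPLE = "\uD842\uDF9F";

    // Returns whether the JRE's regex engine finds the pattern in SAMPLE.
    static boolean finds(String regex) {
        return Pattern.compile(regex).matcher(SAMPLE).find();
    }

    public static void main(String[] args) {
        // Record which JRE produced the results, for the JRE bug report.
        System.out.println("java.vendor  = " + System.getProperty("java.vendor"));
        System.out.println("java.version = " + System.getProperty("java.version"));

        // High surrogate alone, low surrogate alone, then the full pair.
        String[] patterns = {"\\uD842", "\\uDF9F", "\\uD842\\uDF9F"};
        for (String regex : patterns) {
            System.out.println(regex + " -> " + finds(regex));
        }
    }
}
```

If the standalone program shows the same asymmetry between \uD842 and \uDF9F, the behavior comes from the JRE rather than from the search or find/replace dialogs.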