HCL Compass character exclusion lists

These tables list, for each applicable data code page, the set of characters included in the corresponding Microsoft™ Windows™ code page but excluded from the HCL Compass data code page.

Table 1. Data code page 949 (Korean) character exclusion list
Unicode value  Character description
0x007f         delete
0x0080         unassigned
0x00ad         soft hyphen
0x00b7         middle dot
0x2015         horizontal bar
0x223c         tilde operator
0x2299         circled dot operator
0xff5e         fullwidth tilde

Table 2. Data code page 950 (Traditional Chinese) character exclusion list
Unicode value  Character description
0x0080         unassigned
0x00af         macron
0x2013         en dash
0x2223         divides
0x2551         box drawings double vertical
0x2552         box drawings down single and right double
0x2553         box drawings down double and right single
0x2554         box drawings double down and right
0x2555         box drawings down single and left double
0x2556         box drawings down double and left single
0x2557         box drawings double down and left
0x2558         box drawings up single and right double
0x2559         box drawings up double and right single
0x255a         box drawings double up and right
0x255b         box drawings up single and left double
0x255c         box drawings up double and left single
0x255d         box drawings double up and left
0x255f         box drawings vertical double and right single
0x2560         box drawings double vertical and right
0x2562         box drawings vertical double and left single
0x2563         box drawings double vertical and left
0x2564         box drawings down single and horizontal double
0x2565         box drawings down double and horizontal single
0x2566         box drawings double down and horizontal
0x2567         box drawings up single and horizontal double
0x2568         box drawings up double and horizontal single
0x2569         box drawings double up and horizontal
0x256b         box drawings vertical double and horizontal single
0x256c         box drawings double vertical and horizontal
0x2593         dark shade
0x58bb         CJK unified ideograph
0x5afa         CJK unified ideograph
0x5f5d         CJK unified ideograph
0x6052         CJK unified ideograph
0x7881         CJK unified ideograph
0x7ca7         CJK unified ideograph
0x88cf         CJK unified ideograph
0x92b9         CJK unified ideograph
0xff5e         fullwidth tilde

Table 3. Data code page 1253 (Greek) character exclusion list
Unicode value  Character description
0x00aa         unassigned

Table 4. Data code page 1255 (Hebrew) character exclusion list
Unicode value  Character description
0x00a1         inverted exclamation mark
0x00b8         cedilla
0x00bf         inverted question mark
0x00d7         multiplication sign
0x00f7         division sign
0x05f3         Hebrew punctuation geresh
0x05f4         Hebrew punctuation gershayim

Table 5. Data code page 1257 (Baltic) character exclusion list
Unicode value  Character description
0x00a8         diaeresis
0x00af         macron
0x00b4         acute accent
0x00b8         cedilla
0x02c7         caron
0x02d9         dot above
0x02db         ogonek
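
One way to apply these lists is to screen text for excluded characters before it is submitted. The following Python sketch is illustrative only: `EXCLUDED` and `find_excluded` are hypothetical names that are not part of HCL Compass, and only the code points from Tables 4 and 5 are copied in. The other tables could be added in the same way.

```python
# Hypothetical helper, not an HCL Compass API.
# Code points are copied from Table 4 (1255, Hebrew) and Table 5 (1257, Baltic).
EXCLUDED = {
    1255: {0x00a1, 0x00b8, 0x00bf, 0x00d7, 0x00f7, 0x05f3, 0x05f4},
    1257: {0x00a8, 0x00af, 0x00b4, 0x00b8, 0x02c7, 0x02d9, 0x02db},
}


def find_excluded(text, code_page):
    """Return (index, 'U+XXXX') for each character of text on the exclusion list."""
    excluded = EXCLUDED[code_page]
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if ord(ch) in excluded
    ]


# The multiplication sign (0x00d7) is on the 1255 list.
print(find_excluded("3 \u00d7 4", 1255))  # [(2, 'U+00D7')]
```

The sketch only checks membership in the exclusion list. It does not verify that the remaining characters are otherwise valid in the chosen data code page.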