https://bugs.freedesktop.org/show_bug.cgi?id=64173
--- Comment #3 from Patrick Ohly <patrick.ohly(a)gmx.de> ---
(In reply to comment #1)
Here are four names, one per line:
Adams
Jeffries
江
Meadows
江 has Jiang as its Pinyin representation, so a collation based on Pinyin should
sort as shown above (江 = Jiang after Jeffries and before Meadows). At least
that's my understanding.
A Chinese colleague confirmed that this is indeed what he expects.
From the icu-support mailing list:
-----------------------
From: Mark Davis ☕ <mark(a)macchiato.com>
Reply-to: ICU support mailing list <icu-support(a)lists.sourceforge.net>
To: ICU support mailing list <icu-support(a)lists.sourceforge.net>
Subject: Re: [icu-support] pinyin sorting in zh_CN.UTF-8
Date: Mon, 13 May 2013 13:02:11 +0200
People have different expectations for pinyin. Some possibilities are:
1. Sort Chinese characters in pinyin order, but separate from Latin.
2. Sort them interleaved with Latin, by the first character.
3. Sort them fully interleaved with Latin.
For #2, the easiest way to do it is with the Alphabetic index. For
#3, the best is to use a Han-Latin transliterator to get a key, then sort by
that key.
------------------------
We now know that ICU implements option 1, so producing the expected order
will be more work. We also need to determine whether #2 or #3 is expected.
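
To make the difference between option 1 and option 3 concrete, here is a
minimal Python sketch. It does not use ICU: the PINYIN table is a hypothetical
stand-in for a real Han-Latin transliterator and only covers the one character
from this report, and placing the Latin group before the Han group in option 1
is an arbitrary choice made for the illustration. Option 2 is not covered.

```python
# Hypothetical lookup table standing in for a real Han-Latin transliterator;
# it only knows the single character used in this bug report.
PINYIN = {"江": "jiang"}


def is_han(ch):
    # Rough check: CJK Unified Ideographs block only.
    return "\u4e00" <= ch <= "\u9fff"


def latin_key(name):
    # Replace each known Han character with its Pinyin, then lowercase.
    return "".join(PINYIN.get(ch, ch) for ch in name).lower()


def sort_separate(names):
    # Option 1: Han names in Pinyin order, kept apart from Latin names.
    # Assumption for this sketch: the Latin group comes first.
    return sorted(names, key=lambda n: (is_han(n[0]), latin_key(n)))


def sort_interleaved(names):
    # Option 3: fully interleaved, everything sorted by the Latin key.
    return sorted(names, key=latin_key)


names = ["Meadows", "江", "Adams", "Jeffries"]
print(sort_separate(names))
print(sort_interleaved(names))
```

With the four names above, sort_interleaved() gives the order requested in
this report (江 between Jeffries and Meadows), while sort_separate() moves 江
out of the Latin sequence.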