[sc34wg3] Re: SAM issue: string-normalization

Lars Marius Garshol sc34wg3@isotopicmaps.org
28 Apr 2003 17:41:43 +0200

* Patrick Durusau
| Apologies for dropping this thread!

No worries. This list hasn't exactly been easy to keep up with.
| It is not a decomposition issue but compatibility-equivalent
| characters.

Ah, I see. Well, if we require one of the NFK forms we should
explicitly state that, but I don't think we can. It's pretty heavy,
and as the text you cite notes it also erases useful distinctions.
So I think that one is beyond what we can require.

| If there is a consensus by those who represent users in that domain
| on this position, I have no strong objections to keeping the prior
| decision. I noticed the possible problem when doing research on a
| TEI project and did not want the SAM to require a form of
| normalization that could have an adverse impact on some users.

I share that concern, actually. I would have strongly recommended
against requiring a particular normalization form had it not been for
the interaction with the sort order. Even so I am not sure if we made
the right decision with the sort order, but I'd like to think that one
through a little more before recommending anything.

Lars Marius Garshol, Ontopian         <URL: http://www.ontopia.net >
GSM: +47 98 21 55 50                  <URL: http://www.garshol.priv.no >