emacs-pretest-bug
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: case-table functions clobbering extra slots


From: Kenichi Handa
Subject: Re: case-table functions clobbering extra slots
Date: Mon, 31 Jan 2005 09:21:28 +0900 (JST)
User-agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/21.3.50 (sparc-sun-solaris2.6) MULE/5.0 (SAKAKI)

In article <address@hidden>, Dave Love <address@hidden> writes:

> Kenichi Handa <address@hidden> writes:
>>  Downcase of dotted-I is dotted-i (`i'), but, in
>>  lang. env. other than Turkish, upcase of dotted-i must be
>>  dotless-I (`I').  So we need
>>    (set-downcase-syntax dotted-I dotted-i table)

> Do you mean that what I did with the existing functions isn't right?

Yes, if you are talking about what you did for Turkish
lang. env. in emacs-unicode.  setup-fucntion is ok, but
exit-function should not make dottless-i and I-with-dot
case-invariant.  And, we should add calls of
set-upcase-syntax and set-downcase-syntax for them in
characters.el.

UnicodeData.txt defines them as this (the last three fields
are for upcase, downcase, titlecase, and 0049 == `I', 0069
== `i'):

0130;LATIN CAPITAL LETTER I WITH DOT ABOVE;Lu;0;L;0049 0307;;;;N;LATIN CAPITAL 
LETTER I DOT;;;0069;
0131;LATIN SMALL LETTER DOTLESS I;Ll;0;L;;;;;N;;;0049;;0049

>>  I'd like to ask it to someone whose native lang. uses
>>  case-sentive characters.  Or, at least, I need a help of an
>>  expert on this matter.  As my knowledge about case-sentive
>>  characters is only for English (only one-to-one, and no
>>  diacriticals), I may overlook many cases to be considered
>>  (e.g. I didn't know about the sigma case).

> The special cases are listed in one of the Unicode URLs I gave.  I'm
> not sure it necessary to consult an expert except to find out how
> important difficult cases are.

I've read that file, but I don't know what is an appropriate
case-conversion behaviour of Emacs, for instance, for such
characters that don't have corresponding precomposed
uppercase characters.  This problem is also related to how
to handle various kinds of normalization forms in Emacs.

---
Ken'ichi HANDA
address@hidden




reply via email to

[Prev in Thread] Current Thread [Next in Thread]