emacs-devel

Re: find-composition still depends on the composition property


From: Eli Zaretskii
Subject: Re: find-composition still depends on the composition property
Date: Wed, 22 Oct 2008 21:35:40 +0200

> From: Kenichi Handa <address@hidden>
> Date: Wed, 22 Oct 2008 14:29:47 +0900
> Cc: address@hidden, address@hidden
> 
> In article <address@hidden>, Kenichi Handa <address@hidden> writes:
> 
> > Ah, it's not a bug of composition, but a bug of scan_words
> > (syntax.c).  Currently U+301 is labeled as `latin' script,
> > and the surrounding characters there are `cyrillic' script.
> > Thus, that function thinks that there's a word boundary.
> > I'll find a way to solve this problem.
> 
> I've just installed a fix.

Thanks, but Emacs still does not get this quite right.  For example,
in the following line:

  אבגדה12345

which mixes Hebrew letters with digits, M-f stops at the first digit,
whereas in this line:

  abcde12345

it does not.  The latter behavior is correct, the former is not.  (I'm
ashamed to admit that even MS Word gets it right.)

I understand that the way to fix this would be to install more
entries in word-combining-categories, but more infrastructure seems to
be missing, since right now no characters have the "Hebrew" category,
for example (at least judging by the output of describe-categories).
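
To make the idea concrete, here is a rough sketch of what I have in
mind, not a proposed patch.  It assumes that `w' is the mnemonic of the
"Hebrew" category and that ASCII digits carry the `a' (ASCII) category;
the function name is made up for illustration.  It works on a copy of
the category table in a temporary buffer and let-binds
word-combining-categories, so nothing global is changed:

```elisp
;; Sketch only: `my/hebrew-digit-word-end' is a hypothetical helper.
;; Assumptions: category `w' means "Hebrew", ASCII digits are in `a'.
(defun my/hebrew-digit-word-end (text)
  "Insert TEXT in a scratch buffer and return point after one `forward-word'.
Hebrew letters are put in category `w' of a private category table,
and the pairs (w . a) and (a . w) are added to
`word-combining-categories' for the duration of the call."
  (with-temp-buffer
    (let ((table (copy-category-table))
          (word-combining-categories
           (append '((?w . ?a) (?a . ?w)) word-combining-categories)))
      ;; Use the private copy so the standard table stays untouched.
      (set-category-table table)
      ;; U+05D0..U+05EA are the Hebrew letters alef through tav.
      (modify-category-entry '(#x05D0 . #x05EA) ?w table)
      (insert text)
      (goto-char (point-min))
      (forward-word 1)
      (point))))
```

Evaluating (my/hebrew-digit-word-end "אבגדה12345") then shows where M-f
would stop with those two extra entries in place; whether that is
enough depends on how scan_words consults the categories.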

By the way, I'd suggest moving the legend generated by
describe-categories to the beginning of the buffer, because the buffer
is huge and it does not say anywhere at the beginning that there's a
legend at the end.  Without the legend, the buffer looks like a large
pile of gibberish.

And another wish: can we have word-combining-categories and
word-separating-categories display their elements with human-readable
letters, not as their ASCII codes?  (Quick: what letter is code 94?)
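
Something along these lines would already help.  This is only a sketch
with made-up function names; it assumes each element of the list is a
cons whose car and cdr are either a category character or nil:

```elisp
;; Sketch only: both `my/' functions are hypothetical helpers.
(defun my/category-mnemonic (cat)
  "Return CAT as a one-character string, or CAT itself if not a character."
  (if (characterp cat) (string cat) cat))

(defun my/readable-category-pairs (pairs)
  "Return a copy of PAIRS with category codes shown as mnemonic strings.
PAIRS has the shape of `word-combining-categories': a list of
conses (CAT1 . CAT2).  Elements that are not conses are kept as-is."
  (mapcar (lambda (pair)
            (if (consp pair)
                (cons (my/category-mnemonic (car pair))
                      (my/category-mnemonic (cdr pair)))
              pair))
          pairs))
```

With that, (my/readable-category-pairs word-combining-categories)
prints entries such as ("^" . nil) instead of (94), which also answers
the quiz above: code 94 is `^'.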




