guile-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: SRFI-14 and locale settings


From: Neil Jerram
Subject: Re: SRFI-14 and locale settings
Date: Mon, 04 Sep 2006 07:41:19 +0100
User-agent: Gnus/5.1007 (Gnus v5.10.7) Emacs/21.4 (gnu/linux)

address@hidden (Ludovic Courtès) writes:

> Hi,
>
> SRFI-14 doesn't take into account the current locale, mostly because
> `scm_init_srfi_14 ()' gets invoked before the user has had any chance to
> run code like `(setlocale ...)'.  Thus, for instance, `char-set:letter'
> is always initialized with the English set of letters.
>
> Since SRFI-13 is initialized in core Guile, SRFI-14 needs to be
> initialized there too.  But do you guys have an idea of how we could
> work around this?

Here's what SRFI 14 says about char-set:letter:

  char-set:letter

  In Unicode, a letter is any character with one of the letter
  categories (Lu, Ll, Lt, Lm, Lo) in the Unicode character database.

  There are 52 ASCII letters
  abcdefghijklmnopqrstuvwxyz
  ABCDEFGHIJKLMNOPQRSTUVWXYZ

  There are 117 Latin-1 letters. These are the 115 characters that are
  members of the Latin-1 char-set:lower-case and char-set:upper-case
  sets, plus

  00AA  FEMININE ORDINAL INDICATOR
  00BA  MASCULINE ORDINAL INDICATOR

  (These two letters are considered lower-case by Unicode, but not by
  Java or SRFI 14.)

My reading of this is that it is trying to be locale-independent,
based on Unicode category definitions.  Isn't that correct?

(It may of course be that Guile's current implementation doesn't
return the complete set that is implied by this definition, because
it's bugged or because we don't have Unicode support yet, but that's a
different kind of problem.)

Regards,
     Neil





reply via email to

[Prev in Thread] Current Thread [Next in Thread]