bug-coreutils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: tr not respecting UTF-8 locale ?


From: Paul Eggert
Subject: Re: tr not respecting UTF-8 locale ?
Date: Mon, 11 Oct 2004 16:52:59 -0700
User-agent: Gnus/5.1006 (Gnus v5.10.6) Emacs/21.3 (gnu/linux)

Michał Kosmulski <address@hidden> writes:

> I am using a UTF-8 locale and all coreutils except for tr seem to
> agree on that. However, tr behaves differently and seems to always
> assume that 1 byte == 1 character even in a UTF-8 locale.

Most coreutils utilities do not support UTF-8 in general, and
tr is no exception.

POSIX does not require support for multibyte locales, so this is not a
standards-conformance issue, though of course it is a problem.

Personally, I don't even understand what tr is _supposed_ to do with
UTF-8 in general.  For example, I don't really understand the
difference between "tr -c" and "tr -C" as specified by POSIX.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]