[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [bug-gettext] [bug #52932] XML doesn’t default to UTF-8

From: Roumen Petrov
Subject: Re: [bug-gettext] [bug #52932] XML doesn’t default to UTF-8
Date: Sat, 20 Jan 2018 17:37:39 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:33.0) Gecko/20100101 Firefox/33.0 SeaMonkey/2.30

Bruno Haible wrote:
Update of bug #52932 (project gettext):

                   Status:                    None => Need Info


Follow-up Comment #1:

Why? Why make an assumption about the encoding (that assumption can be wrong)
- thus possibly produce an file in another encoding than the one the caller
expects? When we have a way to get away without this assumption and produce a
valid and unambiguous XML always.

There is no exactly default encoding for xml as specification requires xml processor to support UTF-8 and UTF-16 encoding - for more details see chapter https://www.w3.org/TR/xml/#charencoding .

Next encoding is required if is not UTF-8 or UTF-16. Actually is more complicated. Let me quote from specification (same chapter) : "In the absence of external character encoding information (such as MIME headers), parsed entities which are stored in an encoding other than UTF-8 or UTF-16 /MUST/ begin with a text declaration...."

So if there is no explicit specification of according to standard encoding is either UTF-8 or UTF-16.


reply via email to

[Prev in Thread] Current Thread [Next in Thread]