|
From: | Paul Eggert |
Subject: | [bug-diffutils] bug#31185: bug#31185: Why is there no full support for Unicode? |
Date: | Tue, 17 Apr 2018 00:37:18 -0700 |
User-agent: | Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 |
Keepun wrote:
Files with encoding greater than 8 bits without BOM at the beginning can be immediately identified as binary.
No, the BOM is not required or recommended in UTF-8, so it would be a mistake to identify GNU/Linux text files as binary merely because they lack a BOM. Typically these files do not have a BOM, and when they do one of the first things many users do is remove the BOM because it can cause trouble in practice.
Diffutils does not support UTF-16, where a BOM would make more sense, and there are no plans to add support for UTF-16 (or for UTF-32, for that matter).
[Prev in Thread] | Current Thread | [Next in Thread] |