[bug #62618] utf8->string and string-foldcase do normalization by default
From: Taylor R. Campbell
Subject: [bug #62618] utf8->string and string-foldcase do normalization by default
Date: Sun, 12 Jun 2022 11:43:07 -0400 (EDT)
URL:
<https://savannah.gnu.org/bugs/?62618>
Summary: utf8->string and string-foldcase do normalization by
default
Project: MIT/GNU Scheme
Submitter: riastradh
Submitted: Sun 12 Jun 2022 03:43:05 PM UTC
Category: runtime
Severity: 3 - Normal
Priority: 5 - Normal
Item Group: Incorrect behavior
Status: None
Privacy: Public
Assigned to: None
Originator Name:
Originator Email:
Open/Closed: Open
Discussion Lock: Any
Keywords:
_______________________________________________________
Follow-up Comments:
-------------------------------------------------------
Date: Sun 12 Jun 2022 03:43:05 PM UTC By: Taylor R. Campbell <riastradh>
The UTF-8 decoding algorithm and Unicode case-folding algorithm do not involve
normalization, but the utf8->string and string-foldcase procedures always
normalize into NFC.
This is because they use the default mode of string-builder, which is nfc.
Using (builder 'immutable) instead of (builder) would fix the problem, but I
tend to think that the string builder should default to no normalization
unless you ask for it.
In any case, there should also be automated tests of utf8->string,
string-foldcase, and related procedures to catch this.
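
A check along these lines might serve as a starting point for such a test.
This is only a sketch: utf8-round-trips? is a hypothetical helper name, not
an existing MIT/GNU Scheme procedure, and it assumes that string->utf8 itself
encodes without normalizing. It covers only utf8->string; a similar check for
string-foldcase would need an input string that is known not to be in NFC.

```scheme
;; Hypothetical helper (not part of MIT/GNU Scheme): returns #t if decoding
;; the bytes and re-encoding the result reproduces the input exactly.
(define (utf8-round-trips? bytes)
  (equal? (string->utf8 (utf8->string bytes)) bytes))

;; "e" (U+0065) followed by COMBINING ACUTE ACCENT (U+0301), as UTF-8.
;; NFC would compose the pair into U+00E9, which encodes as #xC3 #xA9.
(define decomposed-bytes (bytevector #x65 #xCC #x81))

;; Should be #t if utf8->string does not normalize; #f would indicate that
;; the decoded string was composed into NFC along the way.
(utf8-round-trips? decomposed-bytes)
```

Comparing bytes rather than strings keeps the check independent of how string
equality treats canonically equivalent sequences.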
_______________________________________________________
Reply to this item at:
<https://savannah.gnu.org/bugs/?62618>
_______________________________________________
Message sent via Savannah
https://savannah.gnu.org/