[Lynx-dev] mis-handling of combining characters
From: Thorsten Glaser
Subject: [Lynx-dev] mis-handling of combining characters
Date: Fri, 11 Aug 2023 02:32:36 +0000 (UTC)
Hi,
in the attached excerpt, stripped down to nearly the minimum needed
to reproduce it, the combining characters in the “Zalgo” part seem
to be counted toward the line width, so the lines break too early:
“Ȩ̶̛ņ̶̛Ơ̶̙U̶̡
g̵̢H̷̳” Shikamaru entoned with fury. His were
black pools that seemed to be dripping, melting.
“ I am exacting my
V̵̡e̷̢ņ̷̤G̴̨
̺̠ȩ̴̣Ą̶̱N̶̛
̀̓C̴̢E̴̡” Shikamaru, no, the
shadows spoke.
(this time in an 80c-wide xterm at Ian’s special request)
I would expect (first line possibly justified with extra spaces):
“Ȩ̶̛ņ̶̛Ơ̶̙U̶̡g̵̢H̷̳” Shikamaru entoned with fury. His were black pools that
seemed to be dripping, melting.
“ I am exacting my V̵̡e̷̢ņ̷̤G̴̨ȩ̴̣Ą̶̱N̶̛C̴̢E̴̡” Shikamaru, no, the shadows
spoke.
I stripped all the charset identifiers when I dropped /html/head,
so make sure Lynx assumes UTF-8 when testing with the attached file.
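
To illustrate the expected behaviour, here is a minimal sketch in Python
(not Lynx's actual C code; the names display_width and wrap are
hypothetical, made up for this example). It assumes that nonspacing and
enclosing marks (Unicode categories Mn and Me) occupy no columns, and it
greedily wraps on spaces using that width instead of the codepoint count:

```python
import unicodedata

def display_width(text):
    """Count terminal columns, treating combining marks as zero-width."""
    width = 0
    for ch in text:
        # Assumption: nonspacing (Mn) and enclosing (Me) marks take no column.
        if unicodedata.category(ch) in ("Mn", "Me"):
            continue
        # Wide and fullwidth East Asian characters take two columns.
        width += 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
    return width

def wrap(text, columns=80):
    """Greedy word wrap based on display width, not codepoint count."""
    lines, current, current_width = [], [], 0
    for word in text.split():
        w = display_width(word)
        # A joining space is needed unless the word starts the line.
        extra = w if not current else w + 1
        if current and current_width + extra > columns:
            lines.append(" ".join(current))
            current, current_width = [word], w
        else:
            current.append(word)
            current_width += extra
    if current:
        lines.append(" ".join(current))
    return lines
```

With this, a base letter followed by any number of combining marks counts
as one column, so the “Zalgo” words no longer push the line over the
limit. A wrapper that counts codepoints (or bytes) instead would break
early in the way shown above.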
bye,
//mirabilos
--
As long as you don't pull any dirty tricks, and I mean *really*
dirty tricks, such as XORing both pointers of a doubly linked list
and storing them in a single word, Boehm works quite
excellently. -- Andreas Bogk on boehm-gc in d.a.s.r
x.htm
Description: Text document