lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Lynx-dev] mis-handling of combining characters


From: Thorsten Glaser
Subject: [Lynx-dev] mis-handling of combining characters
Date: Fri, 11 Aug 2023 02:32:36 +0000 (UTC)

Hi,

in the attached excerpt, stripped down to almost the minimum to
experience it, the combining characters in the “Zalgo” part seem
to be considered as adding to the line width, therefore breaking
too early:

   “Ȩ̶̛ņ̶̛Ơ̶̙U̶̡
   g̵̢H̷̳” Shikamaru entoned with fury. His were
   black pools that seemed to be dripping, melting.


   “ I am exacting my
   V̵̡e̷̢ņ̷̤G̴̨
   ̺̠ȩ̴̣Ą̶̱N̶̛
   ̀̓C̴̢E̴̡” Shikamaru, no, the
   shadows spoke.

(this time in an 80c-wide xterm at Ian’s special request)

I would expect (first line possibly justified with extra spaces):

   “Ȩ̶̛ņ̶̛Ơ̶̙U̶̡g̵̢H̷̳” Shikamaru entoned with fury. His were black pools that
   seemed to be dripping, melting.

   “ I am exacting my V̵̡e̷̢ņ̷̤G̴̨ȩ̴̣Ą̶̱N̶̛C̴̢E̴̡” Shikamaru, no, the shadows 
spoke.

I stripped all the charset identifiers when I dropped /html/head
so make sure it assumes UTF-8 when testing with the attached file.

bye,
//mirabilos
-- 
Solange man keine schmutzigen Tricks macht, und ich meine *wirklich*
schmutzige Tricks, wie bei einer doppelt verketteten Liste beide
Pointer XORen und in nur einem Word speichern, funktioniert Boehm ganz
hervorragend.           -- Andreas Bogk über boehm-gc in d.a.s.r

Attachment: x.htm
Description: Text document


reply via email to

[Prev in Thread] Current Thread [Next in Thread]