[wdiff-bugs] Bug#553490: wdiff: Does not handle UTF-8 properly (fwd)

From: Santiago Vila
Subject: [wdiff-bugs] Bug#553490: wdiff: Does not handle UTF-8 properly (fwd)
Date: Thu, 20 Oct 2011 13:02:34 +0200 (CEST)
User-agent: Alpine 2.00 (DEB 1167 2008-08-23)


I received this from the Debian bug system.
I've checked and the current version (1.0.1) still shows the bug.
[ Please keep the Cc: lines when replying, thanks ].

[ Apologies to the submitter for taking so long to process this ]

---------- Forwarded message ----------
From: Josh Triplett <address@hidden>
To: Debian Bug Tracking System <address@hidden>
Date: Sat, 31 Oct 2009 11:39:08 -0700
Subject: wdiff: Does not handle UTF-8 properly

Package: wdiff
Version: 0.5-19
Severity: normal

"wdiff -a" uses backspace and overstrike to provide emphasis; thus, it
will emphasize 'x' by printing 'x^Hx'.  When it encounters a multibyte
UTF-8 character, it does this for each byte rather than for each character;
thus, emphasis of <E2><80><99> (U+2019 RIGHT SINGLE QUOTATION MARK)
looks like '<E2>^H<E2><80>^H<80><99>^H<99>', when it should look
like '<E2><80><99>^H<E2><80><99>'.
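
To make the difference concrete, here is a minimal Python sketch of the two
behaviours described above.  It is not wdiff's code (wdiff is written in C);
the function names are hypothetical, and it assumes the input is valid UTF-8.
It groups each lead byte with its continuation bytes (those of the form
10xxxxxx) and overstrikes the group as a unit.

```python
BACKSPACE = b"\x08"  # shown as ^H in the report


def overstrike_per_byte(data: bytes) -> bytes:
    """Behaviour reported as the bug: every byte is overstruck on its own."""
    return b"".join(bytes([b]) + BACKSPACE + bytes([b]) for b in data)


def overstrike_per_char(data: bytes) -> bytes:
    """Behaviour the report asks for: overstrike whole UTF-8 characters.

    Assumes `data` is valid UTF-8; no validation is attempted here.
    """
    out = bytearray()
    i = 0
    while i < len(data):
        j = i + 1
        # Continuation bytes have the bit pattern 10xxxxxx; keep them
        # attached to the lead byte that precedes them.
        while j < len(data) and (data[j] & 0xC0) == 0x80:
            j += 1
        char = data[i:j]
        out += char + BACKSPACE + char
        i = j
    return bytes(out)


if __name__ == "__main__":
    quote = "\u2019".encode("utf-8")  # bytes E2 80 99
    print(overstrike_per_byte(quote))
    print(overstrike_per_char(quote))
```

This only groups bytes into code points.  Combining sequences and
double-width characters would need further handling that this sketch does
not attempt.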

- Josh Triplett


