wdiff-bugs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Japanese '者' (U+8005) is replaced with \350\200


From: OKUMURA, Akira
Subject: Re: Japanese '者' (U+8005) is replaced with \350\200
Date: Sun, 10 Jan 2021 11:49:31 +0900

Dear Karl and Denver,

I am attaching a tar file just in case.

I found that the issue cannot be reproduced on my CentOS 7 virtual machine, on 
which wdiff was installed with yum.

-------------------------
$ uname -a
Linux localhost.localdomain 3.10.0-1062.el7.x86_64 #1 SMP Wed Aug 7 18:08:02 
UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
$ wdiff --version
wdiff (GNU wdiff) 1.2.2

Copyright (C) 1992, 1997, 1998, 1999, 2009, 2010, 2011, 2012 Free Software
Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Written by Franc,ois Pinard <pinard@iro.umontreal.ca>.
-------------------------

But the issue appears only on my macOS 10.15.7, on which wdiff was installed 
via Homebrew. Here is the minimum reproducible example. I guess the root cause 
exists somewhere in macOS libraries (or maybe GNU BSD library difference?) and 
it is probably beyond your maintenance coverage... 

$ cat input1_min.txt
者あ
$ cat input2_min.txt
あ者
$ wdiff input1_min.txt input2_min.txt
[-者あ-]{+あ?+}?

$ uname -a 
Darwin oxon2017.local 19.6.0 Darwin Kernel Version 19.6.0: Tue Nov 10 00:10:30 
PST 2020; root:xnu-6153.141.10~1/RELEASE_X86_64 x86_64
$ otool -L `which wdiff`                        
/usr/local/bin/wdiff:
        /usr/local/opt/gettext/lib/libintl.8.dylib (compatibility version 
10.0.0, current version 10.6.0)
        
/System/Library/Frameworks/CoreFoundation.framework/Versions/A/CoreFoundation 
(compatibility version 150.0.0, current version 1670.10.0)
        /usr/lib/libncurses.5.4.dylib (compatibility version 5.4.0, current 
version 5.4.0)
        /usr/lib/libSystem.B.dylib (compatibility version 1.0.0, current 
version 1281.0.0)

$ wdiff --version
wdiff (GNU wdiff) 1.2.2

Copyright (C) 1992, 1997, 1998, 1999, 2009, 2010, 2011, 2012 Free Software
Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Written by Franc,ois Pinard <pinard@iro.umontreal.ca>.

-- 
OKUMURA, Akira oxon@mac.com / oxon@nagoya-u.jp
⌘ Junior Associate Professor at
- Institute for Space–Earth Environmental Research (ISEE)
- Kobayashi–Maskawa Institute for the Origin of Particles and the Universe (KMI)
Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan
Office/Lab/Fax: +81 (0)52-789-4320/4324/4313
skype:okumura.akira

Attachment: wdiff.tar
Description: Unix tar archive

> On Jan 10, 2021, at 10:53, Denver Gingerich <denver@ossguy.com> wrote:
> 
> On Fri, Jan 08, 2021 at 07:35:19PM -0700, Karl Berry wrote:
>> In general, since the last release of wdiff was in 2014, it would not be
>> surprising if there were bugs in the utf-8 handling in the support code
>> or in wdiff itself.  Denver or Martin, are you there? --best, karl.
> 
> Yes, I'm here.  I don't recall working on anything related to UTF-8 support 
> during my time updating wdiff, nor do I recall specifically testing it.  So I 
> agree with Karl that it would not surprise me if there were bugs in how it 
> handles UTF-8.
> 
> Also, I only saw the two messages from Karl in this thread - please do let me 
> know if I'm missing some other context.  Thanks!
> 
> Denver
> https://jmp.chat/


reply via email to

[Prev in Thread] Current Thread [Next in Thread]