bug#19242: latest grep considers text files as binary

bug-grep



From:	Thomas Wolff
Subject:	bug#19242: latest grep considers text files as binary
Date:	Fri, 05 Dec 2014 10:58:49 +0100
User-agent:	Mozilla/5.0 (Windows NT 6.1; WOW64; rv:31.0) Gecko/20100101 Thunderbird/31.3.0

Paul Eggert wrote:

the mentioned patches are apparently intended to fix issues in non-UTF-8 locales.
No, they're also needed for UTF-8 locales I'm afraid. There are some security issues, not only having to do with grep's internals, but also for the behavior of downstream programs that may be expecting UTF-8 text.
You can work around the problem with 'grep -a'.

I was aware of this workaround but I claim it should not be needed because the files affected are in fact not binary files but text files. The manual clearly says about -a: "Process a binary file as if it were text" but partial content in a different text encoding does not make a file binary.


Jim Meyering wrote:

  this is due to documented and desirable behavior.

I deny this is desirable behavior and I doubt there is a security issue as described. If any other, independent software has a security issue with non-UTF-8 input, it should decide itself to filter it and use accordingly stable decoding functions. It cannot be the task of any tool (grep in this case) to filter output to work around possible security issues in other programs in a pipe. This would be completely against the concept of pipes in the Unix tradition.

Honestly I think this is another case of practical usefulness losing against dogma in software design.


Kind regards,
Thomas

