[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-devel] [RFC PATCH 1/5] checkpatch: add a check for utf-8 in co
From: |
Thomas Huth |
Subject: |
Re: [Qemu-devel] [RFC PATCH 1/5] checkpatch: add a check for utf-8 in commit logs |
Date: |
Mon, 30 Jan 2017 16:57:53 +0100 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.6.0 |
On 30.01.2017 15:12, Stefan Hajnoczi wrote:
> On Thu, Jan 26, 2017 at 02:11:01PM +0100, Thomas Huth wrote:
>> This patch is a port of the following commit from the Linux kernel:
>>
>> commit 15662b3e8644905032c2e26808401a487d4e90c1
>> Author: Joe Perches <address@hidden>
>> Date: Mon Oct 31 17:13:12 2011 -0700
>>
>> checkpatch: add a --strict check for utf-8 in commit logs
>>
>> Some find using utf-8 in commit logs inappropriate.
>>
>> Some patch commit logs contain unintended utf-8 characters when doing
>> things like copy/pasting compilation output.
>>
>> Look for the start of any commit log by skipping initial lines that look
>> like email headers and "From: " lines.
>>
>> Stop looking for utf-8 at the first signature line.
>>
>> Signed-off-by: Joe Perches <address@hidden>
>> Suggested-by: Andrew Morton <address@hidden>
>> Cc: Andy Whitcroft <address@hidden>
>> Signed-off-by: Andrew Morton <address@hidden>
>> Signed-off-by: Linus Torvalds <address@hidden>
>>
>> Signed-off-by: Thomas Huth <address@hidden>
>> ---
>> scripts/checkpatch.pl | 30 ++++++++++++++++++++++++++----
>> 1 file changed, 26 insertions(+), 4 deletions(-)
>
> This patch prevents including names with non-ASCII characters in the
> commit description. Some people care about the proper spelling of their
> names.
>
> Allowing UTF-8 in Signed-off-by and other headers isn't enough.
Right, and I guess the folks from the kernel checkpatch noticed this,
too. That's likely why the next patch restricts this check to only
happen if the patch is from a mail with non-UTF-8 content type, but
still contains UTF-8 characters, i.e. there is really something fishy
with the character set of the patch description. Maybe I should squash
the two patches together, so that it is more obvious what is going on here?
Thomas
signature.asc
Description: OpenPGP digital signature
- [Qemu-devel] [RFC PATCH 0/5] checkpatch: Test bad UTF-8 encodings and updates to MAINTAINERS, Thomas Huth, 2017/01/26
- [Qemu-devel] [RFC PATCH 1/5] checkpatch: add a check for utf-8 in commit logs, Thomas Huth, 2017/01/26
- [Qemu-devel] [RFC PATCH 2/5] checkpatch: check utf-8 content from a commit log when it's missing from charset, Thomas Huth, 2017/01/26
- [Qemu-devel] [RFC PATCH 4/5] checkpatch: emit a reminder about MAINTAINERS on file add/move/delete, Thomas Huth, 2017/01/26
- [Qemu-devel] [RFC PATCH 3/5] checkpatch: ignore email headers better, Thomas Huth, 2017/01/26
- [Qemu-devel] [RFC PATCH 5/5] checkpatch: reduce MAINTAINERS update message frequency, Thomas Huth, 2017/01/26