[Tinycc-devel] BUG: wide char in wide string literal handled incorrectly

tinycc-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Tinycc-devel] BUG: wide char in wide string literal handled incorrectly

From:	张博洋
Subject:	[Tinycc-devel] BUG: wide char in wide string literal handled incorrectly
Date:	Wed, 30 Aug 2017 15:30:55 +0800
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.2.1

Hello,

I found that when TCC processing wide string literal, it behaves likedirectly casting each char in original file to wchar_t and store them inwide string. This will work for ASCII chars. However, it might not workfor real wide chars. For example:The Euro-sign (€, U+20AC) stored in UTF-8 is "E2 82 AC". In GCC, thischar stored in wide string will be "000020AC". However, in TCC, thischar is stored as 3 wide chars "000000E2 00000082 000000AC".I provided a patch, a test program and two screenshots that describethis problem, they are in attachments. I solve this problem by makingassumptions that input charset is UTF-8. Although it's not a perfectsolution, it's still better than "directly casting char to wchar_t". I'mwondering if that is appropriate, so please review the code carefully.


Thanks
Zhang Boyang

after-patch.png
Description: PNG image

before-patch.png
Description: PNG image

test.c
Description: Text Data

utf8.patch
Description: Text Data

[Prev in Thread]

Current Thread

[Next in Thread]

[Tinycc-devel] BUG: wide char in wide string literal handled incorrectly, 张博洋 <=
- Re: [Tinycc-devel] BUG: wide char in wide string literal handled incorrectly, Christian Jullien, 2017/08/31
- [Tinycc-devel] BUG: wide char in wide string literal handled incorrectly, 张博洋, 2017/08/30

Prev by Date: Re: [Tinycc-devel] BUG: called function should pop the arguments when using fastcall
Next by Date: [Tinycc-devel] BUG: wide char in wide string literal handled incorrectly
Previous by thread: [Tinycc-devel] BUG: called function should pop the arguments when using fastcall
Next by thread: Re: [Tinycc-devel] BUG: wide char in wide string literal handled incorrectly
Index(es):
- Date
- Thread